Automated code generation and performance tuning techniques for concurrent architectures such as GPUs, Cell and FPGAs can provide integer factor speedups over multi-core processor organizations for data-parallel, floating-point computation in SPICE model-evaluation. Our Verilog AMS compiler produces code for parallel evaluation of non-linear circuit models suitable for use in SPICE simulations where the same model is evaluated several times for all the devices in the circuit. Our compiler uses architecture specific parallelization strategies (OpenMP for multi-core, PThreads for Cell, CUDA for GPU, statically scheduled VLIW for FPGA) when producing code for these different architectures. We automatically explore different implementation con...
Circuit-level simulation is a computationally intensive problem that has proven to be particularly d...
Digital circuit simulation often requires a large amount of computation, resulting in long run times...
Many computationally intensive scientific applications involve repetitive floating point operations ...
Automated code generation and performance tuning techniques for concurrent architectures such as GPU...
Automated code generation and performance tuning tech-niques for concurrent architectures such as GP...
Spatial processing of sparse, irregular, double-precision floating-point computation using a single ...
SPICE, from the University of California, at Berkeley, is the de facto world standard for circuit si...
Due to ever increasing complexity of circuits, EDA tools and algorithms are demanding more computati...
Abstract—Single-FPGA spatial implementations can provide an order of magnitude speedup over sequenti...
In this paper, we developed a simulation-based architecture evaluation framework for field-programma...
ii Spatial processing of sparse, irregular floating-point computation using a single FPGA enables up...
Frequency scaling and precision reduction optimization of an FPGA accelerated SPICE circuit simulato...
Automated, offline precision-analysis of dataflow computation containing elementary functions (e.g. ...
Fine-grained dataflow processing of sparse Matrix-Solve computation (A~x = ~b) in the SPICE circuit ...
Fine-grained dataflow processing of sparse matrix-solve computation (Ax = b) in the SPICE circuit si...
Circuit-level simulation is a computationally intensive problem that has proven to be particularly d...
Digital circuit simulation often requires a large amount of computation, resulting in long run times...
Many computationally intensive scientific applications involve repetitive floating point operations ...
Automated code generation and performance tuning techniques for concurrent architectures such as GPU...
Automated code generation and performance tuning tech-niques for concurrent architectures such as GP...
Spatial processing of sparse, irregular, double-precision floating-point computation using a single ...
SPICE, from the University of California, at Berkeley, is the de facto world standard for circuit si...
Due to ever increasing complexity of circuits, EDA tools and algorithms are demanding more computati...
Abstract—Single-FPGA spatial implementations can provide an order of magnitude speedup over sequenti...
In this paper, we developed a simulation-based architecture evaluation framework for field-programma...
ii Spatial processing of sparse, irregular floating-point computation using a single FPGA enables up...
Frequency scaling and precision reduction optimization of an FPGA accelerated SPICE circuit simulato...
Automated, offline precision-analysis of dataflow computation containing elementary functions (e.g. ...
Fine-grained dataflow processing of sparse Matrix-Solve computation (A~x = ~b) in the SPICE circuit ...
Fine-grained dataflow processing of sparse matrix-solve computation (Ax = b) in the SPICE circuit si...
Circuit-level simulation is a computationally intensive problem that has proven to be particularly d...
Digital circuit simulation often requires a large amount of computation, resulting in long run times...
Many computationally intensive scientific applications involve repetitive floating point operations ...