Automated code generation and performance tuning tech-niques for concurrent architectures such as GPUs, Cell and FPGAs can provide integer factor speedups over multi-core processor organizations for data-parallel, floating-point com-putation in SPICE Model-Evaluation. Our Verilog AMS compiler produces code for parallel evaluation of non-linear circuit models suitable for use in SPICE simulations where the same model is evaluated several times for all the de-vices in the circuit. Our compiler uses architecture specific parallelization strategies (OpenMP for multi-core, PThreads for Cell, CUDA for GPU, statically scheduled VLIW for FPGA) when producing code for these different architec-tures. We automatically explore different implementation ...
In the past decade, FPGAs and GPUs have become increasingly common as hardware accelerators when dea...
In this paper, we developed a simulation-based architecture evaluation framework for field-programma...
Frequency scaling and precision reduction optimization of an FPGA accelerated SPICE circuit simulato...
Automated code generation and performance tuning techniques for concurrent architectures such as GPU...
Spatial processing of sparse, irregular, double-precision floating-point computation using a single ...
Abstract—Single-FPGA spatial implementations can provide an order of magnitude speedup over sequenti...
Due to ever increasing complexity of circuits, EDA tools and algorithms are demanding more computati...
Recently, FPGAs have been integrated into HPC clusters in order to boost their computational perform...
As CPU clock frequencies plateau and the doubling of CPU cores per processor ex-acerbate the memory ...
Fine-grained dataflow processing of sparse matrix-solve computation (Ax = b) in the SPICE circuit si...
ii Spatial processing of sparse, irregular floating-point computation using a single FPGA enables up...
SPICE, from the University of California, at Berkeley, is the de facto world standard for circuit si...
Fine-grained dataflow processing of sparse Matrix-Solve computation (A~x = ~b) in the SPICE circuit ...
In this paper, we report on a preliminary investigation of the potential performance gain of program...
The design cycle for complex special-purpose computing systems is extremely costly and time-consumin...
In the past decade, FPGAs and GPUs have become increasingly common as hardware accelerators when dea...
In this paper, we developed a simulation-based architecture evaluation framework for field-programma...
Frequency scaling and precision reduction optimization of an FPGA accelerated SPICE circuit simulato...
Automated code generation and performance tuning techniques for concurrent architectures such as GPU...
Spatial processing of sparse, irregular, double-precision floating-point computation using a single ...
Abstract—Single-FPGA spatial implementations can provide an order of magnitude speedup over sequenti...
Due to ever increasing complexity of circuits, EDA tools and algorithms are demanding more computati...
Recently, FPGAs have been integrated into HPC clusters in order to boost their computational perform...
As CPU clock frequencies plateau and the doubling of CPU cores per processor ex-acerbate the memory ...
Fine-grained dataflow processing of sparse matrix-solve computation (Ax = b) in the SPICE circuit si...
ii Spatial processing of sparse, irregular floating-point computation using a single FPGA enables up...
SPICE, from the University of California, at Berkeley, is the de facto world standard for circuit si...
Fine-grained dataflow processing of sparse Matrix-Solve computation (A~x = ~b) in the SPICE circuit ...
In this paper, we report on a preliminary investigation of the potential performance gain of program...
The design cycle for complex special-purpose computing systems is extremely costly and time-consumin...
In the past decade, FPGAs and GPUs have become increasingly common as hardware accelerators when dea...
In this paper, we developed a simulation-based architecture evaluation framework for field-programma...
Frequency scaling and precision reduction optimization of an FPGA accelerated SPICE circuit simulato...