UnrestrictedRecently, high-end computing systems have been introduced that employ reconfigurable hardware as application-specific hardware accelerators for general-purpose processors. These systems provide new opportunities for high-performance implementations of scientific applications. However, they also pose new design challenges, including utilization of available hardware resources, exploitation of multiple levels of memory, and hardware/software co-design.; In this work, we investigate high-performance designs for floating-point based linear algebra on reconfigurable computing systems. The operations studied are fundamental kernels for scientific computing, including dense and sparse matrix-vector multiplication, matrix multiplicatio...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...
UnrestrictedWith recent technological advances, it has become possible to use reconfigurable hardwar...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
Field Programmable Gate Arrays (FPGAs) enable powerful performance acceleration for scientific compu...
Achieving high-performance while reducing power consumption is the key question as tech-nology scali...
UnrestrictedThe large capacity of field programmable gate arrays (FPGAs) has prompted researchers to...
Technology scaling trends have enabled the exponential growth of computing power. However, the perfo...
The dissemination of multi-core architectures and the later irruption of massively parallel devices,...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...
In the last decade floating-point matrix multiplication on FPGAs has been studied extensively and ef...
textIn the past, we could rely on technology scaling and new micro-architectural techniques to impro...
Abstract. If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced ...
Basic Linear Algebra Subprograms (BLAS) and Linear Algebra Package (LAPACK) form basic building bloc...
Previous research has shown that the performance of any computation is directly related to the archi...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...
UnrestrictedWith recent technological advances, it has become possible to use reconfigurable hardwar...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
Field Programmable Gate Arrays (FPGAs) enable powerful performance acceleration for scientific compu...
Achieving high-performance while reducing power consumption is the key question as tech-nology scali...
UnrestrictedThe large capacity of field programmable gate arrays (FPGAs) has prompted researchers to...
Technology scaling trends have enabled the exponential growth of computing power. However, the perfo...
The dissemination of multi-core architectures and the later irruption of massively parallel devices,...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...
In the last decade floating-point matrix multiplication on FPGAs has been studied extensively and ef...
textIn the past, we could rely on technology scaling and new micro-architectural techniques to impro...
Abstract. If multicore is a disruptive technology, try to imagine hybrid multicore systems enhanced ...
Basic Linear Algebra Subprograms (BLAS) and Linear Algebra Package (LAPACK) form basic building bloc...
Previous research has shown that the performance of any computation is directly related to the archi...
Abstract. We present an efficient and scalable programming model for the development of linear algeb...
UnrestrictedWith recent technological advances, it has become possible to use reconfigurable hardwar...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...