textIn the past, we could rely on technology scaling and new micro-architectural techniques to improve the performance of processors. Nowadays, both of these methods are reaching their limits. The primary concern in future architectures with billions of transistors on a chip and limited power budgets is power/energy efficiency. Full-custom design of application-specific cores can yield up to two orders of magnitude better power efficiency over conventional general-purpose cores. However, a tremendous design effort is required in integrating a new accelerator for each new application. In this dissertation, we present the design of specialized compute fabrics that maintain the efficiency of full custom hardware while providing enough flexibil...
Technology scaling trends have enabled the exponential growth of computing power. However, the perfo...
UnrestrictedRecently, high-end computing systems have been introduced that employ reconfigurable har...
Matrix multiplication is a core building block for numerous scientific computing and, more recently,...
textIn the past, we could rely on technology scaling and new micro-architectural techniques to impro...
Achieving high-performance while reducing power consumption is the key question as tech-nology scali...
The emergence of multicore and heterogeneous architectures requires many linear algebra algorithms t...
Field Programmable Gate Arrays (FPGAs) enable powerful performance acceleration for scientific compu...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...
On the road to exascale computing, the gap between hardware peak performance and application perform...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
The high performance computing (HPC) community is obsessed over the general matrix-matrix multiply (...
The objective of high performance computing (HPC) is to ensure that the computational power of hardw...
In recent years, the field of high-performance computing has been facing a new challenge: achieving ...
Previous research has shown that the performance of any computation is directly related to the archi...
Technology scaling trends have enabled the exponential growth of computing power. However, the perfo...
UnrestrictedRecently, high-end computing systems have been introduced that employ reconfigurable har...
Matrix multiplication is a core building block for numerous scientific computing and, more recently,...
textIn the past, we could rely on technology scaling and new micro-architectural techniques to impro...
Achieving high-performance while reducing power consumption is the key question as tech-nology scali...
The emergence of multicore and heterogeneous architectures requires many linear algebra algorithms t...
Field Programmable Gate Arrays (FPGAs) enable powerful performance acceleration for scientific compu...
Achieving high computation efficiency, in terms of Cycles per Instruction (CPI), for high-performanc...
On the road to exascale computing, the gap between hardware peak performance and application perform...
Recent years have witnessed a tremendous surge of interest in accelerating sparse linear algebra app...
Dense linear algebra(DLA) is one of the most seven important kernels in high performance computing. ...
The high performance computing (HPC) community is obsessed over the general matrix-matrix multiply (...
The objective of high performance computing (HPC) is to ensure that the computational power of hardw...
In recent years, the field of high-performance computing has been facing a new challenge: achieving ...
Previous research has shown that the performance of any computation is directly related to the archi...
Technology scaling trends have enabled the exponential growth of computing power. However, the perfo...
UnrestrictedRecently, high-end computing systems have been introduced that employ reconfigurable har...
Matrix multiplication is a core building block for numerous scientific computing and, more recently,...