With computing systems becoming ubiquitous, numerous data sets of extremely large size are becoming available for analysis. Often the data collected have complex, graph based structures, which makes them difficult to process with traditional tools. Moreover, the irregularities in the data sets, and in the analysis algorithms, hamper the scaling of performance in large distributed highperformance systems, optimized for locality exploitation and regular data structures. In this paper we present an approach to system design that enable efficient execution of applications with irregular memory patterns on a distribute, many-core architecture, based on off-the-shelf cores. We introduce a set of hardware and software components, which provide a d...
Applications that exhibit irregular, dynamic, and unbalanced parallelism are grow-ing in number and ...
This project studies FPGA-based heterogeneous computing architectures with the objective of discove...
Applications running on custom architectures with hundreds of specialized processing elements (PEs) ...
With computing systems becoming ubiquitous, numerous data sets of extremely large size are becoming ...
The recent emergence of large-scale knowledge discovery, data mining and social network analysis, ir...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
The performance gap between CPUs, and memory memory has diverged significantly since the 1980's maki...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
Restricted until 13 Feb. 2011.Field-programmable gate array (FPGA) is a representative reconfigurabl...
Irregular workloads are programs organized around pointer-based data structures such as graphs. The...
Algorithms that exhibit irregular memory access patterns are known to show poor performance on multi...
Over the past few years there has been increased interest in building custom computing machines (CCM...
The work presented in this thesis focuses on the exploration of alternative architectures and comput...
We propose an intermediate approach between full custom hardware systems and full-software tools. Fi...
Processing large-scale graphs is challenging due to the nature of the computation that causes irreg...
Applications that exhibit irregular, dynamic, and unbalanced parallelism are grow-ing in number and ...
This project studies FPGA-based heterogeneous computing architectures with the objective of discove...
Applications running on custom architectures with hundreds of specialized processing elements (PEs) ...
With computing systems becoming ubiquitous, numerous data sets of extremely large size are becoming ...
The recent emergence of large-scale knowledge discovery, data mining and social network analysis, ir...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
The performance gap between CPUs, and memory memory has diverged significantly since the 1980's maki...
The last two decade has witnessed two opposing hardware trends where the DRAM capacity and the acces...
Restricted until 13 Feb. 2011.Field-programmable gate array (FPGA) is a representative reconfigurabl...
Irregular workloads are programs organized around pointer-based data structures such as graphs. The...
Algorithms that exhibit irregular memory access patterns are known to show poor performance on multi...
Over the past few years there has been increased interest in building custom computing machines (CCM...
The work presented in this thesis focuses on the exploration of alternative architectures and comput...
We propose an intermediate approach between full custom hardware systems and full-software tools. Fi...
Processing large-scale graphs is challenging due to the nature of the computation that causes irreg...
Applications that exhibit irregular, dynamic, and unbalanced parallelism are grow-ing in number and ...
This project studies FPGA-based heterogeneous computing architectures with the objective of discove...
Applications running on custom architectures with hundreds of specialized processing elements (PEs) ...