The motivation of this research was to evaluate the main memory performance of a hybrid super computer such as the Convey HC-x, and ascertain how the controller performs in several access scenarios, vis-à-vis hand-coded memory prefetches. Such memory patterns are very useful in stencil computations. The theoretical bandwidth of the memory of the Convey is compared with the results of our measurements. The accurate study of the memory subsystem is particularly useful for users when they are developing their application-specific personality. Experiments were performed to measure the bandwidth between the coprocessor and the memory subsystem. The experiments aimed mainly at measuring the reading access speed of the memory from Application Engi...
The primary goal of the presented experiment was to judge the usefulness of FPGA technology in the s...
This paper describes a performance evaluation of Image-Processing applications on FPGA-based coproce...
In this paper, we report on a preliminary investigation of the potential performance gain of program...
The motivation of this research was to evaluate the main memory performance of a hybrid super comput...
FPGA-SoCs like Xilinx's Zynq-7000 and Altera's Generation 10 SoCs provide an integrated platform for...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
The ever-increasing complexity of high-performance computing applications limits performance due to ...
Abstract—Local memories increase the efficiency of hardware accelerators by enabling fast accesses t...
The complexity of today’s embedded applications requires mod-ern high-performance embedded System-on...
As designers and researchers strive to achieve higher performance, field-programmable gate arrays (F...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Abstract—Developing FPGA implementations with an input specification in a high-level programming lan...
The high potential performance of FPGAs cannot be exploited if a design suffers a memory bottleneck....
The decreasing cost of DRAM has made possible and grown the use of in-memory databases. However, mem...
Modern processors incorporate several performance monitoring units, which can be used to count event...
The primary goal of the presented experiment was to judge the usefulness of FPGA technology in the s...
This paper describes a performance evaluation of Image-Processing applications on FPGA-based coproce...
In this paper, we report on a preliminary investigation of the potential performance gain of program...
The motivation of this research was to evaluate the main memory performance of a hybrid super comput...
FPGA-SoCs like Xilinx's Zynq-7000 and Altera's Generation 10 SoCs provide an integrated platform for...
Many algorithms and applications in scientific computing exhibit irregular access patterns as consec...
The ever-increasing complexity of high-performance computing applications limits performance due to ...
Abstract—Local memories increase the efficiency of hardware accelerators by enabling fast accesses t...
The complexity of today’s embedded applications requires mod-ern high-performance embedded System-on...
As designers and researchers strive to achieve higher performance, field-programmable gate arrays (F...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Abstract—Developing FPGA implementations with an input specification in a high-level programming lan...
The high potential performance of FPGAs cannot be exploited if a design suffers a memory bottleneck....
The decreasing cost of DRAM has made possible and grown the use of in-memory databases. However, mem...
Modern processors incorporate several performance monitoring units, which can be used to count event...
The primary goal of the presented experiment was to judge the usefulness of FPGA technology in the s...
This paper describes a performance evaluation of Image-Processing applications on FPGA-based coproce...
In this paper, we report on a preliminary investigation of the potential performance gain of program...