When accessing streams in vector multiprocessor machines, degradation in the interconnection network and conflicts in the memory modules are the factors that reduce the efficiency of the system. In this paper, we present a synchronous access mechanism that allows conflict-free access to streams in a SIMD vector multiprocessor system. Each processor accesses the corresponding elements out of order, in such a way that in each cycle the requested elements do not collide in the interconnection network. Moreover, memory modules are accessed so that conflicts are avoided. The use of the proposed mechanism in present-day architectures would allow conflict-free access to streams with the most common strides that appear in real applications. The add...
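As background for the module conflicts mentioned above, the following is a minimal sketch, not the mechanism proposed in the paper. It assumes plain low-order interleaving, where the element at address `a` resides in module `a mod M`; the excerpt does not state the paper's actual memory mapping or its reordering scheme. The function names `modules_touched` and `distinct_modules` are hypothetical and chosen only for illustration.

```python
from math import gcd


def modules_touched(stride, num_modules, length, base=0):
    """Module index hit by each element of a strided stream.

    Assumption: low-order interleaving, i.e. module = address mod num_modules.
    """
    return [(base + i * stride) % num_modules for i in range(length)]


def distinct_modules(stride, num_modules):
    """Number of distinct modules visited by a long stream with this stride.

    Under the interleaving assumed above, a stride S over M modules cycles
    through exactly M / gcd(S, M) modules.
    """
    return num_modules // gcd(stride, num_modules)


if __name__ == "__main__":
    M = 8  # hypothetical number of memory modules
    for s in (1, 2, 3, 4, 8):
        print(f"stride {s}: {distinct_modules(s, M)} of {M} modules used")
```

Under this assumed mapping, odd strides spread an in-order stream over all eight modules, while a stride of 8 sends every request to the same module. This kind of concentration is what conflict-avoiding access orders aim to prevent.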
Vector supercomputers, which can process large amounts of vector data efficiently, are among the fas...
The purpose of this paper is to show that multi-threading techniques can be applied to a vector proc...
Presently, the highest performance computer systems are the vector processors which are mainly emplo...
The synchronized and simultaneous access to several vectors that form a single stream occurs in SIMD...
The poor bandwidth obtained from memory when conflicts arise in the modules or in the interconnectio...
The high latency of memory accesses is one of the factors that contribute most to reducing the perform...
Address transformation schemes, such as skewing and linear transformations, have been proposed to ac...
The performance of a vector processor accessing vectors is strongly dependent on the conflicts produ...
Most existing analytical models for memory interference generally assume random bank selection for e...
On many commercial supercomputers, several vector register processors share a global highly interlea...
A multiprocess program executing on a modern multiprocessor must issue explicit commands to...