For years, SIMD/vector units have enhanced the capabilities of modern CPUs in High-Performance Computing (HPC) and mobile technology. Typical commercially-available SIMD units process up to 8 double-precision elements with one instruction. The optimal vector width and its impact on CPU throughput due to memory latency and bandwidth remain challenging research areas. This study examines the behavior of four computational kernels on a RISC-V core connected to a customizable vector unit, capable of operating up to 256 double precision elements per instruction. The four codes have been purposefully selected to represent non-dense workloads: SpMV, BFS, PageRank, FFT. The experimental setup allows us to measure their performance while varying the...
This paper presents an experimental study on cache memory designs for vector computers. We use an ex...
SIMD accelerators are ubiquitous in microprocessors from different computing domains. Their high com...
The emergence of a new, open, and free instruction set architecture, RISC-V, has heralded a new era ...
Modern scientific applications are getting more diverse, and the vector lengths in those application...
In this paper, we present Ara, a 64-bit vector processor based on the version 0.5 draft of RISC-V's ...
Since the first vector supercomputers in the mid-1970’s, the largest scale applications have traditi...
The purpose of this paper is to show that multi-threading techniques can be applied to a vector proc...
A number of vector supercomputers feature very large memories. Unfortunately the large capacity memo...
This paper presents data confirming the fact that traditional vector architectures can not reduce th...
Vector processing has become commonplace in today's CPU microarchitectures. Vector instructions impr...
English: Power consumption has become one of the dominant issues in processor design, especially imp...
While parallel architectures based on clusters of Processing Elements (PEs) sharing L1 memory are wi...
The maturity level of RISC-V and the availability of domain-specific instruction set extensions, lik...
Vector extensions are a popular mean to exploit data parallelism in applications. Over recent years,...
Moore’s Law predicted that the number of transistors on a chip would double approximately every 2 ye...
This paper presents an experimental study on cache memory designs for vector computers. We use an ex...
SIMD accelerators are ubiquitous in microprocessors from different computing domains. Their high com...
The emergence of a new, open, and free instruction set architecture, RISC-V, has heralded a new era ...
Modern scientific applications are getting more diverse, and the vector lengths in those application...
In this paper, we present Ara, a 64-bit vector processor based on the version 0.5 draft of RISC-V's ...
Since the first vector supercomputers in the mid-1970’s, the largest scale applications have traditi...
The purpose of this paper is to show that multi-threading techniques can be applied to a vector proc...
A number of vector supercomputers feature very large memories. Unfortunately the large capacity memo...
This paper presents data confirming the fact that traditional vector architectures can not reduce th...
Vector processing has become commonplace in today's CPU microarchitectures. Vector instructions impr...
English: Power consumption has become one of the dominant issues in processor design, especially imp...
While parallel architectures based on clusters of Processing Elements (PEs) sharing L1 memory are wi...
The maturity level of RISC-V and the availability of domain-specific instruction set extensions, lik...
Vector extensions are a popular mean to exploit data parallelism in applications. Over recent years,...
Moore’s Law predicted that the number of transistors on a chip would double approximately every 2 ye...
This paper presents an experimental study on cache memory designs for vector computers. We use an ex...
SIMD accelerators are ubiquitous in microprocessors from different computing domains. Their high com...
The emergence of a new, open, and free instruction set architecture, RISC-V, has heralded a new era ...