International audienceWe present Simty, a massively multi-threaded RISC-V processor core that acts as a proof of concept for dynamic inter-thread vector-ization at the micro-architecture level. Simty runs groups of scalar threads executing SPMD code in lockstep, and assembles SIMD instructions dynamically across threads. Unlike existing SIMD or SIMT processors like GPUs or vector processors, Simty vector-izes scalar general-purpose binaries. It does not involve any instruction set extension or compiler change. Simty is described in synthesizable RTL. A FPGA prototype validates its scaling up to 2048 threads per core with 32-wide SIMD units. Simty provides an open platform for research on GPU micro-architecture, on hybrid CPU-GPU micro-archi...
The performance improvement of conventional processor has begun to stagnate in recent years. Because...
International audienceThreads of Single-Program Multiple-Data (SPMD) applications often execute the ...
This paper presents a novel, non-standard set of vector instruction types for exploring custom SIMD ...
International audienceWe present Simty, a massively multi-threaded RISC-V processor core that acts a...
Simty is a massively multi-threaded processor core that dynamically assembles SIMD instructions from...
Abstract—The wide availability and the Single-Instruction Multiple-Thread (SIMT)-style programming m...
International audienceSingle-Instruction Multiple-Thread (SIMT) micro-architectures implemented in G...
An important class of compute accelerators are graphics processing units (GPUs). Popular programming...
Simodense is a high-performance open-source RISC-V (RV32IM) softcore, optimised for exploring custom...
SIMT architectures improve performance and efficiency by ex-ploiting control and memory-access struc...
Moore’s law is dead. The physical and economic principles that enabled an exponential rise in transi...
This demo elaborates on the programmability aspect of Simodense, a recently released open-source sof...
The SIMT execution model implemented in GPUs synchronizes groups of threads to run their common inst...
Parallel architectures following the SIMT model such as GPUs benefit from application regularity by ...
Many modern microprocessors implement Simultaneous Multi-Threading (SMT) to improve the overall effi...
The performance improvement of conventional processor has begun to stagnate in recent years. Because...
International audienceThreads of Single-Program Multiple-Data (SPMD) applications often execute the ...
This paper presents a novel, non-standard set of vector instruction types for exploring custom SIMD ...
International audienceWe present Simty, a massively multi-threaded RISC-V processor core that acts a...
Simty is a massively multi-threaded processor core that dynamically assembles SIMD instructions from...
Abstract—The wide availability and the Single-Instruction Multiple-Thread (SIMT)-style programming m...
International audienceSingle-Instruction Multiple-Thread (SIMT) micro-architectures implemented in G...
An important class of compute accelerators are graphics processing units (GPUs). Popular programming...
Simodense is a high-performance open-source RISC-V (RV32IM) softcore, optimised for exploring custom...
SIMT architectures improve performance and efficiency by ex-ploiting control and memory-access struc...
Moore’s law is dead. The physical and economic principles that enabled an exponential rise in transi...
This demo elaborates on the programmability aspect of Simodense, a recently released open-source sof...
The SIMT execution model implemented in GPUs synchronizes groups of threads to run their common inst...
Parallel architectures following the SIMT model such as GPUs benefit from application regularity by ...
Many modern microprocessors implement Simultaneous Multi-Threading (SMT) to improve the overall effi...
The performance improvement of conventional processor has begun to stagnate in recent years. Because...
International audienceThreads of Single-Program Multiple-Data (SPMD) applications often execute the ...
This paper presents a novel, non-standard set of vector instruction types for exploring custom SIMD ...