Abstract — This paper describes instruction set extensions for a variant of multi-threading called micro-threading for the LEON3 SPARCv8 processor. We show an architecture of the developed processor and its key blocks- cache controller, register file, thread scheduler. The processor has been implemented in a Xilinx Virtex2Pro FPGA. The extensions are evaluated in terms of extra resources needed, and the overall performance of the developed processor is evaluated on a simple DSP computation typical for embedded systems. I
Soft-core embedded systems implemented on FPGAs offer a high level of flexibility. Application speci...
Thread level parallelism of applications is commonly exploited using multi-thread processors. In suc...
Modern superscalar processors use wide instruction issue widths and out-of-order execution in order ...
This paper describes a number of microarchitectural tech-niques for supporting multithreading in sof...
Multi-core processors are becoming omnipresent in all kinds of computing platforms. Applications dev...
Abstract. We analyse an impact of long-latency instructions, the fa-mily blocksize parameter, and th...
Graduation date: 2007Dynamic multithreaded processors attempt to increase the performance of a singl...
Modem processors are designed to achieve greater amounts of instruction level parallelism (ILP) and ...
Several manufacturers have recently announced the first simultaneous-multithreaded processors, both ...
Large synchronization and communication overhead will become a major concern in future extreme-scale...
Present-day parallel computers often face the problems of large software overheads for process switc...
Dual width instruction set embedded processors such as ARM provide 16-bit instruction set in additio...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
We are investigating vector-thread architectures which provide competitive performance and efficienc...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
Soft-core embedded systems implemented on FPGAs offer a high level of flexibility. Application speci...
Thread level parallelism of applications is commonly exploited using multi-thread processors. In suc...
Modern superscalar processors use wide instruction issue widths and out-of-order execution in order ...
This paper describes a number of microarchitectural tech-niques for supporting multithreading in sof...
Multi-core processors are becoming omnipresent in all kinds of computing platforms. Applications dev...
Abstract. We analyse an impact of long-latency instructions, the fa-mily blocksize parameter, and th...
Graduation date: 2007Dynamic multithreaded processors attempt to increase the performance of a singl...
Modem processors are designed to achieve greater amounts of instruction level parallelism (ILP) and ...
Several manufacturers have recently announced the first simultaneous-multithreaded processors, both ...
Large synchronization and communication overhead will become a major concern in future extreme-scale...
Present-day parallel computers often face the problems of large software overheads for process switc...
Dual width instruction set embedded processors such as ARM provide 16-bit instruction set in additio...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
We are investigating vector-thread architectures which provide competitive performance and efficienc...
To achieve high performance, contemporary computer systems rely on two forms of parallelism: instruc...
Soft-core embedded systems implemented on FPGAs offer a high level of flexibility. Application speci...
Thread level parallelism of applications is commonly exploited using multi-thread processors. In suc...
Modern superscalar processors use wide instruction issue widths and out-of-order execution in order ...