We present Synchroscalar, a tile-based architecture for embedded processing that is designed to provide the flex ibility of DSPs while approaching the power efficiency of ASICs. We achieve this goal by providing high parallelism and voltage scaling while minimizing control and commu nication costs. Specifically, Synchroscalar uses columns of processor tiles organized into statically-assigned frequency-voltage domains to minimize power consump tion. Furthermore, while columns use SIMD control to min imize overhead, data-dependent computations can be supported by extremely flexible statically-scheduled com munication between columns. We provide a detailed evaluation of Synchroscalar in cluding SPICE simulation, wire and device models, syn the...
The 9th International System-on-Chip (SoC) Conference, Exhibit & Workshops, November 22-24, 2011, Bu...
Multiple clock domains is one solution to the increasing problem of propagating the clock signal acr...
The host-multi-SIMD chip multiprocessor (CMP) architecture has been proved to be an efficient archit...
Abstract. Embedded devices have hard performance targets and severe power and area constraints that ...
We present an overview of the Synchroscalar single-chip, multi-core processor. Through the design of...
Embedded computing platforms require to support complex functionalities with high computational thro...
Journal ArticleAs clock frequency increases and feature size decreases, clock distribution and wire...
In the last ten years, limited clock frequency scaling and increasing power density has shifted IC d...
How to effectively use the increasing number of transistors available on a single chip while avoidin...
This presentation will focus on algorithms and reconfigurable tiled architectures for streaming DSP ...
We focus on architectures for streaming DSP applications such as wireless baseband processing and im...
Abstract — This paper investigates implementation techniques for tile-based chip multiprocessors wit...
This thesis pertains to the design of a digital signal processor (DSP) with emphasis on lowpower for...
Microprocessors are traditionally designed to provide "best overall" performance across a ...
In this paper, we investigate the power implications of tile size selection for tile-based processor...
The 9th International System-on-Chip (SoC) Conference, Exhibit & Workshops, November 22-24, 2011, Bu...
Multiple clock domains is one solution to the increasing problem of propagating the clock signal acr...
The host-multi-SIMD chip multiprocessor (CMP) architecture has been proved to be an efficient archit...
Abstract. Embedded devices have hard performance targets and severe power and area constraints that ...
We present an overview of the Synchroscalar single-chip, multi-core processor. Through the design of...
Embedded computing platforms require to support complex functionalities with high computational thro...
Journal ArticleAs clock frequency increases and feature size decreases, clock distribution and wire...
In the last ten years, limited clock frequency scaling and increasing power density has shifted IC d...
How to effectively use the increasing number of transistors available on a single chip while avoidin...
This presentation will focus on algorithms and reconfigurable tiled architectures for streaming DSP ...
We focus on architectures for streaming DSP applications such as wireless baseband processing and im...
Abstract — This paper investigates implementation techniques for tile-based chip multiprocessors wit...
This thesis pertains to the design of a digital signal processor (DSP) with emphasis on lowpower for...
Microprocessors are traditionally designed to provide "best overall" performance across a ...
In this paper, we investigate the power implications of tile size selection for tile-based processor...
The 9th International System-on-Chip (SoC) Conference, Exhibit & Workshops, November 22-24, 2011, Bu...
Multiple clock domains is one solution to the increasing problem of propagating the clock signal acr...
The host-multi-SIMD chip multiprocessor (CMP) architecture has been proved to be an efficient archit...