Single-ISA heterogeneous multi-core architectures offer a compelling high-performance and high-efficiency solution to executing task-parallel workloads in mobile systems on chip (SoCs). In addition to task-parallel workloads, many data-parallel applications such as machine learning, computer vision, and data analytics increasingly run on mobile SoCs to provide real-time user interactions. Next-generation scalable vector architectures such as the RISC-V vector extension and Arm SVE have recently emerged as unified vector abstractions for both large- and small-scale systems. In this paper, we propose novel area-efficient high-performance architectures called big.VLITTLE that support next-generation vector architectures to efficiently accelera...
During past several years, there has been a trend that the modern mobile SoC (system-on-chip) chipse...
Media processing has motivated strong changes in the focus and design of processors. These applicati...
Heterogeneity, parallelization and vectorization are key techniques to improve the performance and e...
Single-ISA heterogeneous multi-core architectures offer a compelling high-performance and high-effic...
As we approach the end of conventional technology scaling, computer architects are forced to incorpo...
In numerous mobile applications involving complex video, image, signal, communication or security pr...
Vector processing has become commonplace in today's CPU microarchitectures. Vector instructions impr...
In the low-end mobile processor market, power, energy, and area budgets are significantly lower than...
The expeditious proliferation of Internet connectivity and the growing adoption of digital products ...
Data-level parallelism is frequently ignored or underutilized. Achieved through vector/SIMD capabili...
We are attacking the memory bottleneck by building a “smart ” memory controller that improves effect...
We are investigating vector-thread architectures which provide competitive performance and efficienc...
Abstract—State-of-the-art mobile system-on-chips (SoC) in-clude heterogeneity in various forms for a...
As the rate of annual data generation grows exponentially, there is a demand to aggregate and summar...
MW-scale parallel accelerators are a promising target for application domains such as the Internet o...
During past several years, there has been a trend that the modern mobile SoC (system-on-chip) chipse...
Media processing has motivated strong changes in the focus and design of processors. These applicati...
Heterogeneity, parallelization and vectorization are key techniques to improve the performance and e...
Single-ISA heterogeneous multi-core architectures offer a compelling high-performance and high-effic...
As we approach the end of conventional technology scaling, computer architects are forced to incorpo...
In numerous mobile applications involving complex video, image, signal, communication or security pr...
Vector processing has become commonplace in today's CPU microarchitectures. Vector instructions impr...
In the low-end mobile processor market, power, energy, and area budgets are significantly lower than...
The expeditious proliferation of Internet connectivity and the growing adoption of digital products ...
Data-level parallelism is frequently ignored or underutilized. Achieved through vector/SIMD capabili...
We are attacking the memory bottleneck by building a “smart ” memory controller that improves effect...
We are investigating vector-thread architectures which provide competitive performance and efficienc...
Abstract—State-of-the-art mobile system-on-chips (SoC) in-clude heterogeneity in various forms for a...
As the rate of annual data generation grows exponentially, there is a demand to aggregate and summar...
MW-scale parallel accelerators are a promising target for application domains such as the Internet o...
During past several years, there has been a trend that the modern mobile SoC (system-on-chip) chipse...
Media processing has motivated strong changes in the focus and design of processors. These applicati...
Heterogeneity, parallelization and vectorization are key techniques to improve the performance and e...