Heterogeneous systems have emerged as state-of-the-art computing solutions. Such systems consist of acceleration units that provide massive compute capabilities within limited power budgets. Compute and/or memory intensive regions of applications are often offloaded to these accelerators and as such they naturally exchange data with the host core. This complex scenario poses a key challenge: how do we optimize data movement between the host core and accelerators from a holistic system-level perspective? My research focuses on addressing the above question. Data movement optimizations can be explored in two flavors: 1) Maximizing locality and keeping the data close to its compute, and 2) moving the actual computation itself close to t...
Heterogeneous multicore systems are becoming increasingly important as the need for computation powe...
Today's heterogeneous architectures bring together multiple general purpose CPUs, domain specific GP...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
Heterogeneous systems have emerged as state-of-the-art computing solutions. Such systems consist of ...
Recent technological trends have aided the design and development of large-scale heterogeneous syste...
Heterogeneous parallel computing combines general purpose processors with accelerators to efficientl...
In light of the failure of Dennard scaling and recent slowdown of Moore's Law, both industry and aca...
<p>The design of microprocessor technology has hit several "walls" in recent decades. These limits o...
The exponential growth of the dataset size demanded by modern big data applications requires innovat...
This dissertation investigates the communication optimization for customizable domain-specific compu...
Shared virtual memory is key in heterogeneous systems on chip (SoCs) that combine a general-purpose ...
Because of tight power and energy constraints, industry is progressively shifting toward heterogeneo...
In many domains, accelerators---such as graphic processing units (GPUs) and field programmable gate ...
The high performance computing landscape is shifting from collections of homogeneous nodes towards h...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...
Heterogeneous multicore systems are becoming increasingly important as the need for computation powe...
Today's heterogeneous architectures bring together multiple general purpose CPUs, domain specific GP...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...
Heterogeneous systems have emerged as state-of-the-art computing solutions. Such systems consist of ...
Recent technological trends have aided the design and development of large-scale heterogeneous syste...
Heterogeneous parallel computing combines general purpose processors with accelerators to efficientl...
In light of the failure of Dennard scaling and recent slowdown of Moore's Law, both industry and aca...
<p>The design of microprocessor technology has hit several "walls" in recent decades. These limits o...
The exponential growth of the dataset size demanded by modern big data applications requires innovat...
This dissertation investigates the communication optimization for customizable domain-specific compu...
Shared virtual memory is key in heterogeneous systems on chip (SoCs) that combine a general-purpose ...
Because of tight power and energy constraints, industry is progressively shifting toward heterogeneo...
In many domains, accelerators---such as graphic processing units (GPUs) and field programmable gate ...
The high performance computing landscape is shifting from collections of homogeneous nodes towards h...
This work studies programmability enhancing abstractions in the context of accelerators and heteroge...
Heterogeneous multicore systems are becoming increasingly important as the need for computation powe...
Today's heterogeneous architectures bring together multiple general purpose CPUs, domain specific GP...
The adoption of High-Level Synthesis (HLS) tools has significantly reduced accelerator design time. ...