Traditional (sequential) applications use malloc for a variety of dynamic data structures, like linked lists or trees. GPGPU is gaining attention and popularity because its massively-parallel architecture allows for great speed improvement for programs that can be parallelised and implemented for a platform like OpenCL. Programmers who try to port their existing sequential or even parallel program to OpenCL however will soon discover that this standard defines a subset of C with several limitations, one of which is the absence of a malloc() routine that can be called from an OpenCL kernel. This document describes the results of research towards the impact of this limitation by trying to answer the question: ``How should a kernel-side heap a...
This project provides a framework for fast memory managers on many core accelerators. It is based on...
Dynamic memory management is one of the most expensive but ubiquitous operations in many C/C++ appli...
While data parallelism aspects of OpenCL have been of primary interest due to the massively data par...
Abstract—Due to the diversity of processor architectures and application memory access patterns, the...
OpenCL is a programming language standard which enables the programmer to express the application by...
In this thesis, we describe two related memory allocators, each with novel properties. PALLOC1 cont...
OpenCL is a standard for parallel programming of heterogeneous systems. The benefits of a common pro...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
Heterogeneous computing platforms are becoming increasingly important in supercomputing. Many system...
The FreeBSD project has been engaged in ongoing work to provide scalable support for multi-processor...
International audienceThis paper presents a memory allocator targeting manycore architectures with d...
The omission of support for several software-defined constructs within High-Level Synthesis (HLS) ha...
In the last few years, the computing industry has changed its course from ever higher clock speeds t...
Many computer systems now include both CPUs and programmable GPUs. OpenCL, a new programming framew...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
This project provides a framework for fast memory managers on many core accelerators. It is based on...
Dynamic memory management is one of the most expensive but ubiquitous operations in many C/C++ appli...
While data parallelism aspects of OpenCL have been of primary interest due to the massively data par...
Abstract—Due to the diversity of processor architectures and application memory access patterns, the...
OpenCL is a programming language standard which enables the programmer to express the application by...
In this thesis, we describe two related memory allocators, each with novel properties. PALLOC1 cont...
OpenCL is a standard for parallel programming of heterogeneous systems. The benefits of a common pro...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
Heterogeneous computing platforms are becoming increasingly important in supercomputing. Many system...
The FreeBSD project has been engaged in ongoing work to provide scalable support for multi-processor...
International audienceThis paper presents a memory allocator targeting manycore architectures with d...
The omission of support for several software-defined constructs within High-Level Synthesis (HLS) ha...
In the last few years, the computing industry has changed its course from ever higher clock speeds t...
Many computer systems now include both CPUs and programmable GPUs. OpenCL, a new programming framew...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
This project provides a framework for fast memory managers on many core accelerators. It is based on...
Dynamic memory management is one of the most expensive but ubiquitous operations in many C/C++ appli...
While data parallelism aspects of OpenCL have been of primary interest due to the massively data par...