Making computer systems more energy efficient while obtaining the maximum performance possible is key for future developments in engineering, medicine, entertainment, etc. However it has become a difficult task due to the increasing complexity of hardware and software, and their interactions. For example, developers have to deal with deep, multi-level cache hierarchies on modern CPUs, and keep busy thousands of cores in GPUs, which makes the programming process more difficult. To simplify this task, new abstractions and programming models are becoming popular. Their goal is to make applications more scalable and efficient, while still providing the flexibility and portability of old, widely adopted models. One example of this is task-based ...
The paper presents a task allocation scheme for system-level synthesis of multirate real-time tasks ...
The task parallel programming model allows programmers to express concurrency at a high level of abs...
In systems with complex many-core cache hierarchy, exploiting data locality can significantly reduce...
Making computer systems more energy efficient while obtaining the maximum performance possible is ke...
Maximizing the performance of computer systems while making them more energy efficient is vital for ...
Recent scheduling heuristics for task-based applications have managed to improve their by taking int...
Single threaded tasks are the basic unit of scheduling in modern runtimes targeting multicore hardwa...
The shift toward multicore processors has transformed the software and hardware landscape in the las...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
Modern computer architectures expose an increasing number of parallel features supported by complex ...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2014.As multi-core processors b...
Parallel task-based programming models like OpenMP support the declaration of task data dependences....
Advances in hardware technologies combined with decreased costs have started a trend towards massive...
On recent high-performance multiprocessors, there is a potential conflict between the goals of achie...
The paper presents a task allocation scheme for system-level synthesis of multirate real-time tasks ...
The task parallel programming model allows programmers to express concurrency at a high level of abs...
In systems with complex many-core cache hierarchy, exploiting data locality can significantly reduce...
Making computer systems more energy efficient while obtaining the maximum performance possible is ke...
Maximizing the performance of computer systems while making them more energy efficient is vital for ...
Recent scheduling heuristics for task-based applications have managed to improve their by taking int...
Single threaded tasks are the basic unit of scheduling in modern runtimes targeting multicore hardwa...
The shift toward multicore processors has transformed the software and hardware landscape in the las...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
Architects have adopted the shared memory model that implicitly manages cache coherence and cache ca...
Modern computer architectures expose an increasing number of parallel features supported by complex ...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 2014.As multi-core processors b...
Parallel task-based programming models like OpenMP support the declaration of task data dependences....
Advances in hardware technologies combined with decreased costs have started a trend towards massive...
On recent high-performance multiprocessors, there is a potential conflict between the goals of achie...
The paper presents a task allocation scheme for system-level synthesis of multirate real-time tasks ...
The task parallel programming model allows programmers to express concurrency at a high level of abs...
In systems with complex many-core cache hierarchy, exploiting data locality can significantly reduce...