Modern designs for embedded systems are increasingly embracing cluster-based architectures, where small sets of cores communicate through tightly-coupled shared memory banks and high-performance interconnections. At the same time, the complexity of modern applications requires new programming abstractions to exploit dynamic and/or irregular parallelism on such platforms. Supporting dynamic parallelism in systems which i) are resource-constrained and ii) run applications with small units of work calls for a runtime environment which has minimal overhead for the scheduling of parallel tasks. In this work, we study the major sources of overhead in the implementation of OpenMP dynamic loops, sections and tasks, and propose a hardware implementa...
International audienceThe current trend in embedded computing consists in increasing the number of p...
The recent technological advancements and market trends are causing an interesting phenomenon toward...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...
Modern designs for embedded systems are increasingly embracing cluster-based architectures, where sm...
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These plat...
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These plat...
We explore runtime mechanisms and policies for scheduling dynamic multi-grain parallelism on heterog...
Nested parallelism is a well-known parallelization strategy to exploit irregular parallelism in HPC ...
Manufacturing and environmental variations cause timing errors that are typically avoided by conserv...
[[abstract]]Multicore computers have been widely included in cluster systems. They are shared memory...
This paper presents a hybrid approach to automatic parallelization of computer programs which combin...
Dynamic Task Scheduling is an enticing programming model aiming to ease the development of parallel ...
Computing systems have undergone a fundamental transformation from single core devices to devices wi...
Task Parallelism is a parallel programming model that provides code annotation constructs to outline...
Efficiently scheduling parallel tasks on to the processors of a shared-memory multiprocessor is crit...
International audienceThe current trend in embedded computing consists in increasing the number of p...
The recent technological advancements and market trends are causing an interesting phenomenon toward...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...
Modern designs for embedded systems are increasingly embracing cluster-based architectures, where sm...
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These plat...
Cluster-based architectures are increasingly being adopted to design embedded many-cores. These plat...
We explore runtime mechanisms and policies for scheduling dynamic multi-grain parallelism on heterog...
Nested parallelism is a well-known parallelization strategy to exploit irregular parallelism in HPC ...
Manufacturing and environmental variations cause timing errors that are typically avoided by conserv...
[[abstract]]Multicore computers have been widely included in cluster systems. They are shared memory...
This paper presents a hybrid approach to automatic parallelization of computer programs which combin...
Dynamic Task Scheduling is an enticing programming model aiming to ease the development of parallel ...
Computing systems have undergone a fundamental transformation from single core devices to devices wi...
Task Parallelism is a parallel programming model that provides code annotation constructs to outline...
Efficiently scheduling parallel tasks on to the processors of a shared-memory multiprocessor is crit...
International audienceThe current trend in embedded computing consists in increasing the number of p...
The recent technological advancements and market trends are causing an interesting phenomenon toward...
With the introduction of more powerful and massively parallel embedded processors, embedded systems ...