Servet is a suite of benchmarks focused on detecting a set of parameters with high influence on the overall performance of multicore systems. These parameters can be used for autotuning codes to increase their performance on multicore clusters. Although Servet has been proved to detect accurately cache hierarchies, bandwidths and bottlenecks in memory accesses, as well as the communication overhead among cores, up to now the impact of the use of this information on application performance optimization has not been assessed. This paper presents a novel algorithm that automatically uses Servet for mapping parallel applications on multicore systems and analyzes its impact on three testbeds using three different parallel programming models: mes...
International audienceCurrent and future architectures rely on thread-level parallelism to sustain p...
Compiler-based auto-parallelization is a much-studied area but has yet to find widespread applicatio...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
This is a post-peer-review, pre-copyedit version of an article published in Computers & Electrical E...
Abstract—The growing complexity in computer system hierar-chies due to the increase in the number of...
This is a post-peer-review, pre-copyedit version of an article published in 2010 IEEE International ...
International audienceThe parallelism in shared-memory systems has increased significantly with the ...
Though transistor scaling yields more transistors per chip, however, the consistent performance gain...
The efficient mapping of program parallelism to multi-core processors is highly dependent on the und...
For a wide variety of applications, both task and data parallelism must be exploited to achieve the ...
Abstract—Recently parallel architectures have entered every area of computing, from multi-core proce...
Multi-core processors are now ubiquitous and are widely seen as the most viable means of delivering ...
Recent trends show a steady increase in the utilization of heterogeneous multicore architectures in ...
Servet is a suite of benchmarks focused on extracting a set of parameters with high influence on the...
To reduce latency and increase bandwidth to memory, modern microprocessors are designed with deep me...
International audienceCurrent and future architectures rely on thread-level parallelism to sustain p...
Compiler-based auto-parallelization is a much-studied area but has yet to find widespread applicatio...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...
This is a post-peer-review, pre-copyedit version of an article published in Computers & Electrical E...
Abstract—The growing complexity in computer system hierar-chies due to the increase in the number of...
This is a post-peer-review, pre-copyedit version of an article published in 2010 IEEE International ...
International audienceThe parallelism in shared-memory systems has increased significantly with the ...
Though transistor scaling yields more transistors per chip, however, the consistent performance gain...
The efficient mapping of program parallelism to multi-core processors is highly dependent on the und...
For a wide variety of applications, both task and data parallelism must be exploited to achieve the ...
Abstract—Recently parallel architectures have entered every area of computing, from multi-core proce...
Multi-core processors are now ubiquitous and are widely seen as the most viable means of delivering ...
Recent trends show a steady increase in the utilization of heterogeneous multicore architectures in ...
Servet is a suite of benchmarks focused on extracting a set of parameters with high influence on the...
To reduce latency and increase bandwidth to memory, modern microprocessors are designed with deep me...
International audienceCurrent and future architectures rely on thread-level parallelism to sustain p...
Compiler-based auto-parallelization is a much-studied area but has yet to find widespread applicatio...
High-performance computing systems have become increasingly dynamic, complex, and unpredictable. To ...