Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016). Sofia (Bulgaria), October, 6-7, 2016.The ever growing complexity of high performance computing systems imposes significant challenges to exploit as much as possible their computational and memory resources. Recently, the Cache-aware Roofline Model has gained popularity due to its simplicity when modeling multi-cores with complex memory hierarchy, characterizing applications bottlenecks, and quantifying achieved or remaining improvements. In this short paper we involve hardware locality topology detection to build the Cache Aware Roofline Model for modern processors in an open-source locality-aware tool. The proposed tool also includes a se...
International audienceHigh-performance computing requires a deep knowledge of the hardware platform ...
This research is part of a co-design project that has the goal of designing hardware syste...
Manufacturers will likely offer multiple products with differing numbers of cores to cover multiple ...
Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016...
International audienceThe ever growing complexity of high performance computing systems imposes sign...
International audienceIn order to fulfill modern applications needs, computing systems become more p...
International audienceThe roofline model is a popular approach to ``bounds and bottleneck''performan...
With energy-efficient architectures, including accelerators and many-core processors, gaining tracti...
HPC applications usually run at a low fraction of the computer's peak performance. Empirical perform...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
We present preliminary results of theRooflineToolkit formulticore, manycore, and accelerated archite...
Through years, the complexity of High Performance Computing (HPC) systems’ memory hierarchy has incr...
thesisTo address the need of understanding and optimizing the performance of complex applications an...
This research is part of a co-design project that has the goal of designing hardware systems to matc...
International audienceHigh-performance computing requires a deep knowledge of the hardware platform ...
This research is part of a co-design project that has the goal of designing hardware syste...
Manufacturers will likely offer multiple products with differing numbers of cores to cover multiple ...
Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016...
International audienceThe ever growing complexity of high performance computing systems imposes sign...
International audienceIn order to fulfill modern applications needs, computing systems become more p...
International audienceThe roofline model is a popular approach to ``bounds and bottleneck''performan...
With energy-efficient architectures, including accelerators and many-core processors, gaining tracti...
HPC applications usually run at a low fraction of the computer's peak performance. Empirical perform...
The end of Dennard scaling signaled a shift in HPC supercomputer architectures from systems built fr...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
We present preliminary results of theRooflineToolkit formulticore, manycore, and accelerated archite...
Through years, the complexity of High Performance Computing (HPC) systems’ memory hierarchy has incr...
thesisTo address the need of understanding and optimizing the performance of complex applications an...
This research is part of a co-design project that has the goal of designing hardware systems to matc...
International audienceHigh-performance computing requires a deep knowledge of the hardware platform ...
This research is part of a co-design project that has the goal of designing hardware syste...
Manufacturers will likely offer multiple products with differing numbers of cores to cover multiple ...