International audienceIn order to fulfill modern applications needs, computing systems become more powerful, heterogeneous and complex. NUMA platforms and emerging high bandwidth memories offer new opportunities for performance improvements. However they also increase hardware and software complexity, thus making application performance analysis and optimization an even harder task. The Cache-Aware Roofline Model (CARM) is an insightful, yet simple model designed to address this issue. It provides feedback on potential applications bottlenecks and shows how far is the application performance from the achievable hardware upper-bounds. However, it does not encompass NUMA systems and next generation processors with heterogeneous memories. Yet,...
Hardware transactional memory (HTM) is supported by widely-used commodity processors. While the effe...
In scalable multiprocessor architectures, the times required for a processor to access various porti...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
International audienceThe ever growing complexity of high performance computing systems imposes sign...
Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016...
International audienceModeling and simulation are crucial in high-performance computing (HPC), with ...
With energy-efficient architectures, including accelerators and many-core processors, gaining tracti...
The latency of memory access times is hence non-uniform, because it depends on where the request ori...
Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provid...
Today's microprocessors include multicores that feature a diverse set of compute cores and onboard m...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provid...
Computer architects have increased hardware parallelism and power efficiency by integrating massivel...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
La hiérarchie mémoire des serveurs de calcul est de plus en plus complexe. Les machines disposent de...
Hardware transactional memory (HTM) is supported by widely-used commodity processors. While the effe...
In scalable multiprocessor architectures, the times required for a processor to access various porti...
International audienceThe increasing computation capability of servers comes with a dramatic increas...
International audienceThe ever growing complexity of high performance computing systems imposes sign...
Proceedings of: Third International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2016...
International audienceModeling and simulation are crucial in high-performance computing (HPC), with ...
With energy-efficient architectures, including accelerators and many-core processors, gaining tracti...
The latency of memory access times is hence non-uniform, because it depends on where the request ori...
Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provid...
Today's microprocessors include multicores that feature a diverse set of compute cores and onboard m...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
Cache Coherent NUMA (ccNUMA) architectures are a widespread paradigm due to the benefits they provid...
Computer architects have increased hardware parallelism and power efficiency by integrating massivel...
As the adoption of Big Data technologies becomes the norm in an increasing number of scenarios, ther...
La hiérarchie mémoire des serveurs de calcul est de plus en plus complexe. Les machines disposent de...
Hardware transactional memory (HTM) is supported by widely-used commodity processors. While the effe...
In scalable multiprocessor architectures, the times required for a processor to access various porti...
International audienceThe increasing computation capability of servers comes with a dramatic increas...