As supercomputers grow in size, the number of abnormal events also increases. Some of these events may lead to an application failure; others may simply reduce system efficiency. CPU overheating is one such event: when a CPU overheats, it reduces its frequency, which decreases system efficiency. This paper studies the problem of CPU overheating in supercomputers. In the first part, we analyze data collected over one year on a supercomputer from the Top500 list to understand the conditions under which CPU overheating occurs. Our analysis shows that overheating events are due to some specific applications. In the second part, we evaluate the impact of such overheating events on the performance of MPI applicat...
This thesis analyzes the dependency of performance, power consumption and temperature on processor f...
Low-power processors have emerged as an alternative for supercomputers and cloud computers to reduce...
As embedded devices start supporting heterogeneous processing cores (Central Processing Uni...
Power consumption and process variability are two important, interconnected, challenges of future ge...
Parallel runtime systems such as MPI or task-based libraries provide models to...
Modern scientific discoveries are driven by an unsatisfiable demand for computational resources. Hig...
Cluster end-users and administrators have become more cognizant of the fact that large-sc...
Despite recent advances in improving the performance of high performance compu...
As side effects of the end of Dennard’s scaling, power and thermal technological walls stand in fron...
Many contemporary HPC systems expose their jobs to substantial amounts of interference, leading to s...
As the scale of High-Performance Computing (HPC) clusters continues to grow, their increasing failur...
In the scope of technical and scientific computing, the rush towards larger simulations has been so ...
As supercomputers grow in size, so does the number of failures or abnormal event...
Conference of 2013 IEEE 19th International On-Line Testing Symposium, IOLTS 2013 ; Conference Date: ...
Since 1993, we have compiled and published twice a year a list of the most powerful supercomputers in the wor...