The emergence of new manycore architectures, such as the Intel Xeon Phi, poses new challenges in how to adapt existing libraries and applications to this type of system. In particular, the exploitation of manycore accelerators requires a holistic solution that simultaneously addresses time-to-response, energy efficiency and ease of programming. In this paper, we adapt the SuperMatrix runtime task scheduler for dense linear algebra algorithms to the many-threaded Intel Xeon Phi, with special emphasis on the performance and energy profile of the solution. From the performance perspective, we optimize the balance between task- and data-parallelism, reporting notable results compared with Intel MKL. From the energy-aware point of view, we prop...
Abstract. Intel Xeon Phi is a recently released high-performance co-processor which features 61 core...
The power wall asks for a holistic effort from the high performance and scientific communities to de...
Over the past few years, energy consumption has become the main limiting factor for computing in gen...
The road towards Exascale Computing requires a holistic effort to address three different challenges...
This paper addresses the efficient exploitation of task-level parallelism, present in many dense lin...
In this paper, we propose a model for the energy consumption of the concurrent execution of three ke...
This paper analyzes the impact on power consumption of two DVFS-control strategies when appli...
workloads to run on hardware accelerators allowing for advantages that come from the many-core arch...
Sparse and irregular computations constitute a large fraction of applications in the data-intensive ...
Energy consumption of processors and memories is quickly becoming a limiting factor in the deploymen...
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as clusters, grids...
In the early 2000s, the superscalar CPU paradigm reached the point of diminishing returns mainly due...
In a previous PPoPP paper we showed how the FLAME methodology, combined with the SuperMatrix runtim...
Intel's Xeon Phi coprocessor has successfully proved its capability by being used in Tianhe-2 and...
For the past decade, power/energy consumption has become a limiting factor for large-scale and embed...