We present a new approach for mapping the computation of an application to an MP-SOC architecture. Instead of moving data read and write requests, extremely lightweight threads are moved between the processor cores. As a consequence, cache coherence problems and the need for read reply messages are eliminated. Lamport's sequential consistency for shared-memory multiprocessor systems and flexible, efficient handling of a varying number of threads are achieved at no extra cost. In our architecture, the challenge of implementing an application efficiently reduces to mapping its data so that the need to move threads is balanced against the bandwidth of the local memories and the intercommunication network. We out...
Global communication costs in future single-chip multiprocessors will increase linearly w...
This paper describes initial results for an architecture called the Shared-Thread Multiprocessor (ST...
The parallelism in shared-memory systems has increased significantly with the ...
Moving threads is a new kind of approach for mapping the computation of an application to multiproce...
Moving threads is a theoretically interesting approach for mapping the computation of an application...
Moving threads is a new kind of approach for multicore processor architectures. Traditionally, each ...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
While the paradigm offered by SMP designs is a relatively clean one, programming paradigms offered o...
This dissertation focuses on design and implementation issues of a multithreaded parallel programmin...
With the increasing prominence of many-core architectures and decreasing per-core resource...
Scalability and programmability are important issues in large homogeneous MPSo...
Distributed Shared Memory (DSM) systems provide a logically shared memory over physically distribute...
Programming multicore systems is currently considered very difficult. One reason is that those are mo...
MPI-based explicitly parallel programs have been widely used for developing high-performance applicat...
In this paper, we describe the architectural output of our 'Moving threads realization study' (MOTH)...