It is easy to find errors and inefficient parts of a sequential program, by using a standard debugger/profiler, but there is no such tool in a parallel environment. The only way to study the race conditions of a parallel program is to execute it and collect data about its execution. The programmer can then use the generated trace files and specialized tuning tools to visualize and improve the behavior of the program: idle processors, communications, etc. The problem in large parallel systems is that these tools have to deal with an enormous amount of data. The classical approach to monitor and trace analysis i.e. sequential, event driven, post-mortem monitoring) is no longer realistic. To avoid this bottleneck, we introduced PIMSY (Parallel...
In this paper, an efficient algorithm to simultaneously implement array alignment and data/computati...
With the emergence of highly heterogeneous, dynamic and large distributed platforms, declarative pro...
Automatic parallelization is one of the approaches aimed at a better and easier use of parallel comp...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
It is not easy to reach the best performances you can expect of a parallel computer. We therefore ha...
We describe a simple data-parallel kernel language which encapsulates the main data-parallel control...
This report describes three application program interfaces to BPFS, a distributed, modular parallel ...
The research presented in this thesis was conducted in the context of the Mosaic C, an experimental,...
Given a set $L$ of $n$ points in the $d$-dimensional Cartesian space $E^d$, and a query specifying a...
In this paper, we compare three nested loops parallelization algorithms (Allen and Kennedy's algorit...
We describe the compilation and execution of data-parallel languages for networks of workstations. E...
We study the computational power of rational Piecewise Constant Derivative (PCD) systems. PCD system...
A parallel programming archetype [Cha94, CMMM95] is an abstraction that captures the common features...
The computational requirements of the processors are increasing tremendously with the increase in th...
Cole presented a parallel merge sort for the PRAM model that performs in log n parallel steps using ...
In this paper, an efficient algorithm to simultaneously implement array alignment and data/computati...
With the emergence of highly heterogeneous, dynamic and large distributed platforms, declarative pro...
Automatic parallelization is one of the approaches aimed at a better and easier use of parallel comp...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
It is not easy to reach the best performances you can expect of a parallel computer. We therefore ha...
We describe a simple data-parallel kernel language which encapsulates the main data-parallel control...
This report describes three application program interfaces to BPFS, a distributed, modular parallel ...
The research presented in this thesis was conducted in the context of the Mosaic C, an experimental,...
Given a set $L$ of $n$ points in the $d$-dimensional Cartesian space $E^d$, and a query specifying a...
In this paper, we compare three nested loops parallelization algorithms (Allen and Kennedy's algorit...
We describe the compilation and execution of data-parallel languages for networks of workstations. E...
We study the computational power of rational Piecewise Constant Derivative (PCD) systems. PCD system...
A parallel programming archetype [Cha94, CMMM95] is an abstraction that captures the common features...
The computational requirements of the processors are increasing tremendously with the increase in th...
Cole presented a parallel merge sort for the PRAM model that performs in log n parallel steps using ...
In this paper, an efficient algorithm to simultaneously implement array alignment and data/computati...
With the emergence of highly heterogeneous, dynamic and large distributed platforms, declarative pro...
Automatic parallelization is one of the approaches aimed at a better and easier use of parallel comp...