This paper presents an approach for profiling and tracing multithreaded applications with two main objectives. First, extend the positive points and overcome the limitations of GPROF tool when used on parallel applications. Second, focus on gathering information that can be useful for extending the existing GCC profile-driven optimizations and to investigate on new ones for parallel applications. In order to perform an insightful profiling of a multithreaded application, our approach proposes to gather intra-thread together with inter-thread information. For the latter, Operating System activity, as well as the usage of programmer-level synchronization mechanisms (e.g., semaphores, mutex), have to be taken into account. The proposed approac...
With the evolution of multi-core, multi-threaded processors from simple-scalar processors, the perfo...
Multithreaded programming is becoming increasingly important because of recent increase in the use o...
This article contains a brief description of existing graphical methods for presenting multithreaded...
AbstractApplication analysis is facilitated through a number of program profiling tools. The tools v...
The popularity of parallel systems for building high performance software only continues to rise. Pr...
Future integrated systems will contain billions of transistors, composing tens to hundreds of IP cor...
Understanding the performance of a multi-threaded application is difficult. The threads interfere wh...
Dynamically determining the appropriate number of threads for a multi-threaded application may lead ...
We investigate high-performance threading architectures for I/O intensive multi-threaded servers. We...
TECHNIQUES FOR THE EXECUTION PROFILE ANALYSIS AND OPTIMIZATION OF COMPUTATIONAL CHEMISTRY PROGRAMS, ...
In this paper, we present VIProf, a full-system, performance sampling system capable of extracting r...
Since multicore systems offer greater performance via parallelism, future computing is progressing t...
Application profiling is an important step in the design and optimization of embedded systems. Accur...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
Reconfigurable systems map the computational intensive parts of the code in hardware while less comp...
With the evolution of multi-core, multi-threaded processors from simple-scalar processors, the perfo...
Multithreaded programming is becoming increasingly important because of recent increase in the use o...
This article contains a brief description of existing graphical methods for presenting multithreaded...
AbstractApplication analysis is facilitated through a number of program profiling tools. The tools v...
The popularity of parallel systems for building high performance software only continues to rise. Pr...
Future integrated systems will contain billions of transistors, composing tens to hundreds of IP cor...
Understanding the performance of a multi-threaded application is difficult. The threads interfere wh...
Dynamically determining the appropriate number of threads for a multi-threaded application may lead ...
We investigate high-performance threading architectures for I/O intensive multi-threaded servers. We...
TECHNIQUES FOR THE EXECUTION PROFILE ANALYSIS AND OPTIMIZATION OF COMPUTATIONAL CHEMISTRY PROGRAMS, ...
In this paper, we present VIProf, a full-system, performance sampling system capable of extracting r...
Since multicore systems offer greater performance via parallelism, future computing is progressing t...
Application profiling is an important step in the design and optimization of embedded systems. Accur...
Over the past 10 years we have seen the transition from single core computer to multicore computing,...
Reconfigurable systems map the computational intensive parts of the code in hardware while less comp...
With the evolution of multi-core, multi-threaded processors from simple-scalar processors, the perfo...
Multithreaded programming is becoming increasingly important because of recent increase in the use o...
This article contains a brief description of existing graphical methods for presenting multithreaded...