ABSTRACT: We propose a new class of profiler for distributed and heterogeneous systems. In these systems, a task may wait for the result of another task, either locally or remotely. Such wait dependencies are invisible to instruction profilers. We propose a precise, host-based method that recursively recovers wait causes across machines, using blocking as the fundamental mechanism to detect changes in the control flow. It relies solely on operating system events, namely scheduling, interrupt and network events. It can therefore observe interactions between kernel threads and is independent of the user-space runtime. Given a task, the algorithm computes its active path from the trace, which is presented in an interactive viewer for inspect...
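The recursive recovery of wait causes described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Interval` record, the `waker` field and the `active_path` function are hypothetical names, and the trace is assumed to be already reduced to per-task running and blocked intervals, with each blocked interval annotated by the task that issued the wake-up. Whenever the observed task is blocked, the sketch substitutes the activity of the task that woke it over the same time window, and recurses.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Interval:
    """One state interval of a task (hypothetical, simplified trace record)."""
    start: int
    end: int
    blocked: bool
    waker: Optional[str] = None  # task whose wake-up ended the blocked interval


Segment = Tuple[str, str, int, int]  # (task, "run" or "wait", start, end)


def active_path(trace: Dict[str, List[Interval]],
                task: str, start: int, end: int) -> List[Segment]:
    """Return the segments that explain what `task` depended on in [start, end).

    Assumes a consistent trace: wake-up relations contain no cycle
    within the same time window, so the recursion terminates.
    """
    path: List[Segment] = []
    for iv in trace.get(task, []):
        # Clip the interval to the window under study.
        lo, hi = max(iv.start, start), min(iv.end, end)
        if lo >= hi:
            continue
        if not iv.blocked:
            path.append((task, "run", lo, hi))
        elif iv.waker is None:
            # Wake-up source unknown (e.g. a timer): keep the wait as is.
            path.append((task, "wait", lo, hi))
        else:
            # Follow the dependency: what was the waker doing meanwhile?
            path.extend(active_path(trace, iv.waker, lo, hi))
    return path


# Toy trace: a client waits on a server, which itself waits on a disk task.
trace = {
    "client": [Interval(0, 2, False), Interval(2, 8, True, "server"),
               Interval(8, 10, False)],
    "server": [Interval(0, 3, True), Interval(3, 5, False),
               Interval(5, 7, True, "disk"), Interval(7, 8, False)],
    "disk": [Interval(0, 10, False)],
}
result = active_path(trace, "client", 0, 10)
```

In this toy trace, the client's wait between 2 and 8 is replaced by the server's activity, and the server's own wait between 5 and 7 is in turn replaced by the disk task. A real analysis would derive the intervals and wake-up relations from scheduling, interrupt and network events rather than from hand-written records.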
ABSTRACT: This paper studies the preemption between programs running in different virtual machines o...
Understanding the performance of a multi-threaded application is difficult. Th...
The amount of parallelism in modern supercomputers currently grows from generation to generation, an...
Distributed tracing allows tracking user requests that span across multiple services and machines in...
ABSTRACT: Tracing allows the analysis of task interactions with each other and with the operating sy...
ABSTRACT: This article proposes a novel approach to synchronize a posteriori the detailed execution ...
Fay is a flexible platform for the efficient collection, processing, and analysis of software execut...
ABSTRACT: Detecting latency-related problems in production environments is usually carried out at th...
ABSTRACT: Debugging and profiling tools can alter the execution flow or timing, can induce heisenbug...
One of the most challenging problems facing today's software engineer is to understand and modify di...
ABSTRACT: This paper studies interactions between virtual machines and their host through the sharin...
Driven by growing application requirements and accelerated by current trends in microprocessor desig...
Due to the available concurrency in modern-day supercomputers, the complexity of developing efficien...
Abstract—Event traces are valuable for understanding the behavior of parallel programs. However, aut...