Abstract—This paper introduces a new hardware mechanism for capturing and compressing program execution traces unob-trusively in real-time. The proposed mechanism is based on two structures called stream cache and last stream predictor. We explore the effectiveness of a trace module based on these struc-tures and analyze the design space. We show that our trace module, with less than 600 bytes of state, achieves a trace-port bandwidth of 0.15 bits/instruction/processor, which is over six times better than state-of-the-art commercial designs. I
Capturing program and data traces during program execution unobtrusively on-the-fly is crucial in de...
The objective of this paper is to improve the use of the hardware resources of the trace cache mecha...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
Unobtrusive capturing of program execution traces in real-time is crucial in debugging cyber-physica...
Abstract—Unobtrusive capturing of program execution traces in real-time is crucial for debugging man...
Abstract—Collecting the program execution traces at full speed is essential to the analysis and debu...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
Trace caches are used to help dynamic branch prediction make multiple predictions in a cycle by embe...
Abstract — Trace-driven simulation has long been used in both processor and memory studies. The larg...
Trace files record the execution behavior of programs for future analysis. Unfortunately, nontrivial...
The use of Trace Caches is a well known technique to overcome the problem of limited instruction fet...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Trace-driven simulation is a wid...
Event tracing of applications under dynamic execution is crucial for performance modeling, optimizat...
In trace processors, a sequential program is partitioned at run time into "traces." A tra...
Capturing program and data traces during program execution unobtrusively on-the-fly is crucial in de...
The objective of this paper is to improve the use of the hardware resources of the trace cache mecha...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...
Unobtrusive capturing of program execution traces in real-time is crucial in debugging cyber-physica...
Abstract—Unobtrusive capturing of program execution traces in real-time is crucial for debugging man...
Abstract—Collecting the program execution traces at full speed is essential to the analysis and debu...
As the instruction issue width of superscalar proces-sors increases, instruction fetch bandwidth req...
Trace caches are used to help dynamic branch prediction make multiple predictions in a cycle by embe...
Abstract — Trace-driven simulation has long been used in both processor and memory studies. The larg...
Trace files record the execution behavior of programs for future analysis. Unfortunately, nontrivial...
The use of Trace Caches is a well known technique to overcome the problem of limited instruction fet...
We explore the use of compiler optimizations, which optimize the layout of instructions in memory. T...
Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)Trace-driven simulation is a wid...
Event tracing of applications under dynamic execution is crucial for performance modeling, optimizat...
In trace processors, a sequential program is partitioned at run time into "traces." A tra...
Capturing program and data traces during program execution unobtrusively on-the-fly is crucial in de...
The objective of this paper is to improve the use of the hardware resources of the trace cache mecha...
As the issue width of superscalar processors is increased, instruction fetch bandwidth requirements ...