With speculative thread-level parallelization, codes that cannot be fully compiler-analyzed are aggressively executed in parallel. If the hardware detects a cross-thread dependence violation, it squashes offending threads and resumes execution. Unfortunately, frequent squashing cripples performance. This paper proposes a new frameworkof hardware mechanisms to eliminate most squashes due to data dependences in multiprocessors. The frameworkworks by learning and predicting violations, and applying delayed disambiguation, value prediction, and stall and release. The frameworkis suited for directory-based multiprocessors that trackmemory accesses at the system level with the coarse granularity of memory lines. Simulations of a 16-processor mach...
As we look to the future, and the prospect of a bil-lion transistors on a chip, it seems inevitable ...
The advent of multicores presents a promising opportunity for speeding up the execution of sequentia...
The major specific contributions are: (1) We introduce a new compiler analysis to identify the memor...
108 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2001.In this thesis, we also propo...
We present a software approach to design a thread-level data dependence speculation system targeting...
grantor: University of TorontoTo fully exploit the potential of single-chip multiprocessor...
The basic idea under speculative parallelization (also called thread-level spec-ulation) [2, 6, 7] i...
Speculative thread-level parallelization is a promising way to speed up codes that compilers fail to...
As we look to the future, and the prospect of a billion transistors on a chip, it seems inevitable t...
Speculative thread-level parallelization is a promising way to speed up codes that compilers fail to...
With the advent of multicore processors, extracting thread level parallelism from a sequential progr...
This paper presents the Mitosis framework, which is a combined hardware-software approach to specula...
Speculative parallelization can provide significant sources of additional thread-level parallelism, ...
Speculative parallelization (SP) enables a processor to extract multiple threads from a sequential i...
Thread-level speculation (TLS) has proven to be a promising method of extracting parallelism from bo...
As we look to the future, and the prospect of a bil-lion transistors on a chip, it seems inevitable ...
The advent of multicores presents a promising opportunity for speeding up the execution of sequentia...
The major specific contributions are: (1) We introduce a new compiler analysis to identify the memor...
108 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2001.In this thesis, we also propo...
We present a software approach to design a thread-level data dependence speculation system targeting...
grantor: University of TorontoTo fully exploit the potential of single-chip multiprocessor...
The basic idea under speculative parallelization (also called thread-level spec-ulation) [2, 6, 7] i...
Speculative thread-level parallelization is a promising way to speed up codes that compilers fail to...
As we look to the future, and the prospect of a billion transistors on a chip, it seems inevitable t...
Speculative thread-level parallelization is a promising way to speed up codes that compilers fail to...
With the advent of multicore processors, extracting thread level parallelism from a sequential progr...
This paper presents the Mitosis framework, which is a combined hardware-software approach to specula...
Speculative parallelization can provide significant sources of additional thread-level parallelism, ...
Speculative parallelization (SP) enables a processor to extract multiple threads from a sequential i...
Thread-level speculation (TLS) has proven to be a promising method of extracting parallelism from bo...
As we look to the future, and the prospect of a bil-lion transistors on a chip, it seems inevitable ...
The advent of multicores presents a promising opportunity for speeding up the execution of sequentia...
The major specific contributions are: (1) We introduce a new compiler analysis to identify the memor...