The implementation of operating system functions can significantly affect the performance of parallel programs. Our research concerns the customization of operating system functionality for different target hardware to improve the performance of application programs. In this paper, we describe our experience with a reconfigurable, multiprocessor Mach cthreads package on a 32-node KSR-1 supercomputer. Sample static and dynamic configurations address the exchange and on-line adaptation of threads schedulers, and the on-line adaptation of threads synchronization constructs. Experimental results are demonstrated with two different parallel application programs, (1) a parallel branch-and-bound application and (2) the runtime kernel of a Time War...
This paper evaluates new techniques to improve performance and efficiency of Chip MultiProcessors (C...
Processors in large-scale multiprocessors must be able to tolerate large communication latencies and...
Future integrated systems will contain billions of transistors, composing tens to hundreds of IP cor...
The implementation of operating system functions can significantly affect the performance of paralle...
Operating Systems have been considered as a cor-nerstone of the modern computer system, and the con-...
Recently, lightweight thread libraries have become a common entity to support concurrent programmin...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 1996.Designing high performance...
This paper describes initial results for an architecture called the Shared-Thread Multiprocessor (ST...
Multi-core processors are ubiquitous in all market segments from embedded to high performance comput...
In processors with several levels of hardware resource sharing, like CMPs in which each core is an S...
Loop scheduling has significant differences in multithreaded from other parallel processors. The sha...
The continuing launch of various multi-core processors popularizes parallel computing of gaining hig...
Thread level parallelism of applications is commonly exploited using multi-thread processors. In suc...
This paper describes a number of microarchitectural tech-niques for supporting multithreading in sof...
Abstract 1 This paper discusses the preliminary performance study of hybrid multithreaded execution ...
This paper evaluates new techniques to improve performance and efficiency of Chip MultiProcessors (C...
Processors in large-scale multiprocessors must be able to tolerate large communication latencies and...
Future integrated systems will contain billions of transistors, composing tens to hundreds of IP cor...
The implementation of operating system functions can significantly affect the performance of paralle...
Operating Systems have been considered as a cor-nerstone of the modern computer system, and the con-...
Recently, lightweight thread libraries have become a common entity to support concurrent programmin...
Thesis (Ph. D.)--University of Rochester. Dept. of Computer Science, 1996.Designing high performance...
This paper describes initial results for an architecture called the Shared-Thread Multiprocessor (ST...
Multi-core processors are ubiquitous in all market segments from embedded to high performance comput...
In processors with several levels of hardware resource sharing, like CMPs in which each core is an S...
Loop scheduling has significant differences in multithreaded from other parallel processors. The sha...
The continuing launch of various multi-core processors popularizes parallel computing of gaining hig...
Thread level parallelism of applications is commonly exploited using multi-thread processors. In suc...
This paper describes a number of microarchitectural tech-niques for supporting multithreading in sof...
Abstract 1 This paper discusses the preliminary performance study of hybrid multithreaded execution ...
This paper evaluates new techniques to improve performance and efficiency of Chip MultiProcessors (C...
Processors in large-scale multiprocessors must be able to tolerate large communication latencies and...
Future integrated systems will contain billions of transistors, composing tens to hundreds of IP cor...