The Tera Multithreaded Architecture (MTA) is a radical new architecture intended to revolutionize high-performance computing in both the scientific and commercial marketplaces. Each processor supports 128 threads in hardware. Extremely fast thread switching is used to mask latency in a uniform-access memory system without caching. It is claimed that these hardware characteristics allow compilers to easily transform sequential programs into efficient multithreaded programs for the Tera MTA. In this paper, we attempt to provide an objective initial evaluation of the performance of the Tera multithreaded architecture and programming system for general-purpose applications. The basis of our investigation is two programs from the C3I Parallel Be...
Coarse grained threading, such as in the SPMD model of parallel programming, has been shown to be an...
Multithreaded processors, having hardware support for the concurrent execution of fine-grained thre...
Multithreaded processors, having hardware support for the concurrent execution of fine-grained thre...
The Tera Multithreaded Architecture (MTA) is a radical new architecture intended to revolutionize hi...
The Tera Multithreaded Architecture (MTA) is a new parallel supercomputer currently being installed ...
Tera Computer and Sandia National Laboratories have completed a CRADA, which examined the Tera Multi...
The difficulty of programming parallel computers has impeded their wide-spread use. The problems are...
The success of parallel computing in solving real-life computationally-intensive problems relies on ...
Present-day parallel computers often face the problems of large software overheads for process switc...
Many enhancements have been made to the traditional general purpose load-store computer architecture...
The two key factors affecting the performance of tera-scale computations are the parallel efficiency...
The Hewlett-Packard X- and V-Class ccNUMA systems appear well suited to exploiting coarse and fine-g...
Commercial multicore central processing units (CPU) integrate a number of processor cores on a singl...
The Hewlett-Packard X- and V-Class ccNUMA systems appear well suited to exploiting coarse and fine-g...
We present a new scheme for evaluating the performance of multithreaded computers and demonstrate it...
Coarse grained threading, such as in the SPMD model of parallel programming, has been shown to be an...
Multithreaded processors, having hardware support for the concurrent execution of fine-grained thre...
Multithreaded processors, having hardware support for the concurrent execution of fine-grained thre...
The Tera Multithreaded Architecture (MTA) is a radical new architecture intended to revolutionize hi...
The Tera Multithreaded Architecture (MTA) is a new parallel supercomputer currently being installed ...
Tera Computer and Sandia National Laboratories have completed a CRADA, which examined the Tera Multi...
The difficulty of programming parallel computers has impeded their wide-spread use. The problems are...
The success of parallel computing in solving real-life computationally-intensive problems relies on ...
Present-day parallel computers often face the problems of large software overheads for process switc...
Many enhancements have been made to the traditional general purpose load-store computer architecture...
The two key factors affecting the performance of tera-scale computations are the parallel efficiency...
The Hewlett-Packard X- and V-Class ccNUMA systems appear well suited to exploiting coarse and fine-g...
Commercial multicore central processing units (CPU) integrate a number of processor cores on a singl...
The Hewlett-Packard X- and V-Class ccNUMA systems appear well suited to exploiting coarse and fine-g...
We present a new scheme for evaluating the performance of multithreaded computers and demonstrate it...
Coarse grained threading, such as in the SPMD model of parallel programming, has been shown to be an...
Multithreaded processors, having hardware support for the concurrent execution of fine-grained thre...
Multithreaded processors, having hardware support for the concurrent execution of fine-grained thre...