In this paper we discuss our preliminary experience with implementing active messages and Split-C on a cluster of workstations connected via Scalable Coherent Interface (SCI). Our experiments were conducted on a cluster of UltraSparcs connected via SBus-2 SCI adapter cards and an SCI switch. We present performance measurements and discuss several avenues for optimizations. As our experiments show, SCI offers much lower latencies than other cluster of workstation architectures, including a Myrinet based cluster, and other parallel machines like the Meiko CS-2. The simple remote load and store transactions supported by SCI allow for a very low latency implementation of the data transfer. However, active messages additionally require a transfe...
A Local Area Network (LAN) can be built using the ANSI/IEEE Standard 1592-1992 Scalable Coherent Int...
A critical factor in the design of any parallel processing computer system is the interconnection ne...
Improving the performance of future computing systems will be based upon the ability of increasing t...
In this paper we discuss our experience with highperformance cluster computing using Scalable Cohere...
A cluster is a group of independent compute nodes which are coupled by an interconnection network. T...
Due to their excellent cost/performance ratio, clusters of PCs can be attractive high-performance co...
This paper describes a new interconnect standard, the Scalable Coherent Interface (SCI, ANSI/IEEE S...
SCI is based on unidirectional point-to-point links connected in rings. Separate rings can be connec...
The design challenge for large-scale multiprocessors is (1) to minimize communication overhead, (2) ...
The Scalable Coherent Interface (SCI) project (IEEE P1596) found a way to avoid the limits that are ...
Clusters of multiprocessors, or Clumps, promise to be the supercomputers of the future, but obtainin...
The design challenge for large-scale multiprocessors is (1) to minimize communication overhead, (2) ...
This paper provides experimental data and analysis to quantify both the peak and sustained performan...
This paper describes a novel methodology for implementing a common set of collective communication o...
Recently several network technologies which support user-level communication between processes using...
A Local Area Network (LAN) can be built using the ANSI/IEEE Standard 1592-1992 Scalable Coherent Int...
A critical factor in the design of any parallel processing computer system is the interconnection ne...
Improving the performance of future computing systems will be based upon the ability of increasing t...
In this paper we discuss our experience with highperformance cluster computing using Scalable Cohere...
A cluster is a group of independent compute nodes which are coupled by an interconnection network. T...
Due to their excellent cost/performance ratio, clusters of PCs can be attractive high-performance co...
This paper describes a new interconnect standard, the Scalable Coherent Interface (SCI, ANSI/IEEE S...
SCI is based on unidirectional point-to-point links connected in rings. Separate rings can be connec...
The design challenge for large-scale multiprocessors is (1) to minimize communication overhead, (2) ...
The Scalable Coherent Interface (SCI) project (IEEE P1596) found a way to avoid the limits that are ...
Clusters of multiprocessors, or Clumps, promise to be the supercomputers of the future, but obtainin...
The design challenge for large-scale multiprocessors is (1) to minimize communication overhead, (2) ...
This paper provides experimental data and analysis to quantify both the peak and sustained performan...
This paper describes a novel methodology for implementing a common set of collective communication o...
Recently several network technologies which support user-level communication between processes using...
A Local Area Network (LAN) can be built using the ANSI/IEEE Standard 1592-1992 Scalable Coherent Int...
A critical factor in the design of any parallel processing computer system is the interconnection ne...
Improving the performance of future computing systems will be based upon the ability of increasing t...