In this work we examine the implications of building a single logical link out of multiple physical links. We use MultiEdge [12] to examine the throughput-CPU utiliza-tion tradeoffs and examine how overheads and performance scale with the number and speed of links. We use low-level instrumentation to understand associated overheads, we experiment with setups between 1 and 8 1-GBit/s links, and we contrast our results with a single 10-GBit/s link. We find that: (a) Our base protocol achieves up-to 65 % of the nominal aggregate throughput. (b) Replacing the in-terrupts with polling significantly impacts only the multiple link configurations, reaching 80 % of nominal throughput. (c) The impact of copying on CPU overhead is significant, and rem...
Recent developments in networking technology cause a growing interest in connecting local-area clust...
We evaluate the performance of a Fast Ethernet network configured with a single large switch, a sing...
Cluster interconnect performance is typically characterized by latency and throughput. However, not ...
Ethernet line rates are projected to reach 100 Gbits/s by as soon as 2010. While in principle suitab...
Parallel computing on clusters of workstations is receiving much attention from the research communi...
Today's data centers consist of several thousand PCs that provide massive amounts of computational p...
In this diploma thesis, the implementation of a networking protocol, which is adopted from a previou...
The performance evaluation of multiprocessor interconnects cannot be divorced from issues of traffic...
In the early years of parallel computing research, significant theoretical studies were done on inte...
This is an extension of a similar paper from Workshop on Communication Architecture for Clusters (CA...
This paper presents a performance model for Many-to-One type communications on a dedicated heterogen...
Abstract—Ethernet has been used for interconnection networks of high-performance computing (HPC) sys...
ABSTRACT-- Large production volume of the devices results in very low equipment cost based on Ethern...
In this diploma thesis, the implementation of a networking protocol, which is adopted from a previou...
This paper describes the basic concepts of our solution to improve the performance of Ethernet Commu...
Recent developments in networking technology cause a growing interest in connecting local-area clust...
We evaluate the performance of a Fast Ethernet network configured with a single large switch, a sing...
Cluster interconnect performance is typically characterized by latency and throughput. However, not ...
Ethernet line rates are projected to reach 100 Gbits/s by as soon as 2010. While in principle suitab...
Parallel computing on clusters of workstations is receiving much attention from the research communi...
Today's data centers consist of several thousand PCs that provide massive amounts of computational p...
In this diploma thesis, the implementation of a networking protocol, which is adopted from a previou...
The performance evaluation of multiprocessor interconnects cannot be divorced from issues of traffic...
In the early years of parallel computing research, significant theoretical studies were done on inte...
This is an extension of a similar paper from Workshop on Communication Architecture for Clusters (CA...
This paper presents a performance model for Many-to-One type communications on a dedicated heterogen...
Abstract—Ethernet has been used for interconnection networks of high-performance computing (HPC) sys...
ABSTRACT-- Large production volume of the devices results in very low equipment cost based on Ethern...
In this diploma thesis, the implementation of a networking protocol, which is adopted from a previou...
This paper describes the basic concepts of our solution to improve the performance of Ethernet Commu...
Recent developments in networking technology cause a growing interest in connecting local-area clust...
We evaluate the performance of a Fast Ethernet network configured with a single large switch, a sing...
Cluster interconnect performance is typically characterized by latency and throughput. However, not ...