As technology advances, the number of cores in Chip MultiProcessor systems and MultiProcessor Systems-on-Chips keeps increasing. The network must provide sustained throughput and ultra-low latencies. In this paper we propose new pipelined switch designs focused in reducing the switch latency. We identify the switch components that limit the switch frequency: the arbiter. Then, we simplify the arbiter logic by using multiple smaller arbiters, but increasing greatly the switch area. To solve this problem, a second design is presented where the routing traversal and arbitrations tasks are mixed. Results demonstrate a switch latency reduction ranging from 10 % to 21%. Network latency is reduced in a range from 11 % to 15%. I
High-speed and low-power routers form the basic building blocks of on-die interconnect fabrics that ...
Current interconnect standards providing hardware support for quality of service (QoS) consider up t...
The Clos-network is widely recognized as a scalable architecture for high-performance switches and r...
[EN] As technology advances, the number of cores in Chip MultiProcessor systems and MultiProcessor S...
Abstract—Large systems-on-chip (SoCs) and chip multiprocessors (CMPs), incorporating tens to hundred...
High performance computer and data-centers require PetaFlop/s processing speed and Petabyte storage ...
Power density and cooling issues are limiting the performance of high performance chip multiprocesso...
Abstract — With the increasing complexity of system-on-chip, Networks on Chip (NoC) of multi-hop swi...
A comparison is made among a large number of designs for the purpose of specifying low-cost yet cost...
The implementation of a low-latency optical packet switch architecture that is controllable while sc...
Abstract. Whereas efcient barrier implementations were once a concern only in high-performance compu...
In this paper we present an efficient technique to reduce the switching activity in a CMOS combinati...
Switch design for interconnection networks plays an important role in the overall performance of mul...
The on-chip communication requirements of many systems are best served through the deployment of a r...
A simple distributed, modular architecture for a very large scale ATM switch is proposed in this pap...
High-speed and low-power routers form the basic building blocks of on-die interconnect fabrics that ...
Current interconnect standards providing hardware support for quality of service (QoS) consider up t...
The Clos-network is widely recognized as a scalable architecture for high-performance switches and r...
[EN] As technology advances, the number of cores in Chip MultiProcessor systems and MultiProcessor S...
Abstract—Large systems-on-chip (SoCs) and chip multiprocessors (CMPs), incorporating tens to hundred...
High performance computer and data-centers require PetaFlop/s processing speed and Petabyte storage ...
Power density and cooling issues are limiting the performance of high performance chip multiprocesso...
Abstract — With the increasing complexity of system-on-chip, Networks on Chip (NoC) of multi-hop swi...
A comparison is made among a large number of designs for the purpose of specifying low-cost yet cost...
The implementation of a low-latency optical packet switch architecture that is controllable while sc...
Abstract. Whereas efcient barrier implementations were once a concern only in high-performance compu...
In this paper we present an efficient technique to reduce the switching activity in a CMOS combinati...
Switch design for interconnection networks plays an important role in the overall performance of mul...
The on-chip communication requirements of many systems are best served through the deployment of a r...
A simple distributed, modular architecture for a very large scale ATM switch is proposed in this pap...
High-speed and low-power routers form the basic building blocks of on-die interconnect fabrics that ...
Current interconnect standards providing hardware support for quality of service (QoS) consider up t...
The Clos-network is widely recognized as a scalable architecture for high-performance switches and r...