Overlay NoCs, such as Hoplite, are cheap to implement on an FPGA but provide no bounds on worst-case routing latency of packets traversing the NoC due to deflection routing. In this paper, we show how to adapt Hoplite to enable calculation of precise upper bounds on routing latency by modifying the routing function to prioritize deflections, and by regulating the injection of packets to meet certain throughput and burstiness constraints. We provide an analytical model for computing end-to-end latency in the form of (1) in-flight time in the network $T^f$, and (2) waiting time at the source node $T^s$. To bound in-flight time in an $m \times m$ NoC, we modify the routing function and switching crossbar richness in the Hoplite router to del...
Bridge-local latency computation is often regarded with caution, as historic efforts with the Credit...
Real-time applications such as multimedia and gaming require stringent performance guarantees, usual...
The Kalray MPPA2-256 processor integrates 256 processing cores and 32 management cores on a chip. Th...
In order to communicate, cores of a multi-core platform traditionally relied on shared busses. Howev...
With the increasing number of computation nodes integrated in multi and many-core platforms, network...
This article was presented in part at the International Conference on Embedded Software 2020 and app...
The recent line of Versal FPGA devices from Xilinx Inc. includes a hard Network-On-Chip (NoC) embedd...
Real-time (RT) communication support is a critical requirement for many complex embedded application...
We can prove occupancy bounds of stall-free FIFOs used in deflection-free, low-cost, and high-speed ...
This paper considers the use of routerless networks-on-chip as an alternative on-chip interconnect f...
Nowadays available multiprocessor platforms predominantly use a network-on-chip (NoC) architecture a...
Many classes of applications require Quality of Service (QoS) guarantees from the system interconnec...
In overlay networks, both local and long-distance links appear as a single hop to a routing protocol...
Computing performance needs in domains such as automotive, avionics, railway, and space are on the ...
Wormhole-based mesh Networks-on-Chip (wNoC) are deployed in high-performance many-core processors du...
Bridge-local latency computation is often regarded with caution, as historic efforts with the Credit...
Real-time applications such as multimedia and gaming require stringent performance guarantees, usual...
The Kalray MPPA2-256 processor integrates 256 processing cores and 32 management cores on a chip. Th...
In order to communicate, cores of a multi-core platform traditionally relied on shared busses. Howev...
With the increasing number of computation nodes integrated in multi and many-core platforms, network...
This article was presented in part at the International Conference on Embedded Software 2020 and app...
The recent line of Versal FPGA devices from Xilinx Inc. includes a hard Network-On-Chip (NoC) embedd...
Real-time (RT) communication support is a critical requirement for many complex embedded application...
We can prove occupancy bounds of stall-free FIFOs used in deflection-free, low-cost, and high-speed ...
This paper considers the use of routerless networks-on-chip as an alternative on-chip interconnect f...
Nowadays available multiprocessor platforms predominantly use a network-on-chip (NoC) architecture a...
Many classes of applications require Quality of Service (QoS) guarantees from the system interconnec...
In overlay networks, both local and long-distance links appear as a single hop to a routing protocol...
Computing performance needs in domains such as automotive, avionics, railway, and space are on the ...
Wormhole-based mesh Networks-on-Chip (wNoC) are deployed in high-performance many-core processors du...
Bridge-local latency computation is often regarded with caution, as historic efforts with the Credit...
Real-time applications such as multimedia and gaming require stringent performance guarantees, usual...
The Kalray MPPA2-256 processor integrates 256 processing cores and 32 management cores on a chip. Th...