This article explores opportunities to mitigate the performance impact of the IOMMU, as used in the Linux kernel, on high-speed network traffic. We first characterize IOTLB behavior and its effects on recent Intel Xeon Scalable and AMD EPYC processors at 200 Gbps by analyzing the factors that contribute to IOTLB misses and cause throughput drops (up to 20% compared to the no-IOMMU case in our experiments). Second, we discuss and analyze possible mitigations, including the proposal and evaluation of a practical hugepage-aware memory allocator that lets network device drivers use hugepage IOTLB entries in the Linux kernel. Our evaluation shows that using hugepage-backed buffers can completely recover the throughput drop int...
The Linux kernel provides a high-performance networking subsystem by processing packets in soft-IRQ ...
The advent of high-speed networks may soon increase the network bandwidth available to workstation c...
Memory access is the major bottleneck in realizing multi-hundred-gigabit networks with commodity har...
The input/output memory management unit (IOMMU) was recently introduced into mainstream computer ar-...
The IOMMU allows the OS to encapsulate I/O devices in their own virtual memory spaces, thus restrict...
Raw data of the experiments performed for "Overcoming the IOTLB wall for multi-100-gbps Linux-based ...
The PC hardware architecture and commodity operating systems such as Linux or Microsoft Windows are...
The block I/O subsystem of the Linux kernel is one of the critical components affecting the performa...
Ethernet continues to be the most widely used network architecture today due to its low cost and bac...
IOMMUs are hardware devices that translate device DMA addresses to proper machine physical address...
This paper looks at the I/O bottleneck in operating systems, with particular focus on high-speed net...
As IO bandwidth continues to grow, processor speeds have stagnated. As such, the need to maximize th...
The topic of this thesis is the performance of computer networks. While network performance has gene...
The Linux Scalability Project is adapting Linux for use in enterprise-scale networking environments....
The performance of the protocol stack implementation of an operating system can greatly impact the p...