Recent high-performance storage devices have exposed software inefficiencies in existing storage stacks, leading to a new breed of I/O stacks. The newest storage API of the Linux kernel is io-uring. We perform one of the first in-depth studies of io-uring, and compare its performance and dis-/advantages with the established libaio and SPDK APIs. Our key findings reveal that (i) polling design significantly impacts performance; (ii) with enough CPU cores io-uring can deliver performance close to that of SPDK; and (iii) performance scalability over multiple CPU cores and devices requires careful consideration and necessitates a hybrid approach. Last, we provide design guidelines for developers of storage intensive applications
This article presents the design, implementation, and evaluation of IO -Lite, a unified I/O bufferin...
Task-based programming models are excellent tools to parallelize and seamlessly load balance an appl...
Abstract K42 is an open-source research kernel targeted for 64-bit cache-coherent multiprocessor sys...
Recent high-performance storage devices have exposed software inefficiencies in existing storage sta...
As IO bandwidth continues to grow, processor speeds have stagnated. As such, the need to maximize th...
Traditionally storage has not been part of a programming model’s semantics and is added only as an I...
Two key changes are driving an immediate need for deeper understanding of I/O workloads in high-perf...
System software overheads in the I/O path, including VFS and file system code, become more pronounce...
System software overheads in the I/O path, including VFS and file system code, become more pronounce...
Achieving predictable performance in shared cloud storage services is hard. Tenants want reservation...
Abstract — The importance of a well tuned I/O subsystem is beyond doubt. In many cases, I/O may be t...
A critical but often ignored component of system performance is the I/O system. Today's applications...
Abstract. I/O intensive applications have posed great challenges to computational scientists. A majo...
## About the experiments ### Acronyms used in the figures - BufferedReader (BR). - LineNumberReade...
Parallel computers are increasingly being used to run large-scale applications that also have huge I...
This article presents the design, implementation, and evaluation of IO -Lite, a unified I/O bufferin...
Task-based programming models are excellent tools to parallelize and seamlessly load balance an appl...
Abstract K42 is an open-source research kernel targeted for 64-bit cache-coherent multiprocessor sys...
Recent high-performance storage devices have exposed software inefficiencies in existing storage sta...
As IO bandwidth continues to grow, processor speeds have stagnated. As such, the need to maximize th...
Traditionally storage has not been part of a programming model’s semantics and is added only as an I...
Two key changes are driving an immediate need for deeper understanding of I/O workloads in high-perf...
System software overheads in the I/O path, including VFS and file system code, become more pronounce...
System software overheads in the I/O path, including VFS and file system code, become more pronounce...
Achieving predictable performance in shared cloud storage services is hard. Tenants want reservation...
Abstract — The importance of a well tuned I/O subsystem is beyond doubt. In many cases, I/O may be t...
A critical but often ignored component of system performance is the I/O system. Today's applications...
Abstract. I/O intensive applications have posed great challenges to computational scientists. A majo...
## About the experiments ### Acronyms used in the figures - BufferedReader (BR). - LineNumberReade...
Parallel computers are increasingly being used to run large-scale applications that also have huge I...
This article presents the design, implementation, and evaluation of IO -Lite, a unified I/O bufferin...
Task-based programming models are excellent tools to parallelize and seamlessly load balance an appl...
Abstract K42 is an open-source research kernel targeted for 64-bit cache-coherent multiprocessor sys...