As IO bandwidth continues to grow, processor speeds have stagnated. As such, the need to maximize the utility of our CPU cycles for IO-intensive applications is constantly growing. In my dissertation, I investigate processing bottlenecks for IO-intensive applications. I find that small and large IO cause different bottlenecks. Small IO, often caused by remote procedure calls (RPCs), cause IO stack bottlenecks. Large IO, caused by serving content such as web pages, audio, and video, causes bottlenecks due to overhead in IO buffer copying. I aim to accelerate both types of IO request without modification to the application, allowing for easier development and faster deployment. To mitigate these bottlenecks, I propose that IO-intensive applic...
Importance of low-latency heterogeneous systems in today’s world is immeasurable, which was proven i...
Application performance on modern processors has become increasingly dictated by the use of on-chip ...
The speed of CPUs and memories has historically outstripped I/O, but emerging network and storage te...
Recent high-performance storage devices have exposed software inefficiencies in existing storage sta...
## About the experiments ### Acronyms used in the figures - BufferedReader (BR). - LineNumberReade...
This article presents the design, implementation, and evaluation of IO -Lite, a unified I/O bufferin...
Customized hardware accelerators have made it possible to meet increasing workload demands in cloud ...
The area of parallel and distributed computing has grown very fast in the past few decades with the ...
This article explores opportunities to mitigate the performance impact of IOMMU on high-speed networ...
To increase the scale and performance of scientific applications, scientists commonly distribute com...
We present a novel taxonomy that characterizes in a structured way the software and hardware tradeof...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
The compute capacity growth in high performance computing (HPC) systems is outperforming improvement...
The 2014 TOP500 supercomputer list includes over 40 deployed petascale systems, and the high perform...
This paper looks at the I/O bottleneck in operating systems, with particular focus on high-speed net...
Importance of low-latency heterogeneous systems in today’s world is immeasurable, which was proven i...
Application performance on modern processors has become increasingly dictated by the use of on-chip ...
The speed of CPUs and memories has historically outstripped I/O, but emerging network and storage te...
Recent high-performance storage devices have exposed software inefficiencies in existing storage sta...
## About the experiments ### Acronyms used in the figures - BufferedReader (BR). - LineNumberReade...
This article presents the design, implementation, and evaluation of IO -Lite, a unified I/O bufferin...
Customized hardware accelerators have made it possible to meet increasing workload demands in cloud ...
The area of parallel and distributed computing has grown very fast in the past few decades with the ...
This article explores opportunities to mitigate the performance impact of IOMMU on high-speed networ...
To increase the scale and performance of scientific applications, scientists commonly distribute com...
We present a novel taxonomy that characterizes in a structured way the software and hardware tradeof...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
The compute capacity growth in high performance computing (HPC) systems is outperforming improvement...
The 2014 TOP500 supercomputer list includes over 40 deployed petascale systems, and the high perform...
This paper looks at the I/O bottleneck in operating systems, with particular focus on high-speed net...
Importance of low-latency heterogeneous systems in today’s world is immeasurable, which was proven i...
Application performance on modern processors has become increasingly dictated by the use of on-chip ...
The speed of CPUs and memories has historically outstripped I/O, but emerging network and storage te...