As computing power increases exponentially, vast amount of data is created by many scientific re- search activities. However, the bandwidth for storing the data to disks and reading the data from disks has been improving at a much slower pace. These two trends produce an ever-widening data access gap. Our work brings together two distinct technologies to address this data access issue: indexing and in situ processing. From decades of database research literature, we know that indexing is an effective way to address the data access issue, particularly for accessing relatively small fraction of data records. As data sets increase in sizes, more and more analysts need to use selective data access, which makes indexing an even more important fo...
With the huge amount of data continuously accumulated and shared by individuals and organizations, i...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Modern hardware has the potential to play a central role in scalable data management systems. A real...
As computing power increases exponentially, vast amount of data is created by many scientific re- se...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
The ability to extract information from collected data has always driven science. Today.s large comp...
FastBit is a software tool for searching large read-only data sets. It organizes user data in a col...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
Abstract. FastBit is a software tool for searching large read-only datasets. It organizes user data ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
In this chapter, we explore ways to answer queries on large multi-dimensional data efficiently. Giv...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
Abstract. Tree based indexing structures like B-trees, B+-trees, Bitmap indexes and R-trees have bec...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
Graduation date: 2015Large databases and data warehouses are becoming prevalent for the storage and ...
With the huge amount of data continuously accumulated and shared by individuals and organizations, i...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Modern hardware has the potential to play a central role in scalable data management systems. A real...
As computing power increases exponentially, vast amount of data is created by many scientific re- se...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the- a...
The ability to extract information from collected data has always driven science. Today.s large comp...
FastBit is a software tool for searching large read-only data sets. It organizes user data in a col...
FastBit is a software tool for searching large read-only datasets. It organizes user data in a colum...
Abstract. FastBit is a software tool for searching large read-only datasets. It organizes user data ...
Modern scientific datasets present numerous data management and analysis challenges. State-of-the-ar...
In this chapter, we explore ways to answer queries on large multi-dimensional data efficiently. Giv...
In this paper, we describe a strategy of using compressed bitmap indices to speed up queries on bot...
Abstract. Tree based indexing structures like B-trees, B+-trees, Bitmap indexes and R-trees have bec...
Abstract—Scientific experiments and simulations produce mountains of data in file formats, such as H...
Graduation date: 2015Large databases and data warehouses are becoming prevalent for the storage and ...
With the huge amount of data continuously accumulated and shared by individuals and organizations, i...
Scientific experiments and large-scale simulations produce massive amounts of data. Many of these sc...
Modern hardware has the potential to play a central role in scalable data management systems. A real...