As applications move toward peta- and exascale data sets, it has become increasingly important to develop more efficient data retrieval and storage mechanisms that reduce network traffic and server load and minimize user-perceived retrieval delays. We propose an Intelligent Caching technique and a Graph Summarization technique to achieve low-latency data retrieval for big-data applications. Our caching approach is built on top of HDFS to optimize its read latency. HDFS is primarily suited to Write Once Read Many (WORM) applications, where reads significantly outnumber writes. In our Intelligent Caching approach, we analyze real-world MapReduce traces from Fa...
© 2017 IEEE. Large-scale applications implemented in today's high performance graph frameworks heavi...
Graph processing is experiencing a surge of renewed interest as applications in social networks and ...
The Hadoop Distributed File System (HDFS) is a network file system used to support multiple widely-u...
As applications are moving towards peta and exascale data sets, it has become increasingly important...
Approximately 4 billion people have access to the Internet; additionally, 23 billion devices are conn...
The explosion of big data poses a serious problem to the efficient retrieval and management of infor...
We have examined the tradeoffs in applying regular and Compressed Bloom filters to the name query pr...
Caching can effectively reduce the cost of serving content and improve the use...
Big data processing systems are becoming increasingly present in cloud workloads. Consequently,...
For peer-to-peer web search engines it is important to quickly process queries and return search res...
The use of computational platforms such as Hadoop and Spark is growing rapidly as a successful parad...
Summarization: In the last decade, data processing systems started using main memory as much as poss...
Low-latency, high-throughput systems for serving interactive queries are crucial to today's web serv...
Large scientific collaborations often have multiple scientists accessing the same set of files while...
The World-Wide Web continues its remarkable and seemingly unregulated growth. This growth has seen a...