A popular programming paradigm in the cloud, MapReduce is extensively studied and used for “big data” analysis. Unfortunately, many “big data” applications require capabilities beyond those originally intended by MapReduce, often forcing developers to write unnatural, non-obvious MapReduce programs that twist the underlying system to meet the requirements. In this paper, we focus on a class of “big data” applications that, in addition to MapReduce’s main data source, require selective access to one or more other data sources, e.g., various kinds of indices, knowledge bases, and external cloud services. We propose to extend MapReduce with EFind, an Efficient and Flexible index access solution, to better support this class of ap-...
We observe two important trends brought about by the evolution of the Internet in recent years. Firstly ...
Running multiple instances of the MapReduce framework concurrently in a multicluster system or datac...
MapReduce is a programming model to process a massive amount of data on cloud computing....
Several research works have focused on supporting index access in MapReduce systems. These works hav...
In this report we address the problem of data management in clouds for the MapReduc...
Searchable encryption provides encryption schemes allowing search on encrypted...
The MapReduce programming model, proposed by Google, offers a simple and effic...
In recent years, the problems of using generic storage (i.e., relational) techniques for very spe...
Cloud computing [1] offers new approaches for scientific computing that leverage the major commercia...
In Information Retrieval (IR), the efficient indexing of terabyte-scale and larger corpora is still ...
In the last decade, our ability to store data has grown at a greater rate than our ability to proces...
Cloud computing is becoming a trend in big data processing. Google created the MapReduce model ...
The emergence of big data has had a great impact on the traditional computing mode, the distributed ...
The utility computing model introduced by cloud computing combined with the rich set of ...
Hadoop MapReduce has evolved to...