Large-scale iterative computations are common in many important data mining and machine learning algorithms. In most of these applications, individual iterations can be specified as MapReduce computations, leading to the Iterative MapReduce programming model for efficient execution of data-intensive iterative computations interoperably between HPC and cloud environments. The initial work on the Iterative MapReduce model [1] focuses on optimizing data flow and reducing data transfer between MapReduce iterations by caching invariant data in the local memory of compute nodes. We observe that a systematic approach to collective communication is essential in many iterative algorithms but is missing in the current model. Thus we generalize the i...
Clustering is a process of grouping objects that are similar among themselves but dissimilar to obje...
Huge datasets are becoming prevalent; even as researchers, we now routinely have to work with datas...
Large datasets (“Big Data”) are becoming ubiquitous because the potential value in deriving insight...
Large-scale iterative computations are common in many important data mining and machine learning alg...
Abstract—Large-scale iterative computations are common in many important data mining and machine lea...
Abstract — Computer vision is being revolutionized by the incredible volume of visual data available...
and data intensive MPI runtime as a layered Map-Collective architecture with Map-AllGather, Map-AllR...
Community detection, also named as graph clustering, is essential to various graph analysis applicat...
MapReduce is a software framework that allows certain kinds of parallelizable or distributable probl...
Cloud computing [1] offers new approaches for scientific computing that leverage the major commercia...
Clustering is defined as the process of grouping a set of objects in a way that objects in the same ...
In an attempt to increase the performance/cost ratio, large compute clusters are becoming heterogene...
The recent growing size of datasets requires scalability of data mining algorithms, such as clusteri...
K-means clustering plays a vital role in data mining. As an iterative computation, its performance w...
Poster presented at the 2012 Washington State University Academic Showcase. Identifying close-knit co...