In this dissertation, we explore parallel algorithms for general N-Body problems in high dimensions, and their applications in machine learning and image analysis on distributed infrastructures. In the first part of this work, we proposed and developed a set of basic tools built on top of Message Passing Interface and OpenMP for massively parallel nearest neighbors search. In particular, we present a distributed tree structure to index data in arbitrary number of dimensions, and a novel algorithm that eliminate the need for collective coordinate exchanges during tree construction. To the best of our knowledge, our nearest neighbors package is the first attempt that scales to millions of cores in up to a thousand dimensions. Based on our...
Computing k-Nearest Neighbors (KNN) is one of the core kernels used in many machine lear...
We propose two solutions for both nearest neigh-bors and range search problems. For the nearest neig...
In parallel computing environments from multicore systems to cloud computers and supercomputers, dat...
For many computer vision and machine learning problems, large training sets are key for good perform...
For many computer vision and machine learning problems, large training sets are key for good perform...
For many computer vision and machine learning problems, large training sets are key for good perform...
This paper considers a parallel algorithm for Bayesian network structure learning from large data se...
This dissertation develops and studies fast algorithms for solving closest point problems. Algorithm...
Abstract. We present a fast algorithm for kernel summation problems in high-dimensions. These proble...
Exact Bayesian structure discovery in Bayesian networks requires exponential time and space. Using d...
This paper presents new parallel algorithms for generating Euclidean minimum spanning trees and spa...
General N-body problems are a set of problems in which an update to a single element in the system d...
To overcome the high computing cost associated with high-dimensional digital image descriptor matchi...
General N-body problems are a set of problems in which an update to a single element in the system d...
Similarity search problems in high-dimensional data arise in many areas of computer science such as ...
Computing k-Nearest Neighbors (KNN) is one of the core kernels used in many machine lear...
We propose two solutions for both nearest neigh-bors and range search problems. For the nearest neig...
In parallel computing environments from multicore systems to cloud computers and supercomputers, dat...
For many computer vision and machine learning problems, large training sets are key for good perform...
For many computer vision and machine learning problems, large training sets are key for good perform...
For many computer vision and machine learning problems, large training sets are key for good perform...
This paper considers a parallel algorithm for Bayesian network structure learning from large data se...
This dissertation develops and studies fast algorithms for solving closest point problems. Algorithm...
Abstract. We present a fast algorithm for kernel summation problems in high-dimensions. These proble...
Exact Bayesian structure discovery in Bayesian networks requires exponential time and space. Using d...
This paper presents new parallel algorithms for generating Euclidean minimum spanning trees and spa...
General N-body problems are a set of problems in which an update to a single element in the system d...
To overcome the high computing cost associated with high-dimensional digital image descriptor matchi...
General N-body problems are a set of problems in which an update to a single element in the system d...
Similarity search problems in high-dimensional data arise in many areas of computer science such as ...
Computing k-Nearest Neighbors (KNN) is one of the core kernels used in many machine lear...
We propose two solutions for both nearest neigh-bors and range search problems. For the nearest neig...
In parallel computing environments from multicore systems to cloud computers and supercomputers, dat...