We present Stochastic Neighbor Compression (SNC), an algorithm to compress a dataset for the purpose of k-nearest neighbor (kNN) clas-sification. Given training data, SNC learns a much smaller synthetic data set, that minimizes the stochastic 1-nearest neighbor classification error on the training data. This approach has sev-eral appealing properties: due to its small size, the compressed set speeds up kNN testing dras-tically (up to several orders of magnitude, in our experiments); it makes the kNN classifier sub-stantially more robust to label noise; on 4 of 7 data sets it yields lower test error than kNN on the entire training set, even at compression ra-tios as low as 2%; finally, the SNC compression leads to impressive speed ups over k...
Nonparametric classification models, such as K-Nearest Neighbor (KNN), have become particularly powe...
k nearest neighbor (kNN) is an effective and powerful lazy learning algorithm, notwithstanding its e...
We present the first sample compression algorithm for nearest neighbors with non-trivial performance...
The kNN (k-nearest neighbors) classification algorithm is one of the most widely used non-parametric...
This paper introduces a new local asymmetric weighting scheme for the nearest neighbor classificatio...
Abstract. This paper proposes SV-kNNC, a new algorithm for k-Nearest Neighbor (kNN). This algorithm ...
A local distance measure for the nearest neighbor classification rule is shown to achieve high comp...
In this paper, we propose a coarse to fine K nearest neighbor (KNN) classifier (CFKNNC). CFKNNC diff...
The k-nearest neighbors (kNN) classifier predicts a class of a query, q, by taking the majority clas...
International audienceThe unbounded and multidimensional nature, the evolution of data distributions...
Data visualization has always been a necessity. That is why the dimension reduction field is an impo...
In the k-nearest neighbor algorithm (k-NN), the determination of classes for test instances is usual...
Stochastic Neighbor Embedding (SNE) and variants like t-distributed SNE are popular methods of unsup...
Representative data in terms of a set of selected samples is of interest for various machine learnin...
In this paper, a novel prototype reduction algorithm is proposed, which aims at reducing the storage...
Nonparametric classification models, such as K-Nearest Neighbor (KNN), have become particularly powe...
k nearest neighbor (kNN) is an effective and powerful lazy learning algorithm, notwithstanding its e...
We present the first sample compression algorithm for nearest neighbors with non-trivial performance...
The kNN (k-nearest neighbors) classification algorithm is one of the most widely used non-parametric...
This paper introduces a new local asymmetric weighting scheme for the nearest neighbor classificatio...
Abstract. This paper proposes SV-kNNC, a new algorithm for k-Nearest Neighbor (kNN). This algorithm ...
A local distance measure for the nearest neighbor classification rule is shown to achieve high comp...
In this paper, we propose a coarse to fine K nearest neighbor (KNN) classifier (CFKNNC). CFKNNC diff...
The k-nearest neighbors (kNN) classifier predicts a class of a query, q, by taking the majority clas...
International audienceThe unbounded and multidimensional nature, the evolution of data distributions...
Data visualization has always been a necessity. That is why the dimension reduction field is an impo...
In the k-nearest neighbor algorithm (k-NN), the determination of classes for test instances is usual...
Stochastic Neighbor Embedding (SNE) and variants like t-distributed SNE are popular methods of unsup...
Representative data in terms of a set of selected samples is of interest for various machine learnin...
In this paper, a novel prototype reduction algorithm is proposed, which aims at reducing the storage...
Nonparametric classification models, such as K-Nearest Neighbor (KNN), have become particularly powe...
k nearest neighbor (kNN) is an effective and powerful lazy learning algorithm, notwithstanding its e...
We present the first sample compression algorithm for nearest neighbors with non-trivial performance...