Abstract—Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest matches of a query in a high dimensional space is still a challenging task. This is because the effectiveness of many dissimilarity measures, that are based on a geometric model, such as `p-norm, decreases as the number of dimensions increases. In this paper, we examine how the data distribution can be exploited to measure dissimilarity between two instances and propose a new data dependent dissimilarity measure called ‘mp-dissimilarity’. Rather than relying on geometric distance, it measures the dissimilarity between two instances in each dimension as a probability mass in a region that encloses the two instances. It deems two insta...
In recent years, the effect of the curse of high dimensionality has been studied in great detail on ...
A pre-printMeasures of similarity (or dissimilarity) are a key ingredient to many machine learning a...
Dissimilarity measures, the basis of similarity-based retrieval, can be viewed as a distance and a s...
Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest ...
Nearest neighbor search is a core process in many data mining algorithms. Finding reliable closest m...
In image retrieval, an effective dissimilarity (or similarity) measure is required to retrieve the p...
In data mining, the task-specific performances of conventional distance-based similarity measures va...
The use of distance metrics such as the Euclidean or Manhattan distance for nearest neighbour algori...
Abstract— In image retrieval, an effective dissimilarity measure is required to retrieve the percept...
This paper introduces the first generic version of data dependent dissimilarity and shows that it pr...
Existing distance metric learning methods require optimisation to learn a feature space to transform...
Image retrieval and clustering are two important tools for analysing and organising images. Dissimil...
Nearest neighbour search is a simple technique widely used in Pattern Recognition tasks. When the d...
The development of analysis methods for categorical data begun in 90's decade, and it has been boomi...
Abstract. In recent years, the eect of the curse of high dimensionality has been studied in great de...
In recent years, the effect of the curse of high dimensionality has been studied in great detail on ...
A pre-printMeasures of similarity (or dissimilarity) are a key ingredient to many machine learning a...
Dissimilarity measures, the basis of similarity-based retrieval, can be viewed as a distance and a s...
Nearest neighbour search is a core process in many data mining algorithms. Finding reliable closest ...
Nearest neighbor search is a core process in many data mining algorithms. Finding reliable closest m...
In image retrieval, an effective dissimilarity (or similarity) measure is required to retrieve the p...
In data mining, the task-specific performances of conventional distance-based similarity measures va...
The use of distance metrics such as the Euclidean or Manhattan distance for nearest neighbour algori...
Abstract— In image retrieval, an effective dissimilarity measure is required to retrieve the percept...
This paper introduces the first generic version of data dependent dissimilarity and shows that it pr...
Existing distance metric learning methods require optimisation to learn a feature space to transform...
Image retrieval and clustering are two important tools for analysing and organising images. Dissimil...
Nearest neighbour search is a simple technique widely used in Pattern Recognition tasks. When the d...
The development of analysis methods for categorical data begun in 90's decade, and it has been boomi...
Abstract. In recent years, the eect of the curse of high dimensionality has been studied in great de...
In recent years, the effect of the curse of high dimensionality has been studied in great detail on ...
A pre-printMeasures of similarity (or dissimilarity) are a key ingredient to many machine learning a...
Dissimilarity measures, the basis of similarity-based retrieval, can be viewed as a distance and a s...