In this work, we revisit the curse of dimensionality, especially the concentration of the norm phenomenon which is the inability of distance functions to separate points well in high dimensions. We study the influence of the different properties of a distance measure, viz., triangle inequality, boundedness and translation invariance and on this phenomenon. Our studies indicate that unbounded distance measures whose expectations do not exist are to be preferred. We propose some new distance measures based on our studies and present many experimental results which seem to confirm our analysis. In particular, we study these distance measures w.r.t. indices like relative variance and relative contrast and further compare and contrast these meas...
The attention in anomaly is difficult since they include important and actionable data in lots of do...
A-B: Influence of dimensionality D on distance-of-distances. A: Absolute shrinkage of Euclidean dist...
Nearest neighbor queries are important in many settings, including spatial databases (Find the k clo...
In this work, we revisit the curse of dimensionality, especially the concentration of the norm pheno...
Abstract. In recent years, the eect of the curse of high dimensionality has been studied in great de...
AbstractBeyer et al. gave a sufficient condition for the high dimensional phenomenon known as the co...
In recent years, the effect of the curse of high dimensionality has been studied in great detail on ...
Abstract Let X = (X1,...,Xd) be a R d-valued random vector with i.i.d. components, and let ‖X‖p = ( ...
The distribution of distances between points in a high-dimensional data set tends to look quite diff...
In data analysis, the use of a distance function is ubiquitous. There is an increased awareness abo...
Distances between data points are widely used in machine learning applications. Yet, when corrupted ...
In order to address high dimensional problems, a new ‘direction-aware’ metric is introduced in this ...
The metric search paradigm has been to this day successfully applied to several real-world problems,...
AbstractThe hubness phenomenon is a recently discovered aspect of the curse of dimensionality. Hub o...
19 pagesLet $\bX=(X_1, \hdots, X_d)$ be a $\mathbb R^d$-valued random vector with i.i.d.~components,...
The attention in anomaly is difficult since they include important and actionable data in lots of do...
A-B: Influence of dimensionality D on distance-of-distances. A: Absolute shrinkage of Euclidean dist...
Nearest neighbor queries are important in many settings, including spatial databases (Find the k clo...
In this work, we revisit the curse of dimensionality, especially the concentration of the norm pheno...
Abstract. In recent years, the eect of the curse of high dimensionality has been studied in great de...
AbstractBeyer et al. gave a sufficient condition for the high dimensional phenomenon known as the co...
In recent years, the effect of the curse of high dimensionality has been studied in great detail on ...
Abstract Let X = (X1,...,Xd) be a R d-valued random vector with i.i.d. components, and let ‖X‖p = ( ...
The distribution of distances between points in a high-dimensional data set tends to look quite diff...
In data analysis, the use of a distance function is ubiquitous. There is an increased awareness abo...
Distances between data points are widely used in machine learning applications. Yet, when corrupted ...
In order to address high dimensional problems, a new ‘direction-aware’ metric is introduced in this ...
The metric search paradigm has been to this day successfully applied to several real-world problems,...
AbstractThe hubness phenomenon is a recently discovered aspect of the curse of dimensionality. Hub o...
19 pagesLet $\bX=(X_1, \hdots, X_d)$ be a $\mathbb R^d$-valued random vector with i.i.d.~components,...
The attention in anomaly is difficult since they include important and actionable data in lots of do...
A-B: Influence of dimensionality D on distance-of-distances. A: Absolute shrinkage of Euclidean dist...
Nearest neighbor queries are important in many settings, including spatial databases (Find the k clo...