We investigate a distance metric, previously defined for measuring structured data, in the more general context of vector spaces. The metric is grounded in information theory and assesses the distance between two vectors in terms of their relative information content. Like Cosine Distance, it gives an outcome based on the dimensional correlation of the input vectors rather than their magnitude. In this paper we define the metric and assess it against Cosine Distance with respect to its major properties: semantics, properties relevant to similarity search, and evaluation efficiency. We find that it is fairly well correlated with Cosine Distance in dense spaces, but its semantics are in some cas...
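To illustrate what "based on dimensional correlation rather than magnitude" means for the baseline named above, here is a minimal sketch of Cosine Distance. It assumes the common definition 1 - cos(θ); other definitions (for example, angular distance) exist, and the abstract does not say which one is used. The function name and the sample vectors are illustrative only, and the sketch does not implement the information-theoretic metric itself.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cos(angle between u and v).

    Assumption: both vectors are non-zero and of equal length.
    """
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


# Illustrative vectors (not taken from the paper).
u = [1.0, 2.0, 3.0]
v = [3.0, 1.0, 0.5]
scaled_v = [10.0 * x for x in v]  # same direction as v, larger magnitude

d_original = cosine_distance(u, v)
d_scaled = cosine_distance(u, scaled_v)

# Scaling one input changes its magnitude but not the distance.
print(math.isclose(d_original, d_scaled))
```

Because the distance depends only on the angle between the vectors, rescaling either input leaves the result unchanged, which is the magnitude-independence the abstract refers to.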
Distance metrics are at the core of many processing and machine learning algorithms. In many...
Numerous methods of multivariate statistics and data mining suffer from the presence of outlying mea...
A wide range of machine learning and signal processing applications involve da...
The majority of work in similarity search focuses on the efficiency of threshold and nearest-neighbo...
There are many contexts where the definition of similarity in multivariate space requires to be base...
Distance covariance and distance correlation are scalar coefficients that characterize ind...
Data structures for similarity search are commonly evaluated on data in vector spaces, bu...
We propose a new class of metrics on sets, vectors, and functions that can be used in various stages...