Numerical measures of pattern dissimilarity are at the heart of pattern recognition and classification. Applications of pattern recognition grow more sophisticated every year, and consequently we require distance measures for patterns not easily expressible as feature vectors. Examples include strings, parse trees, time series, random spatial fields, and random graphs [79] [117]. Distance measures are not arbitrary. They can only be effective when they incorporate information about the problem domain; this is a direct consequence of the Ugly Duckling theorem [37]. This thesis poses the question: how can the principles of information theory and statistics guide us in constructing distance measures? In this thesis, I examine dista...
Statistical distances allow us to quantify the closeness between two statistical objects. Many dista...
Several probabilistic distance, information, uncertainty, and overlap measures are proposed and cons...
Labelled Markov chains (LMCs) are widely used in probabilistic verification, speech recognition, com...
We prove results on the decidability and complexity of computing the total variation distance (equiv...
Two different aspects of the problem of selecting measurements for statistical pattern recognition a...
We live in a probabilistic world---a world full of distributions from which we sample. Learning, evo...
Much like in other modeling disciplines does the distance metric used (a measure for dissimilarity) ...
The optimal distance measure for a given discrimination task under the nearest neighbor framework ha...
A distance measure in the appropriate space of stochastic processes can be used to measure the quali...
A definition of distance measure between structural descriptions, which is based on a probabilistic ...
The use of distance measures in Statistics is of fundamental importance in solving practical problem...
Real-world data typically contain a large number of features that are often heterogeneous in nature,...
At their core, many time series data mining algorithms can be reduced to reasoning about the shapes ...
We prove results on the decidability and complexity of computing the total variation distance (equiv...
We propose that pseudometric, a subadditive distance measure, has sufficient properties to be a good...
Statistical distances allow us to quantify the closeness between two statistical objects. Many dista...
Several probabilistic distance, information, uncertainty, and overlap measures are proposed and cons...
Labelled Markov chains (LMCs) are widely used in probabilistic verification, speech recognition, com...
We prove results on the decidability and complexity of computing the total variation distance (equiv...
Two different aspects of the problem of selecting measurements for statistical pattern recognition a...
We live in a probabilistic world---a world full of distributions from which we sample. Learning, evo...
Much like in other modeling disciplines does the distance metric used (a measure for dissimilarity) ...
The optimal distance measure for a given discrimination task under the nearest neighbor framework ha...
A distance measure in the appropriate space of stochastic processes can be used to measure the quali...
A definition of distance measure between structural descriptions, which is based on a probabilistic ...
The use of distance measures in Statistics is of fundamental importance in solving practical problem...
Real-world data typically contain a large number of features that are often heterogeneous in nature,...
At their core, many time series data mining algorithms can be reduced to reasoning about the shapes ...
We prove results on the decidability and complexity of computing the total variation distance (equiv...
We propose that pseudometric, a subadditive distance measure, has sufficient properties to be a good...
Statistical distances allow us to quantify the closeness between two statistical objects. Many dista...
Several probabilistic distance, information, uncertainty, and overlap measures are proposed and cons...
Labelled Markov chains (LMCs) are widely used in probabilistic verification, speech recognition, com...