13 pagesMany pattern recognition algorithms are based on the nearest neighbour search and use the well known edit distance, for which the primitive edit costs are usually fixed in advance. In this article, we aim at learning an unbiased stochastic edit distance in the form of a finite-state transducer from a corpus of (input,output) pairs of strings. Contrary to the other standard methods, which generally use the Expectation Maximisation algorithm, our algorithm learns a transducer independently on the marginal probability distribution of the input strings. Such an unbiased way to proceed requires to optimise the parameters of a conditional transducer instead of a joint one. We apply our new model in the context of handwritten digit recogni...
38 pagesInternational audienceNowadays, there is a growing interest in machine learning and pattern ...
In a music recognition task, the classification of a new melody is often achieved by looking for the...
In a number of fields one is to compare a witness string with a distribution. One possibility is to ...
13 pagesMany pattern recognition algorithms are based on the nearest neighbour search and use the we...
pages 240-252International audienceMany real-world applications such as spell-checking or DNA analys...
In many applications, it is necessary to determine the similarity of two strings. A widely-used noti...
International audienceDuring the past few years, several works have been done to derive string kerne...
Abstract—In many applications, it is necessary to determine the similarity of two strings. A widely-...
String similarity is most often measured by weighted or unweighted edit distance d(x, y). Ristad and...
pages 42-53International audienceTrees provide a suited structural representation to deal with compl...
International audienceLearning the parameters of the edit distance has been increasingly studied dur...
In a number of fields, it is necessary to compare a witness string with a distribution. One possibil...
International audienceIn a music recognition task, the classification of a new melody is often achiev...
In computer science, a lot of applications use distances. In the context of structured data, strings...
38 pagesInternational audienceNowadays, there is a growing interest in machine learning and pattern ...
In a music recognition task, the classification of a new melody is often achieved by looking for the...
In a number of fields one is to compare a witness string with a distribution. One possibility is to ...
13 pagesMany pattern recognition algorithms are based on the nearest neighbour search and use the we...
pages 240-252International audienceMany real-world applications such as spell-checking or DNA analys...
In many applications, it is necessary to determine the similarity of two strings. A widely-used noti...
International audienceDuring the past few years, several works have been done to derive string kerne...
Abstract—In many applications, it is necessary to determine the similarity of two strings. A widely-...
String similarity is most often measured by weighted or unweighted edit distance d(x, y). Ristad and...
pages 42-53International audienceTrees provide a suited structural representation to deal with compl...
International audienceLearning the parameters of the edit distance has been increasingly studied dur...
In a number of fields, it is necessary to compare a witness string with a distribution. One possibil...
International audienceIn a music recognition task, the classification of a new melody is often achiev...
In computer science, a lot of applications use distances. In the context of structured data, strings...
38 pagesInternational audienceNowadays, there is a growing interest in machine learning and pattern ...
In a music recognition task, the classification of a new melody is often achieved by looking for the...
In a number of fields one is to compare a witness string with a distribution. One possibility is to ...