problem in a weighted automaton. This problem arises commonly in speech recognition applications when a ranked list of unique recognizer hypotheses is desired. We believe this is the first n-best algorithm to remove redundant hypotheses before rather than after the n-best determination. We give a detailed description of the algorithm and demonstrate its correctness. We report experimental results showing its efficiency and practicality even for large n in a 40; 000-word vocabulary North American Business News (NAB) task. In particular, we show that 1000-best generation in this task requires negligible added time over recognizer lattice generation
AbstractWe present a general algorithm, pre-determinization, that makes an arbitrary weighted transd...
. We consider a data mining problem in a large collection of unstructured texts based on association...
International audienceWe study the use of morphosyntactic knowledge to process N-best lists. We prop...
We show that a previously proposed algorithm for the N-best trees problem can be made more efficient...
Abstract. We investigate the problem of extracting the k best strings from a non-deterministic weigh...
In large vocabulary continuous speech recognition, high level linguistic knowledge can enhance perfo...
We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We show that W...
We propose a simple yet effective method for improving speech recognition by reranking the N-best sp...
N-best or lattice-based tokenization has been widely used in speech-related classification tasks. In...
International audienceWe present a new string matching algorithm optimal on average (with equiprobab...
The problem of finding the consensus (most probable string) for a distribution generated by a weight...
A repetition in a string of letters consists of exact concatenations of identical factors of the str...
Paper submited to IWPT 2011, 12th International Conference on Parsing Technologies, Dublin, Ireland,...
We survey the use of weighted nitestate transducers WFSTs in speech recognition We show that WFSTs...
In speech understanding systems, the interface between acoustic and linguistic modules is often repr...
AbstractWe present a general algorithm, pre-determinization, that makes an arbitrary weighted transd...
. We consider a data mining problem in a large collection of unstructured texts based on association...
International audienceWe study the use of morphosyntactic knowledge to process N-best lists. We prop...
We show that a previously proposed algorithm for the N-best trees problem can be made more efficient...
Abstract. We investigate the problem of extracting the k best strings from a non-deterministic weigh...
In large vocabulary continuous speech recognition, high level linguistic knowledge can enhance perfo...
We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We show that W...
We propose a simple yet effective method for improving speech recognition by reranking the N-best sp...
N-best or lattice-based tokenization has been widely used in speech-related classification tasks. In...
International audienceWe present a new string matching algorithm optimal on average (with equiprobab...
The problem of finding the consensus (most probable string) for a distribution generated by a weight...
A repetition in a string of letters consists of exact concatenations of identical factors of the str...
Paper submited to IWPT 2011, 12th International Conference on Parsing Technologies, Dublin, Ireland,...
We survey the use of weighted nitestate transducers WFSTs in speech recognition We show that WFSTs...
In speech understanding systems, the interface between acoustic and linguistic modules is often repr...
AbstractWe present a general algorithm, pre-determinization, that makes an arbitrary weighted transd...
. We consider a data mining problem in a large collection of unstructured texts based on association...
International audienceWe study the use of morphosyntactic knowledge to process N-best lists. We prop...