Abstract. This paper addresses lexical ambiguity with focus on a par-ticular problem known as accent prediction, in that given an accentless sequence, we need to restore correct accents. This can be modelled as a sequence classification problem for which variants of Markov chains can be applied. Although the state space is large (about the vocabulary size), it is highly constrained when conditioned on the data observation. We investigate the application of several methods, including Powered Product-of-N-grams, Structured Perceptron and Conditional Random Fields (CRFs). We empirically show in the Vietnamese case that these methods are fairly robust and efficient. The second-order CRFs achieve best results with about 94 % term accuracy
A central difficulty with automatic speech recognition is the temporally inaccurate nature of the sp...
Natural language processing is a useful processing technique of language data, such as text and spee...
This paper describes an unsupervised approach for natural language disambiguation, applicable to amb...
Note:This research is concerned with a Markov-model-based solution to the problem of lexical disambi...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
(HSM) for statistical modeling of sequence data. The HSM generalizes our previous proposal on struct...
Lexical ambiguity resolution is a pervasive problem in natural language processing. An important exa...
International audienceSequence labeling is concerned with processing an input data sequence and prod...
This chapter presents a statistical decision procedure for lexical ambiguity resolution in text-to-s...
We propose a new discriminative framework, namely Hidden Dynamic Conditional Random Fields (HD-CRFs)...
In natural language, several sequences of words are very frequent. A classical language model, like ...
This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm...
Some machine learning tasks have a complex output, rather than a real number or a class. Those outpu...
Automatic speech recognition has matured into a commercially successful technology, enabling voice-b...
Language identification is an important issue in many speech applica-tions. We address this problem ...
A central difficulty with automatic speech recognition is the temporally inaccurate nature of the sp...
Natural language processing is a useful processing technique of language data, such as text and spee...
This paper describes an unsupervised approach for natural language disambiguation, applicable to amb...
Note:This research is concerned with a Markov-model-based solution to the problem of lexical disambi...
Article dans revue scientifique avec comité de lecture.In natural language and especially in spontan...
(HSM) for statistical modeling of sequence data. The HSM generalizes our previous proposal on struct...
Lexical ambiguity resolution is a pervasive problem in natural language processing. An important exa...
International audienceSequence labeling is concerned with processing an input data sequence and prod...
This chapter presents a statistical decision procedure for lexical ambiguity resolution in text-to-s...
We propose a new discriminative framework, namely Hidden Dynamic Conditional Random Fields (HD-CRFs)...
In natural language, several sequences of words are very frequent. A classical language model, like ...
This paper presents a statistical decision procedure for lexical ambiguity resolution. The algorithm...
Some machine learning tasks have a complex output, rather than a real number or a class. Those outpu...
Automatic speech recognition has matured into a commercially successful technology, enabling voice-b...
Language identification is an important issue in many speech applica-tions. We address this problem ...
A central difficulty with automatic speech recognition is the temporally inaccurate nature of the sp...
Natural language processing is a useful processing technique of language data, such as text and spee...
This paper describes an unsupervised approach for natural language disambiguation, applicable to amb...