Recent research on the TIMIT corpus suggests that longerlength acoustic units are better suited for modelling coarticulation and long-term temporal dependencies in speech than conventional context-dependent phone models. However, the impressive results achieved on TIMIT [1] are yet to be reproduced on other corpora, such as read speech from the Spoken Dutch Corpus. Differences between TIMIT and the Spoken Dutch Corpus data are analysed in an attempt to better understand in which conditions the use of longer-length units can be expected to result in considerable improvements in recognition accuracy. We conclude that at least part of the improvements found with TIMIT can be explained by details of the experimental procedure, and that longer-l...
A series of experiments was conducted to determine (1) the accuracy with which vowel segment duratio...
Studies from multiple disciplines show that spectro-temporal units of natural languages and human sp...
Automatic speech recognition (ASR) does not perform equally well on every speaker. There is bias aga...
Recent research on the TIMIT corpus suggests that longer-length acoustic units are better suited for...
Recent research on the TIMIT corpus suggests that longer-length acoustic models are more appropriat...
Recent research on the TIMIT database suggests that longerlength acoustic units are better suited fo...
Contains fulltext : 42083.pdf (author's version ) (Open Access)SPECOM 2005, 17 okt...
Inter-speaker variability, one of the problems faced in speech recognition system, has caused the pe...
Recent research suggests that modeling coarticulation in speech is more appropriate at the syllable ...
In general the aim of an automatic speech recognition system is to write down what is said. State of...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
Including information distributed over intervals of syllabic duration (100--250 ms) may greatly impr...
Abstract. Inter-speaker variability, one of the problems faced in speech recognition system, has cau...
Transforming an acoustic signal to words is the gold standard in automatic speech recognition. Whil...
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-...
A series of experiments was conducted to determine (1) the accuracy with which vowel segment duratio...
Studies from multiple disciplines show that spectro-temporal units of natural languages and human sp...
Automatic speech recognition (ASR) does not perform equally well on every speaker. There is bias aga...
Recent research on the TIMIT corpus suggests that longer-length acoustic units are better suited for...
Recent research on the TIMIT corpus suggests that longer-length acoustic models are more appropriat...
Recent research on the TIMIT database suggests that longerlength acoustic units are better suited fo...
Contains fulltext : 42083.pdf (author's version ) (Open Access)SPECOM 2005, 17 okt...
Inter-speaker variability, one of the problems faced in speech recognition system, has caused the pe...
Recent research suggests that modeling coarticulation in speech is more appropriate at the syllable ...
In general the aim of an automatic speech recognition system is to write down what is said. State of...
Generally speaking, the speaker-dependence of a speech recognition system stems from speaker-depende...
Including information distributed over intervals of syllabic duration (100--250 ms) may greatly impr...
Abstract. Inter-speaker variability, one of the problems faced in speech recognition system, has cau...
Transforming an acoustic signal to words is the gold standard in automatic speech recognition. Whil...
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-...
A series of experiments was conducted to determine (1) the accuracy with which vowel segment duratio...
Studies from multiple disciplines show that spectro-temporal units of natural languages and human sp...
Automatic speech recognition (ASR) does not perform equally well on every speaker. There is bias aga...