The goal of this thesis is to explore various strategies for incorporating contextual information into a segment-based speech recognition system, while maintaining com-putational costs at a level acceptable for implementation in a real-time system. The latter is achieved by using context-independent models in the search, while context-dependent models are reserved for re-scoring the hypotheses proposed by the context-independent system. Within this framework, several types of context-dependent sub-word units were evaluated, including word-dependent, biphone, and triphone units. In each case, deleted interpolation was used to compensate for the lack of training data for the mod-els. Other types of context-dependent modeling, such as context-...
This paper describes continuous speech recognition incorporating the additional complement informati...
Performance of any continuous speech recognition system is dependent on the accuracy of its acoustic...
Speech technology has developed to levels equivalent with human parity through the use of deep neura...
Recently, we have developed a probabilistic framework for segment-based speech recognition that repr...
The incorporation of grammatical information into speech recognition systems is often used to increa...
This thesis explores the use of efficient acoustic modeling techniques to improve the performance of...
In this paper, we present a new approach to visual speech recognition which improves contextual mode...
Speech is at the core of human communication. Speaking and listing comes so natural to us that we do...
Context-dependent models for language units are essential in high-accuracy speech recognition. Howev...
Although the importance of contextual information in speech recognition has been acknowledged for a ...
International audienceWe developed an environment for speech understanding, based on a sto...
In this paper we show how speech recognition can be enhance by combining context representation with...
Language models (LMs) are often constructed by building com-ponent models on multiple text sources t...
Available from British Library Document Supply Centre-DSC:DXN047663 / BLDSC - British Library Docume...
The use of prior situational/contextual knowledge about a given task can significantly improve auto...
This paper describes continuous speech recognition incorporating the additional complement informati...
Performance of any continuous speech recognition system is dependent on the accuracy of its acoustic...
Speech technology has developed to levels equivalent with human parity through the use of deep neura...
Recently, we have developed a probabilistic framework for segment-based speech recognition that repr...
The incorporation of grammatical information into speech recognition systems is often used to increa...
This thesis explores the use of efficient acoustic modeling techniques to improve the performance of...
In this paper, we present a new approach to visual speech recognition which improves contextual mode...
Speech is at the core of human communication. Speaking and listing comes so natural to us that we do...
Context-dependent models for language units are essential in high-accuracy speech recognition. Howev...
Although the importance of contextual information in speech recognition has been acknowledged for a ...
International audienceWe developed an environment for speech understanding, based on a sto...
In this paper we show how speech recognition can be enhance by combining context representation with...
Language models (LMs) are often constructed by building com-ponent models on multiple text sources t...
Available from British Library Document Supply Centre-DSC:DXN047663 / BLDSC - British Library Docume...
The use of prior situational/contextual knowledge about a given task can significantly improve auto...
This paper describes continuous speech recognition incorporating the additional complement informati...
Performance of any continuous speech recognition system is dependent on the accuracy of its acoustic...
Speech technology has developed to levels equivalent with human parity through the use of deep neura...