This paper addresses the impact of telephone transmission channels on automatic speech recognition (ASR) performance. A real-time simulation model is described and implemented, which allows impairments that are encountered in traditional as well as modern (mobile, IP-based) networks to be flexibly and efficiently generated. The model is based on input parameters which are known to telephone network planners; thus, it can be applied without measuring specific network characteristics. It can be used for an analytic assessment of the impact of channel impairments on ASR performance, for producing training material with defined transmission characteristics, or for testing spoken dialogue systems in realistic network environments. In the present...
Purpose: Automatic speech recognition (ASR) is commonly used to produce telephone captions to provid...
Modern telecommunication networks increasingly comprise trunks of packet-based transmission (e.g. Vo...
High quality automatic speech recognition (ASR) depends on the context of the speech. Cleanly record...
This paper addresses the transmission channel impact on human-to-human speech communication quality ...
Mobile communication presents a number of challenges to speech technology such as the limited resour...
In automatic speech recognition systems (ASRs), training is a critical phase to the system's success...
This paper presents an experimental study on the impact of telephone channels on the accuracy of aut...
A tool for simulating the acoustic conditions during the speech input to a recognition system and th...
The paper presents the problem of signal degradation in packet-based voice transmission and its infl...
This paper describes the specification, design and development phases of two widely used telepho...
High quality automatic speech recognition (ASR) depends on the context of the speech. For example, c...