In Spoken Language Understanding (SLU), the task is to extract important information from audio commands, such as the intent of what a user wants the system to do and special entities like locations or numbers. This paper presents a simple method for embedding intents and entities into Finite State Transducers which, in combination with a pretrained general-purpose Speech-to-Text model, allows building SLU models without any additional training. Building those models is very fast, taking only a few seconds, and the method is completely language independent. A comparison on different benchmarks shows that this method can outperform multiple other, more resource-demanding SLU approaches.
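To make the idea of embedding intents and entities into a finite state transducer more concrete, the following is a minimal sketch in plain Python. It is not the paper's implementation: the abstract does not describe the construction, and the paper's method works together with a Speech-to-Text model, whereas this toy only maps an already transcribed sentence to an intent and its entities. The class `ToyIntentFst`, its methods, the `[slot]` template syntax, and the example intents are all hypothetical names chosen for illustration, and only single-word entity values are handled.

```python
from typing import Dict, List, Optional, Tuple


class ToyIntentFst:
    """Toy transducer: input words on the arcs, optional 'slot=value' outputs.

    Hypothetical illustration only; not the construction used in the paper.
    """

    def __init__(self) -> None:
        # arcs[state][input_word] = (next_state, output_label or None)
        self.arcs: List[Dict[str, Tuple[int, Optional[str]]]] = [{}]
        # final[state] = intent emitted when parsing stops in that state
        self.final: Dict[int, str] = {}

    def _add_arcs(self, state: int, outputs: Dict[str, Optional[str]]) -> int:
        """Add one arc per word, all leading to a shared target state."""
        arcs = self.arcs[state]
        existing = [arcs[word][0] for word in outputs if word in arcs]
        if existing:
            target = existing[0]  # reuse a path shared with an earlier template
        else:
            self.arcs.append({})
            target = len(self.arcs) - 1
        for word, out in outputs.items():
            arcs.setdefault(word, (target, out))
        return target

    def add_template(self, intent: str, template: str,
                     entities: Dict[str, List[str]]) -> None:
        """Embed one sentence template; '[slot]' expands to the entity values."""
        state = 0
        for token in template.lower().split():
            if token.startswith("[") and token.endswith("]"):
                slot = token[1:-1]
                values = {v: f"{slot}={v}" for v in entities[slot]}
                state = self._add_arcs(state, values)
            else:
                state = self._add_arcs(state, {token: None})
        self.final[state] = intent

    def parse(self, transcript: str) -> Optional[Tuple[str, Dict[str, str]]]:
        """Return (intent, entities) if the transcript is accepted, else None."""
        state = 0
        slots: Dict[str, str] = {}
        for word in transcript.lower().split():
            arc = self.arcs[state].get(word)
            if arc is None:
                return None  # no matching path in the transducer
            state, out = arc
            if out is not None:
                name, value = out.split("=", 1)
                slots[name] = value
        intent = self.final.get(state)
        return None if intent is None else (intent, slots)


rooms = {"room": ["kitchen", "bedroom"]}
fst = ToyIntentFst()
fst.add_template("light_on", "turn on the light in the [room]", rooms)
fst.add_template("light_off", "turn off the light in the [room]", rooms)

print(fst.parse("turn on the light in the kitchen"))
# ('light_on', {'room': 'kitchen'})
print(fst.parse("play some music"))
# None
```

Because the transducer is built directly from templates and entity lists, constructing it needs no training data and no training step, which matches the spirit of the claims above; how the real method constrains or rescores the Speech-to-Text output is not covered by this sketch.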
We introduce Generative Spoken Language Modeling, the task of learning the aco...
In the past two decades there have been several projects on Spoken Language Understanding (SLU). I...
End-to-end spoken language understanding (SLU) predicts intent directly from a...
In speech recognition systems, language models (LMs) are often constructed by training and combining m...
End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single mo...
We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We show that W...
Spoken Language Understanding (SLU) is a core task in most human-machine interaction systems. With ...
Transcribing speech in properly formatted written language presents some challenges for automatic sp...
Spoken Language Understanding (SLU) is a core task in most human-machine inter...
Telephone services are now deployed that allow users to react to telephone prompts in spoken natura...
This paper describes a new approach to language model adaptation for speech recognition based on the...
Spoken dialog systems are slowly becoming an integral part of the human experience due to their var...
Voice Assistants such as Alexa, Siri, and Google Assistant typically use a two-stage Spoken Language...