Over the past few years, self-supervised learned speech representations have emerged as fruitful replacements for conventional surface representations when solving Spoken Language Understanding (SLU) tasks. Simultaneously, multilingual models trained on massive textual data were introduced to encode language agnostic semantics. Recently, the SAMU-XLSR approach introduced a way to make profit from such textual models to enrich multilingual speech representations with language agnostic semantics. By aiming for better semantic extraction on a challenging Spoken Language Understanding task and in consideration with computation costs, this study investigates a specific in-domain semantic enrichment of the SAMU-XLSR model by specializing it on a ...
End-to-end formulation of automatic speech recognition (ASR) and speech translation (ST) makes it ea...
End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single mo...
International audienceThis work investigates speaker adaptation and transfer learning for spoken lan...
Spoken language understanding (SLU) topic has seen a lot of progress these last three years, with th...
We propose the SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Repre...
Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored for image an...
In the past two decades there have been several projects on Spoken Language Understanding (SLU). I...
Self-supervised learning (SSL) is at the origin of unprecedented improvements in many different doma...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
Self-supervised representation learning (SSRL) has improved the performance on downstream phoneme re...
We introduce the Universal Speech Model (USM), a single large model that performs automatic speech r...
International audienceThis work deals with spoken language understanding (SLU) systems in the scenar...
End-to-end formulation of automatic speech recognition (ASR) and speech translation (ST) makes it ea...
End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single mo...
International audienceThis work investigates speaker adaptation and transfer learning for spoken lan...
Spoken language understanding (SLU) topic has seen a lot of progress these last three years, with th...
We propose the SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Repre...
Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored for image an...
In the past two decades there have been several projects on Spoken Language Understanding (SLU). I...
Self-supervised learning (SSL) is at the origin of unprecedented improvements in many different doma...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
International audienceSpoken language understanding (SLU) topic has seen a lot of progress these las...
Self-supervised representation learning (SSRL) has improved the performance on downstream phoneme re...
We introduce the Universal Speech Model (USM), a single large model that performs automatic speech r...
International audienceThis work deals with spoken language understanding (SLU) systems in the scenar...
End-to-end formulation of automatic speech recognition (ASR) and speech translation (ST) makes it ea...
End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single mo...
International audienceThis work investigates speaker adaptation and transfer learning for spoken lan...