Simultaneous speech translation (SimulST) is a challenging task aiming to translate streaming speech before the complete input is observed. A SimulST system generally includes two components: the pre-decision that aggregates the speech information and the policy that decides to read or write. While recent works had proposed various strategies to improve the pre-decision, they mainly adopt the fixed wait-k policy, leaving the adaptive policies rarely explored. This paper proposes to model the adaptive policy by adapting the Continuous Integrate-and-Fire (CIF). Compared with monotonic multihead attention (MMA), our method has the advantage of simpler computation, superior quality at low latency, and better generalization to long utterances. W...
In simultaneous speech translation (SimulST), effective policies that determine when to write partia...
End-to-end formulation of automatic speech recognition (ASR) and speech translation (ST) makes it ea...
The primary goal of this FBK's systems submission to the IWSLT 2022 offline and simultaneous speech ...
Simultaneous translation systems start producing the output while processing the partial source sent...
Simultaneous speech translation (SimulST) is the task in which output generation has to be performed...
End-to-end simultaneous speech translation (SimulST) outputs translation while receiving the streami...
Simultaneous machine translation systems rely on a policy to schedule read and write operations in o...
Speech-to-speech translation (S2ST) converts input speech to speech in another language. A challenge...
In simultaneous speech translation (SimulST), finding the best trade-off between high translation qu...
Simultaneous neural Machine Translation (SiMT) aims to maintain translation quality while minimizing...
In this paper, we describe our submission to the Simultaneous Speech Translation at IWSLT 2022. We e...
Speech translation is the task of translating speech in one language to text or speech in another la...
In this paper, we introduce our work of building a Streaming Multilingual Speech Model (SM2), which ...
This paper proposes a token-level serialized output training (t-SOT), a novel framework for streamin...
Transformer models using segment-based processing have been an effective architecture for simultaneo...
In simultaneous speech translation (SimulST), effective policies that determine when to write partia...
End-to-end formulation of automatic speech recognition (ASR) and speech translation (ST) makes it ea...
The primary goal of this FBK's systems submission to the IWSLT 2022 offline and simultaneous speech ...
Simultaneous translation systems start producing the output while processing the partial source sent...
Simultaneous speech translation (SimulST) is the task in which output generation has to be performed...
End-to-end simultaneous speech translation (SimulST) outputs translation while receiving the streami...
Simultaneous machine translation systems rely on a policy to schedule read and write operations in o...
Speech-to-speech translation (S2ST) converts input speech to speech in another language. A challenge...
In simultaneous speech translation (SimulST), finding the best trade-off between high translation qu...
Simultaneous neural Machine Translation (SiMT) aims to maintain translation quality while minimizing...
In this paper, we describe our submission to the Simultaneous Speech Translation at IWSLT 2022. We e...
Speech translation is the task of translating speech in one language to text or speech in another la...
In this paper, we introduce our work of building a Streaming Multilingual Speech Model (SM2), which ...
This paper proposes a token-level serialized output training (t-SOT), a novel framework for streamin...
Transformer models using segment-based processing have been an effective architecture for simultaneo...
In simultaneous speech translation (SimulST), effective policies that determine when to write partia...
End-to-end formulation of automatic speech recognition (ASR) and speech translation (ST) makes it ea...
The primary goal of this FBK's systems submission to the IWSLT 2022 offline and simultaneous speech ...