The primary goal of FBK’s systems submission to the IWSLT 2022 offline and simultaneous speech translation tasks is to reduce model training costs without sacrificing translation quality. To this end, we first question the need for ASR pre-training, showing that it is not essential for achieving competitive results. Second, we focus on data filtering, showing that a simple method based on the ratio between source and target characters yields a quality improvement of 1 BLEU. Third, we compare different methods to reduce the detrimental effect of the audio segmentation mismatch between training data, which is manually segmented at sentence level, and inference data, which is automatically segmented. Towards the same goal of training cost reduction, we p...
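The character-ratio filtering mentioned above can be illustrated with a minimal sketch. It assumes that the source side is available as transcript text and that a pair is discarded when its source-to-target character ratio falls outside a fixed interval. The function names and the thresholds `min_ratio` and `max_ratio` are hypothetical choices for illustration, not the values or the exact procedure used in the submission.

```python
def char_ratio(source: str, target: str) -> float:
    """Return the source/target length ratio in characters (inf if target is empty)."""
    src_len = len(source.strip())
    tgt_len = len(target.strip())
    if tgt_len == 0:
        return float("inf")
    return src_len / tgt_len


def filter_by_char_ratio(pairs, min_ratio=0.5, max_ratio=2.0):
    """Keep (source, target) pairs whose character ratio lies in [min_ratio, max_ratio].

    The default thresholds are hypothetical, chosen only for this illustration.
    """
    return [
        (src, tgt)
        for src, tgt in pairs
        if min_ratio <= char_ratio(src, tgt) <= max_ratio
    ]


pairs = [
    ("Good morning everyone.", "Guten Morgen zusammen."),  # similar lengths: kept
    ("Yes.", "Ja, das ist eine sehr lange Antwort."),      # target far longer: dropped
    ("Thank you very much for coming today.", ""),         # empty target: dropped
]
kept = filter_by_char_ratio(pairs)
```

In practice the interval would have to be tuned per language pair, since typical length ratios differ across languages.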
Paper accepted to IWSLT 2021. This paper describes the submission to the IWSLT 2021 offline speech tra...
Data augmentation is a technique to generate new training data based on existing data. We evaluate t...
This paper describes the ON-TRAC Consortium translation systems developed for ...
This paper describes the submission to the IWSLT 2021 offline speech translation task by the UPC Mac...
This paper describes FBK’s participation in the IWSLT 2020 offline speech translation (ST) task. The...
This paper describes FBK’s submission to the end-to-end English-German speech translation task at IW...
When building state-of-the-art speech translation models, the need for large computational resources...
For automatic speech translation (AST), end-to-end approaches are outperformed by cascaded models th...
Self-supervised learning from raw speech has been proven beneficial to improve...
This paper describes the ON-TRAC Consortium translation systems developed for ...
In this paper, we describe our submission to the Simultaneous Speech Translation at IWSLT 2022. We e...
This paper describes FBK’s submission to the end-to-end speech translation (ST) task at IWSLT 2019. ...
This paper describes the University of Helsinki Language Technology group’s participation in the IWS...
In simultaneous speech translation (SimulST), finding the best trade-off between high translation qu...