Segmentation methods are an essential part of the simultaneous machine translation process because, in the ideal case, they split the input into chunks whose translation is independent of any forthcoming context. Furthermore, the optimal splitting should also ensure that the segments with the previous characterization have minimal lengths. However, there is still no agreement about the rules that should produce such an optimal splitting. Therefore, we started with the annotation of the ESIC dataset by simulating a perfect human interpreter with an infinite amount of time and resources. Then we proposed multiple segmentation methods that we compared to each other in terms of segments' lengths, counts, and statistics of the most frequently sp...
The authors of (Cho et al., 2014a) have shown that the recently introduced neural network translatio...
International audienceIn this article we deal with the text segmentation problem in statistical lang...
Simultaneous neural Machine Translation (SiMT) aims to maintain translation quality while minimizing...
Segmentation of the incoming speech stream and translating segments incrementally is a commonly used...
End-to-end simultaneous speech translation (SimulST) outputs translation while receiving the streami...
We propose a method to split and translate input sentences for speech translation in order to overco...
Recent studies on direct speech translation show continuous improvements by means of data augmentati...
Speech segmentation is the problem of finding the end points of a speech utterance for passing to an...
We tackle the previously unaddressed problem of unsupervised determination of the optimal morphologi...
International audienceBoosted by the simultaneous translation shared task at IWSLT 2020, promising e...
Machine translation of spoken language is a challenging task that involves several natural language ...
This paper presents a novel automatic sentence segmentation method for evaluating machine translatio...
This paper proposes an input-splitting method for translating spoken-language which includes many lo...
Speech segmentation, which splits long speech into short segments, is essential for speech translati...
Direct speech-to-text translation (ST) models are usually trained on corpora segmented at sentence l...
The authors of (Cho et al., 2014a) have shown that the recently introduced neural network translatio...
International audienceIn this article we deal with the text segmentation problem in statistical lang...
Simultaneous neural Machine Translation (SiMT) aims to maintain translation quality while minimizing...
Segmentation of the incoming speech stream and translating segments incrementally is a commonly used...
End-to-end simultaneous speech translation (SimulST) outputs translation while receiving the streami...
We propose a method to split and translate input sentences for speech translation in order to overco...
Recent studies on direct speech translation show continuous improvements by means of data augmentati...
Speech segmentation is the problem of finding the end points of a speech utterance for passing to an...
We tackle the previously unaddressed problem of unsupervised determination of the optimal morphologi...
International audienceBoosted by the simultaneous translation shared task at IWSLT 2020, promising e...
Machine translation of spoken language is a challenging task that involves several natural language ...
This paper presents a novel automatic sentence segmentation method for evaluating machine translatio...
This paper proposes an input-splitting method for translating spoken-language which includes many lo...
Speech segmentation, which splits long speech into short segments, is essential for speech translati...
Direct speech-to-text translation (ST) models are usually trained on corpora segmented at sentence l...
The authors of (Cho et al., 2014a) have shown that the recently introduced neural network translatio...
International audienceIn this article we deal with the text segmentation problem in statistical lang...
Simultaneous neural Machine Translation (SiMT) aims to maintain translation quality while minimizing...