Subtitles, in order to achieve their purpose of transmitting information, need to be easily readable. The segmentation of subtitles into phrases or linguistic units is key to their readability and comprehension. However, automatically segmenting a sentence into subtitles is a challenging task and data containing reliable human segmentation decisions are often scarce. In this paper, we leverage data with noisy segmentation from large subtitle corpora and combine them with smaller amounts of high-quality data in order to train models which perform automatic segmentation of a sentence into subtitles. We show that even a minimum amount of reliable data can lead to readable subtitles and that quality is more important than quantity for the task...
There is a discrepancy between professional subtitling guidelines and how they are implemented in re...
Speech translation (ST) has lately received growing interest for the generation of subtitles without...
Recently, with the development of Speech to Text, which converts voice to text, and machine translat...
Subtitles, in order to achieve their purpose of transmitting information, need to be easily readable...
Subtitles appear on screen as short pieces of text, segmented based on formal constraints (length) a...
Speech translation for subtitling (SubST) is the task of automatically translating speech data into ...
Growing needs in localising audiovisual content in multiple languages through subtitles call for the...
Understanding the way people watch subtitled films has become a central concern for subtitling resea...
[EN] Automatic segmentation of subtitles is a novel research field which has not been studied extens...
This paper describes the evaluation methodology followed to measure the impact of using a machine le...
Growing needs in translating multimedia content have resulted in Neural Machine Translation (NMT) ...
Psycho-linguistic literature shows that reading is so highly structured and formalized as to make it...
This study aims to shed light on the possible effects of line-breaks on film subtitle readability an...
Growing needs in localising multimedia content for global audiences have resulted in Neural Machine ...
There is a discrepancy between professional subtitling guidelines and how they are implemented in re...
There is a discrepancy between professional subtitling guidelines and how they are implemented in re...
Speech translation (ST) has lately received growing interest for the generation of subtitles without...
Recently, with the development of Speech to Text, which converts voice to text, and machine translat...
Subtitles, in order to achieve their purpose of transmitting information, need to be easily readable...
Subtitles appear on screen as short pieces of text, segmented based on formal constraints (length) a...
Speech translation for subtitling (SubST) is the task of automatically translating speech data into ...
Growing needs in localising audiovisual content in multiple languages through subtitles call for the...
Understanding the way people watch subtitled films has become a central concern for subtitling resea...
[EN] Automatic segmentation of subtitles is a novel research field which has not been studied extens...
This paper describes the evaluation methodology followed to measure the impact of using a machine le...
Growing needs in translating multimedia content have resulted in Neural Machine Translation (NMT) ...
Psycho-linguistic literature shows that reading is so highly structured and formalized as to make it...
This study aims to shed light on the possible effects of line-breaks on film subtitle readability an...
Growing needs in localising multimedia content for global audiences have resulted in Neural Machine ...
There is a discrepancy between professional subtitling guidelines and how they are implemented in re...
There is a discrepancy between professional subtitling guidelines and how they are implemented in re...
Speech translation (ST) has lately received growing interest for the generation of subtitles without...
Recently, with the development of Speech to Text, which converts voice to text, and machine translat...