To transmit information effectively, subtitles need to be easily readable. Segmenting subtitles into phrases or linguistic units is key to their readability and comprehension. However, automatically segmenting a sentence into subtitles is challenging, and data containing reliable human segmentation decisions are often scarce. In this paper, we combine noisily segmented data from large subtitle corpora with smaller amounts of high-quality data to train models that automatically segment a sentence into subtitles. We show that even a minimal amount of reliable data can lead to readable subtitles and that quality is more important than quantity for the task ...
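To make the task concrete, the following minimal sketch shows a naive length-only baseline for segmenting a sentence into subtitles. It is not the method of the paper, which trains models on segmented data. The function name `segment_by_length` and the 42-character default are illustrative assumptions (42 characters per line is a commonly cited subtitling guideline, not a value taken from this abstract).

```python
def segment_by_length(sentence: str, max_chars: int = 42) -> list[str]:
    """Greedily pack whole words into segments of at most max_chars characters.

    Hypothetical baseline: it only respects a length constraint and knows
    nothing about phrases or linguistic units.
    """
    segments: list[str] = []
    current = ""
    for word in sentence.split():
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= max_chars or not current:
            # The word fits, or it is a single over-long word that must stand alone.
            current = candidate
        else:
            # The word does not fit: close the current segment and start a new one.
            segments.append(current)
            current = word
    if current:
        segments.append(current)
    return segments


if __name__ == "__main__":
    text = "Subtitles need to be easily readable in order to transmit information."
    for line in segment_by_length(text, max_chars=30):
        print(line)
```

With a 30-character limit, this baseline breaks the example sentence between "easily" and "readable", separating words that belong to the same linguistic unit. Breaks of this kind are what segmentation by phrase or linguistic unit is meant to avoid.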
This study aims to shed light on the possible effects of line-breaks on film subtitle readability an...
Proceedings of the 17th Nordic Conference of Computational Linguistics NODALIDA 2009. Editors: Kri...
Quality is a rather slippery concept, and its assessment in subtitling can be a challenging task, as...
Growing needs in localising multimedia content for global audiences have resulted in Neural Machine ...
Understanding the way people watch subtitled films has become a central concern for subtitling resea...
Speech translation for subtitling (SubST) is the task of automatically translating speech data into ...
This paper describes the evaluation methodology followed to measure the impact of using a machine le...
There is a discrepancy between professional subtitling guidelines and how they are implemented in re...
Subtitles appear on screen as short pieces of text, segmented based on formal constraints (length) a...
Growing needs in localising audiovisual content in multiple languages through subtitles call for the...
[EN] Automatic segmentation of subtitles is a novel research field which has not been studied extens...
Growing needs in translating multimedia content have resulted in Neural Machine Translation (NMT) ...
Psycho-linguistic literature shows that reading is so highly structured and formalized as to make it...