[EN] Automatic segmentation of subtitles is a novel research field which has not been studied extensively to date. However, quality automatic subtitling is a real need for broadcasters which seek for automatic solutions given the demanding European audiovisual legislation. In this article, a method based on Conditional Random Field is presented to deal with the automatic subtitling segmentation. This is a continuation of a previous work in the field, which proposed a method based on Support Vector Machine classifier to generate possible candidates for breaks. For this study, two corpora in Basque and Spanish were used for experiments, and the performance of the current method was tested and compared with the previous solution and two rule-b...
Segmentation is widely used in adapting Statistical Machine Translation to highly inflected language...
In this paper we describe an improved algorithm for the automatic segmentation of speech corpora. Ap...
International audienceThis paper studies the generation of intralingual closed captions from automat...
[EN] Automatic segmentation of subtitles is a novel research field which has not been studied extens...
This paper describes the evaluation methodology followed to measure the impact of using a machine le...
Subtitles, in order to achieve their purpose of transmitting information, need to be easily readable...
Subtitles appear on screen as short pieces of text, segmented based on formal constraints (length) a...
Speech translation for subtitling (SubST) is the task of automatically translating speech data into ...
Movie and TV subtitles contain large amounts of conversational material, but lack an explicit turn s...
The subject of the thesis is automatic sentence compression with machine learning, so that the compr...
In this paper, we present some recent improvements in our automatic speech segmentation system, whic...
Recent developments in machine translation and speech translation are opening up opportunities for c...
Growing needs in localising multimedia content for global audiences have resulted in Neural Machine ...
The linguistic segmentation has been the interest of researchers in the field of deaf and hardof-hea...
Growing needs in localising audiovisual content in multiple languages through subtitles call for the...
Segmentation is widely used in adapting Statistical Machine Translation to highly inflected language...
In this paper we describe an improved algorithm for the automatic segmentation of speech corpora. Ap...
International audienceThis paper studies the generation of intralingual closed captions from automat...
[EN] Automatic segmentation of subtitles is a novel research field which has not been studied extens...
This paper describes the evaluation methodology followed to measure the impact of using a machine le...
Subtitles, in order to achieve their purpose of transmitting information, need to be easily readable...
Subtitles appear on screen as short pieces of text, segmented based on formal constraints (length) a...
Speech translation for subtitling (SubST) is the task of automatically translating speech data into ...
Movie and TV subtitles contain large amounts of conversational material, but lack an explicit turn s...
The subject of the thesis is automatic sentence compression with machine learning, so that the compr...
In this paper, we present some recent improvements in our automatic speech segmentation system, whic...
Recent developments in machine translation and speech translation are opening up opportunities for c...
Growing needs in localising multimedia content for global audiences have resulted in Neural Machine ...
The linguistic segmentation has been the interest of researchers in the field of deaf and hardof-hea...
Growing needs in localising audiovisual content in multiple languages through subtitles call for the...
Segmentation is widely used in adapting Statistical Machine Translation to highly inflected language...
In this paper we describe an improved algorithm for the automatic segmentation of speech corpora. Ap...
International audienceThis paper studies the generation of intralingual closed captions from automat...