In this paper we present a new algorithm for text segmentation based on deep sentence encoders and the TextTiling algorithm. We will describe how text segmentation is an essential first step in the re-purposing of media content like TV newscasts and how the proposed methodology can add value to other subsequent tasks involving such media products thanks to the features extracted for segmentation. We present experiments on Wikipedia and transcripts from CNN 10 news show and the results of the proposed algorithm will be compared to other approaches. Our method shows improvement over other unsupervised methods and it gives results that are competitive with supervised approaches without the need for any training data. Finally, we will give exam...
International audienceExtractive summarization consists of generating a summary by ranking sentences...
In this paper, we present a framework for segmenting the news programs into different story topics. ...
In this paper, we introduce and evaluate two novel approaches, one using video stream and the other ...
In this paper we present a new algorithm for text segmentation based on deep sentence encoders and t...
In this paper we propose a course-grained NLP approach to text segmentation based on the\ud analysis...
Text segmentation is a traditional task in NLP where a document is broken down into smaller, coheren...
Automatic text segmentation, which is the task of breaking a text into topically-consistent segments...
Current research devoted to the Natural Language Processing problem of sentence segmentation from ra...
Neural sentence encoders (NSE) are effective in many NLP tasks, including topic segmentation. Howeve...
Traditional sentence representations such as bag-of-words (BOW) and term frequency-inverse document ...
Abstract. We present divSeg, a novel method for text segmentation that iteratively splits a portion ...
This paper proposes a transformer over transformer framework, called Transformer2, to perform neural...
17 pages, 2 columnsInternational audienceThe gradual migration of television from broadcast diffusio...
This paper introduces a new statistical approach to partitioning text automatically into coherent se...
In the Natural Language Understanding field, one of the important tasks is topic detection. Given th...
International audienceExtractive summarization consists of generating a summary by ranking sentences...
In this paper, we present a framework for segmenting the news programs into different story topics. ...
In this paper, we introduce and evaluate two novel approaches, one using video stream and the other ...
In this paper we present a new algorithm for text segmentation based on deep sentence encoders and t...
In this paper we propose a course-grained NLP approach to text segmentation based on the\ud analysis...
Text segmentation is a traditional task in NLP where a document is broken down into smaller, coheren...
Automatic text segmentation, which is the task of breaking a text into topically-consistent segments...
Current research devoted to the Natural Language Processing problem of sentence segmentation from ra...
Neural sentence encoders (NSE) are effective in many NLP tasks, including topic segmentation. Howeve...
Traditional sentence representations such as bag-of-words (BOW) and term frequency-inverse document ...
Abstract. We present divSeg, a novel method for text segmentation that iteratively splits a portion ...
This paper proposes a transformer over transformer framework, called Transformer2, to perform neural...
17 pages, 2 columnsInternational audienceThe gradual migration of television from broadcast diffusio...
This paper introduces a new statistical approach to partitioning text automatically into coherent se...
In the Natural Language Understanding field, one of the important tasks is topic detection. Given th...
International audienceExtractive summarization consists of generating a summary by ranking sentences...
In this paper, we present a framework for segmenting the news programs into different story topics. ...
In this paper, we introduce and evaluate two novel approaches, one using video stream and the other ...