Text segmentation is a very critical step to many applications and while it has been addressed extensively for the English language, work on text segmentation for other languages is still lagging behind. In this paper the ArabicSeg system for segmenting Arabic news stories is presented. The developed system is based on a linguistic technique called lexical chaining which measures the cohesion between textual units. In conjunction with this technique, a set of error reduction filters have been introduced and were found to significantly reduce segmentation errors in the detection of borders in Arabic based news stories. The results of evaluation experiments carried out on an Arabic Reuters news story dataset are presented. An analysis of the ...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
Segmentation is considered as a core step for any recognition or classification method and for the t...
Text embedded in video sequences is very important to semantic indexing and content-based retrieval ...
Topic segmentation is essential for a lot of Natural Language Processing (NLP) applications, such as...
AbstractTopic segmentation is important for many natural language processing applications such as in...
This paper describes a rule-based approach to segment Arabic texts into clauses. Our method relies o...
The Arabic language has a very rich morphology where a word is composed of zero or more prefixes, a ...
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent proce...
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent proce...
The need of having a topic segmentation system for Arabic text is to improve the functionalities of ...
In this paper we compare the performance of three dis-tinct approaches to lexical cohesion based tex...
Recently, natural language processing tasks are more frequently conducted over online content. This ...
In this paper, we present results from a Broadcast News story segmentation system developed for the ...
This paper focuses on presenting a general methodology for acquiring and automatically segmenting br...
In this study, we present an automatic technique to help segment the Arabic texts while preserving t...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
Segmentation is considered as a core step for any recognition or classification method and for the t...
Text embedded in video sequences is very important to semantic indexing and content-based retrieval ...
Topic segmentation is essential for a lot of Natural Language Processing (NLP) applications, such as...
AbstractTopic segmentation is important for many natural language processing applications such as in...
This paper describes a rule-based approach to segment Arabic texts into clauses. Our method relies o...
The Arabic language has a very rich morphology where a word is composed of zero or more prefixes, a ...
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent proce...
Story segmentation of news broadcasts has been shown to improve the accuracy of the subsequent proce...
The need of having a topic segmentation system for Arabic text is to improve the functionalities of ...
In this paper we compare the performance of three dis-tinct approaches to lexical cohesion based tex...
Recently, natural language processing tasks are more frequently conducted over online content. This ...
In this paper, we present results from a Broadcast News story segmentation system developed for the ...
This paper focuses on presenting a general methodology for acquiring and automatically segmenting br...
In this study, we present an automatic technique to help segment the Arabic texts while preserving t...
In this paper we compare the performance of three distinct approaches to lexical cohesion based text...
Segmentation is considered as a core step for any recognition or classification method and for the t...
Text embedded in video sequences is very important to semantic indexing and content-based retrieval ...