This paper focuses on presenting a general methodology for acquiring and automatically segmenting broadcast news data from the web. It was shown that it is possible starting from a relatively small corpus of about 10 hours to segment automatically about 30 hours of data. This step is important because manual segmentation of broadcast news data is generally very tedious and time consuming. In addition to the data collection proposal we show the development of an initial recognition system. We present an automatic procedure for creating vowelizations for Arabic words. This is again important because most available Arabic transcriptions lack vowelization, which is crucial for creating phonetic transcription. The performance of our system is in...
In this paper we present a recipe and language resources for training and testing Arabic speech reco...
In this paper, we present results from a Broadcast News story segmentation system developed for the ...
Arabic is the most widely spoken language in the Arab World. Most people of the Islamic World unders...
This paper describes the collection and transcription of large amounts of Arabic broadcast news spee...
This paper describes the collect and transcription of a large set of Arabic broadcast news speech da...
In this paper, we show the progress for Arabic speech recognition by incorporating contextual inform...
Treebanking a large corpus of relatively structured speech transcribed from various Arabic Broadcast...
This paper presents our recent effort that aims at improving our Arabic broadcast news (BN) recognit...
This paper presents the results and conclusions of a study on speech segmentation system for Arabic ...
This paper reports the results of the first phase of a research work for building a high performance...
Recently, promising results have been reported on video text detection and recognition. Most of the ...
Language Engineering, including Information Retrieval, Machine Translation and other Natural Languag...
Text segmentation is a very critical step to many applications and while it has been addressed exten...
This dataset is a relatively great size collection of Arabic news tweets that were collected from an...
The dataset is a collection of Arabic texts, which covers modern Arabic language used in newspapers ...
In this paper we present a recipe and language resources for training and testing Arabic speech reco...
In this paper, we present results from a Broadcast News story segmentation system developed for the ...
Arabic is the most widely spoken language in the Arab World. Most people of the Islamic World unders...
This paper describes the collection and transcription of large amounts of Arabic broadcast news spee...
This paper describes the collect and transcription of a large set of Arabic broadcast news speech da...
In this paper, we show the progress for Arabic speech recognition by incorporating contextual inform...
Treebanking a large corpus of relatively structured speech transcribed from various Arabic Broadcast...
This paper presents our recent effort that aims at improving our Arabic broadcast news (BN) recognit...
This paper presents the results and conclusions of a study on speech segmentation system for Arabic ...
This paper reports the results of the first phase of a research work for building a high performance...
Recently, promising results have been reported on video text detection and recognition. Most of the ...
Language Engineering, including Information Retrieval, Machine Translation and other Natural Languag...
Text segmentation is a very critical step to many applications and while it has been addressed exten...
This dataset is a relatively great size collection of Arabic news tweets that were collected from an...
The dataset is a collection of Arabic texts, which covers modern Arabic language used in newspapers ...
In this paper we present a recipe and language resources for training and testing Arabic speech reco...
In this paper, we present results from a Broadcast News story segmentation system developed for the ...
Arabic is the most widely spoken language in the Arab World. Most people of the Islamic World unders...