In this paper, we address the lexicon design problem in Turkish large vocabulary speech recognition. Although we focus only on Turkish, the methods described here are general enough that they can be considered for other agglutinative languages like Finnish, Korean etc. In an agglutinative language, several words can be created from a single root word using a rich collection of morphological rules. So, a virtually infinite size lexicon is required to cover the language if words are used as the basic units. The standard approach to this problem is to discover a number of primitive units so that a large set of words can be created by compounding those units. Two broad classes of methods are available for splitting words into their sub-units; m...
This paper presents work on developing speech corpora and recognition tools for Turkish by porting S...
Title: Processing of Turkic Languages Author: Sibel Ciddi Department: Institute of Formal and Applie...
In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative...
Turkish is an agglutinative language with rich morphology. A Turkish verb can have thousands of diff...
This chapter presents an overview of language modeling followed by a discussion of the challenges in...
To build automatic speech recognition (ASR) systems with a low word error rate (WER), a large speech...
Since Turkish is a morphologically productive language, it is almost impossible for a word-based rec...
It is practically impossible to build a word-based lexicon for speech recognition in agglutinative l...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
This study aims to build a new language model for Turkish continuous speech recognition. Turkish is ...
The primary aim of this study was to contribute to the development of multilingual automatic speech ...
Automatic speech recognition (ASR) is one of the most important applications of speech and language ...
Abstract—This paper proposes a novel approach to integrate the morphology of a language into an auto...
. In this paper we outline a lexical organization for Turkish that makes use of lexical rules for i...
Today, the vocabulary size for language models in large vocabulary speech recognition is typically s...
This paper presents work on developing speech corpora and recognition tools for Turkish by porting S...
Title: Processing of Turkic Languages Author: Sibel Ciddi Department: Institute of Formal and Applie...
In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative...
Turkish is an agglutinative language with rich morphology. A Turkish verb can have thousands of diff...
This chapter presents an overview of language modeling followed by a discussion of the challenges in...
To build automatic speech recognition (ASR) systems with a low word error rate (WER), a large speech...
Since Turkish is a morphologically productive language, it is almost impossible for a word-based rec...
It is practically impossible to build a word-based lexicon for speech recognition in agglutinative l...
We study class-based n-gram and neural network language models for very large vocabulary speech reco...
This study aims to build a new language model for Turkish continuous speech recognition. Turkish is ...
The primary aim of this study was to contribute to the development of multilingual automatic speech ...
Automatic speech recognition (ASR) is one of the most important applications of speech and language ...
Abstract—This paper proposes a novel approach to integrate the morphology of a language into an auto...
. In this paper we outline a lexical organization for Turkish that makes use of lexical rules for i...
Today, the vocabulary size for language models in large vocabulary speech recognition is typically s...
This paper presents work on developing speech corpora and recognition tools for Turkish by porting S...
Title: Processing of Turkic Languages Author: Sibel Ciddi Department: Institute of Formal and Applie...
In this paper, we design and develop an intelligible and natural sounding corpus-based concatenative...