Abstract. Language identification is widely used in machine learning, text mining, information retrieval, and speech processing. Available techniques for language identification require large amounts of training text, which are not available for under-resourced languages, and these form the bulk of the world’s languages. The primary objective of this study is to propose a lexicon-based algorithm that can perform language identification using minimal training data. Because language identification is often the first step in many natural language processing tasks, it is necessary to explore techniques that perform language identification in the shortest possible time. Hence, the second objective of this research is...
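The abstract does not spell out the proposed algorithm, so the following is only a minimal sketch of what a lexicon-based identifier could look like, not the authors' method. It assumes each language is represented by a small set of known words and picks the language whose lexicon matches the most tokens of the input. The `LEXICONS` table, the function name `identify_language`, and the word lists are all hypothetical and chosen purely for illustration.

```python
import re

# Hypothetical toy lexicons (an assumption for illustration only);
# a real system would derive these from whatever training text is available.
LEXICONS = {
    "english": {"the", "and", "is", "of", "to", "in"},
    "french": {"le", "la", "et", "est", "de", "les"},
    "spanish": {"el", "la", "y", "es", "de", "los"},
}


def identify_language(text, lexicons=LEXICONS):
    """Return the language whose lexicon matches the most tokens, or None.

    This is a plain lookup-and-count scheme: no probabilities, no n-grams.
    """
    # Lowercase the text and split it into word tokens.
    tokens = re.findall(r"\w+", text.lower())

    best_language = None
    best_score = 0
    for language, words in lexicons.items():
        # Score = number of tokens found in this language's lexicon.
        score = sum(1 for token in tokens if token in words)
        if score > best_score:
            best_language = language
            best_score = score

    # None means no token matched any lexicon.
    return best_language


if __name__ == "__main__":
    print(identify_language("The cat is in the house"))
    print(identify_language("Le chat est dans la maison"))
```

Because scoring is a set lookup per token, running time grows linearly with the length of the input and the number of languages, which fits the abstract's emphasis on speed. The lexicons themselves can be built from very little text, though such small word lists will leave many short inputs unmatched.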
We present a statistical approach to text-based automatic language identification that focuses on di...
We describe the use of text data scraped from the web to augment language models for Automatic Speec...
Recently there has been interest in the approaches for training speech recognition systems for langu...
Language identification is widely used in machine learning, text mining, information retrieval, and ...
Language identification is the process of determining the natural language of text documents using c...
Abstract. This paper describes the participation of UAIC team at the LogCLEF 2011 initiative, langua...
Abstract. Language identification (LI) is a phase of natural language processing. Although LI is forme...
The classification accuracy of text-based language identification depends on several factors, includ...
Abstract—Language Identification is the process of determining in which natural language the content...
In a multi-language Information Retrieval setting, the knowledge about the language of a user query ...
In this paper we present two experiments conducted for comparison of different language identificati...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Loanword identification has been studied in recent years to alleviate data sparseness in several natural l...
Automatic Language Identification (ALI) is the first necessary step to do any language-dependent nat...
This paper describes three approaches to the task of automatically identifying the language a text i...