Language identification is a simple problem that becomes much more difficult when its usual assumptions are broken. In this paper we consider the task of classifying short segments of text in closely-related languages for the Discriminating Similar Languages shared task, which is broke
Computational approaches in language identification often result in high number of false positives a...
In our paper we present two new approaches for language identification. Both of them are based on th...
We present a statistical approach to text-based automatic language identification that focuses on di...
Automatic Language Identification (LI) or Dialect Identification (DI) of short texts of closely rela...
Text on the Internet is written in different languages and scripts that can be divided into differen...
This paper describes an experiment to perform language identification on a sub-sentence basis. The t...
Automatic Language Identification (LI) or Dialect Identification (DI) of short texts of closely rela...
This paper extends the work of Cavnar and Trenkle N-gram text categorization [2], enhances the study...
Abstract. This paper describes the participation of UAIC team at the LogCLEF 2011 initiative, langua...
In a multi-language Information Retrieval setting, the knowledge about the language of a user query ...
Language identification is an important issue in many speech applica-tions. We address this problem ...
A theory of approximate language identification analogous to the existing theory of exact language i...
Abstract—Language Identification is the process of determining in which natural language the content...
In this paper we present two experiments conducted for comparison of different language identificati...
In this paper we describe the language identification system we developed for the Discriminating Sim...
Computational approaches in language identification often result in high number of false positives a...
In our paper we present two new approaches for language identification. Both of them are based on th...
We present a statistical approach to text-based automatic language identification that focuses on di...
Automatic Language Identification (LI) or Dialect Identification (DI) of short texts of closely rela...
Text on the Internet is written in different languages and scripts that can be divided into differen...
This paper describes an experiment to perform language identification on a sub-sentence basis. The t...
Automatic Language Identification (LI) or Dialect Identification (DI) of short texts of closely rela...
This paper extends the work of Cavnar and Trenkle N-gram text categorization [2], enhances the study...
Abstract. This paper describes the participation of UAIC team at the LogCLEF 2011 initiative, langua...
In a multi-language Information Retrieval setting, the knowledge about the language of a user query ...
Language identification is an important issue in many speech applica-tions. We address this problem ...
A theory of approximate language identification analogous to the existing theory of exact language i...
Abstract—Language Identification is the process of determining in which natural language the content...
In this paper we present two experiments conducted for comparison of different language identificati...
In this paper we describe the language identification system we developed for the Discriminating Sim...
Computational approaches in language identification often result in high number of false positives a...
In our paper we present two new approaches for language identification. Both of them are based on th...
We present a statistical approach to text-based automatic language identification that focuses on di...