This work deals with the application that uses the machine-learning methods for the automatic language and encoding recognition. Various topics related to the project are analysed here one by one. Brief descriptionof the application should give the reader an idea about the features andthe usage of the program. The n-gram language model, the EM-smoothing algorithm and the entropy is explained in the chapter about the statistical methods. Some implementation matters are subject of analysis, for example the data file format and the inner word representation using the trie. The interpretation of a vast set of experiments evaluating the precision of the application is included as well as a short summary of the possible usage of the software
We present a framework for statistical machine translation of natural languages based on direct maxi...
Abstract: In this paper we present a synthesis of the theoretical fundamentals and some practical as...
This paper extends the work of Cavnar and Trenkle N-gram text categorization [2], enhances the study...
Processing simple or complex texts (MIME type - application) often requires automatic recognition of...
Abstract: Statistical properties of European language texts are investigated with the use ...
We explain how to apply statistical techniques to solve several language-recognition problems that a...
This paper investigates the integration of a statistical language model into an on-line recognition ...
Abstract—Language Identification is the process of determining in which natural language the content...
The thesis deals with language modelling for German. The main concerns are the specifics of German l...
Automatic Speech Recognition (ASR) systems utilize statistical acoustic and language models to find ...
Language technology is when a computer processes human languages in some way. Since human languages ...
Recent years have seen rapid growth in the deployment of statistical methods for computational langu...
Language modeling is an important part for both speech recognition and machine translation systems. ...
TITLE: Language recognition based on the frequency analysis SUMMARY: The aim of this bachelor thesis...
We present a statistical approach to text-based automatic language identification that focuses on di...
We present a framework for statistical machine translation of natural languages based on direct maxi...
Abstract: In this paper we present a synthesis of the theoretical fundamentals and some practical as...
This paper extends the work of Cavnar and Trenkle N-gram text categorization [2], enhances the study...
Processing simple or complex texts (MIME type - application) often requires automatic recognition of...
Abstract: Statistical properties of European language texts are investigated with the use ...
We explain how to apply statistical techniques to solve several language-recognition problems that a...
This paper investigates the integration of a statistical language model into an on-line recognition ...
Abstract—Language Identification is the process of determining in which natural language the content...
The thesis deals with language modelling for German. The main concerns are the specifics of German l...
Automatic Speech Recognition (ASR) systems utilize statistical acoustic and language models to find ...
Language technology is when a computer processes human languages in some way. Since human languages ...
Recent years have seen rapid growth in the deployment of statistical methods for computational langu...
Language modeling is an important part for both speech recognition and machine translation systems. ...
TITLE: Language recognition based on the frequency analysis SUMMARY: The aim of this bachelor thesis...
We present a statistical approach to text-based automatic language identification that focuses on di...
We present a framework for statistical machine translation of natural languages based on direct maxi...
Abstract: In this paper we present a synthesis of the theoretical fundamentals and some practical as...
This paper extends the work of Cavnar and Trenkle N-gram text categorization [2], enhances the study...