ISBN 978-1-945626-43-2International audienceThe present contribution revolves around a contrastive subword n-gram model which has been tested in the Discriminating between Similar Languages shared task. I present and discuss the method used in this 14-way language identification task comprising varieties of 6 main language groups. It features the following characteristics: (1) the preprocessing and conversion of a collection of documents to sparse features; (2) weighted character n-gram profiles; (3) a multinomial Bayesian classifier. Meaningful bag-of-n-grams features can be used as a system in a straightforward way, my approach outperforms most of the systems used in the DSL shared task (3rd rank)
We describe the system built by the National Research Council Canada for the ”Discriminating between...
Verwimp L., Pelemans J., Van hamme H., Wambacq P., ''Extending n-gram language models based on equiv...
We describe the system built by the National Research Council (NRC) Canada for the 2015 shared task ...
This paper describes an approach to discriminating similar languages using word- and character-based...
In this paper we describe the language identification system we developed for the Discriminating Sim...
This paper presents a novel neural architecture capable of outperforming state-of-the-art systems on...
The Discriminating between Similar Languages (DSL) shared task at VarDial challenged partici-pants t...
Statistical n-gram language modeling is used in many domains like speech recognition, language ident...
This paper describes our approaches to Na-tive Language Identification (NLI) for the NLI shared task...
International audienceThis paper describes an extension of the n-gram language model: the similar n-...
We describe the system built by the National Research Council Canada for the \u201dDiscriminating be...
In this paper we present two experiments conducted for comparison of different language identificati...
In this paper, we explore the use of the Support Vector Machines (SVMs) to learn a discriminatively ...
This paper describes the system developed by the Centre for English Corpus Linguis- tics (CECL) to d...
We present a method to discriminate between texts written in either the Netherlandic or the Flemish ...
We describe the system built by the National Research Council Canada for the ”Discriminating between...
Verwimp L., Pelemans J., Van hamme H., Wambacq P., ''Extending n-gram language models based on equiv...
We describe the system built by the National Research Council (NRC) Canada for the 2015 shared task ...
This paper describes an approach to discriminating similar languages using word- and character-based...
In this paper we describe the language identification system we developed for the Discriminating Sim...
This paper presents a novel neural architecture capable of outperforming state-of-the-art systems on...
The Discriminating between Similar Languages (DSL) shared task at VarDial challenged partici-pants t...
Statistical n-gram language modeling is used in many domains like speech recognition, language ident...
This paper describes our approaches to Na-tive Language Identification (NLI) for the NLI shared task...
International audienceThis paper describes an extension of the n-gram language model: the similar n-...
We describe the system built by the National Research Council Canada for the \u201dDiscriminating be...
In this paper we present two experiments conducted for comparison of different language identificati...
In this paper, we explore the use of the Support Vector Machines (SVMs) to learn a discriminatively ...
This paper describes the system developed by the Centre for English Corpus Linguis- tics (CECL) to d...
We present a method to discriminate between texts written in either the Netherlandic or the Flemish ...
We describe the system built by the National Research Council Canada for the ”Discriminating between...
Verwimp L., Pelemans J., Van hamme H., Wambacq P., ''Extending n-gram language models based on equiv...
We describe the system built by the National Research Council (NRC) Canada for the 2015 shared task ...