The paper reports work on collecting and annotating code-mixed English-Hindi social media text (Twitter and Facebook messages), and experiments on automatic tagging of these corpora, using both a coarse-grained and a fine-grained part-of-speech tag set. We compare the performance of a combination of language-specific taggers to that of applying four machine learning algorithms to the task (Conditional Random Fields, Sequential Minimal Optimization, Naïve Bayes and Random Forests), using a range of different features based on word context and word-internal information.
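As a rough illustration of what "features based on word context and word-internal information" can look like, the sketch below builds one feature dictionary per token. It is not the paper's feature set, which the abstract does not list: the function names `token_features` and `sentence_features`, the individual features, the `<BOS>`/`<EOS>` padding symbols, and the example sentence are all assumptions made for this example.

```python
from typing import Dict, List


def token_features(tokens: List[str], i: int) -> Dict[str, object]:
    """Build a feature dict for tokens[i] (hypothetical feature set)."""
    word = tokens[i]
    lower = word.lower()
    return {
        # Word-internal information: surface form, affixes, orthography.
        "lower": lower,
        "prefix3": lower[:3],
        "suffix3": lower[-3:],
        "is_title": word.istitle(),
        "is_upper": word.isupper(),
        "has_digit": any(ch.isdigit() for ch in word),
        "is_hashtag": word.startswith("#"),
        "is_mention": word.startswith("@"),
        # Word context: neighbouring tokens, padded at sentence edges.
        "prev": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def sentence_features(tokens: List[str]) -> List[Dict[str, object]]:
    """Return one feature dict per token in the sentence."""
    return [token_features(tokens, i) for i in range(len(tokens))]


# Made-up code-mixed example sentence, for illustration only.
tokens = ["Yaar", "this", "movie", "bahut", "achhi", "thi", "#weekend"]
features = sentence_features(tokens)
```

Feature dictionaries of this shape can be fed to a sequence model such as a CRF, or vectorized for classifiers like Naïve Bayes or Random Forests; which features actually help on code-mixed text is an empirical question the sketch does not address.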
We address the problem of part-of-speech tagging for English data from the popular microblogging ser...
Code-mixed text classification is challenging due to the lack of code-mixed labeled da...
Part-of-Speech (POS) tagging can be described as the task of automatic annotation of syntactic c...
Code-mixing is frequently observed in user generated content on social media, especially from multil...
Code-mixing is frequently observed in user generated content on social media, especially from multil...
Code-mixing or language-mixing is a linguistic phenomenon where multiple languages mix together durin...
A Part-Of-Speech Tagger (POS Tagger) is a tool that scans the text in a specific language and allocate...
Language identification at the document level has been considered an almost solved problem in ...
Language identification at the document level has been considered an almost solved problem in some a...
Multilingual users of social media sometimes use multiple languages during conversation. Mixing mult...
Automatic understanding of noisy social media text is one of the prime present-day resear...
Natural Language Processing (NLP) tools typically struggle to process code-switched data and so ling...
In social media communication, multilingual speakers often switch between languages, and, in such ...
In social media communication, multilingual speakers often switch between languages, and, in such ...