In this paper we discuss the parallel manual normalisation of samples extracted from Croatian and Serbian Twitter corpora. We describe the datasets, outline the unified guidelines provided to annotators, and present a series of analyses of standard-to-non-standard transformations found in the Twitter data. The results show that closed part-of-speech classes are transformed more frequently than the open classes, that the most frequently transformed lemmas are auxiliary and modal verbs, interjections, particles and pronouns, that character deletions are more frequent than insertions and replacements, and that more transformations occur at the word end than in other positions. Croatian and Serbian are found to share many, bu...
ReLDI-NormTagNER-hr 2.0 is a manually annotated corpus of Croatian tweets. It is meant as a gold-sta...
Data used in the experiments described in: Nikola Ljubešić, Katja Zupan, Darja Fišer and Tomaž Er...
The language used in social media is often characterized by the abundance of informal and non-standa...
In this paper we discuss the parallel manual normalisation of samples extracted from Croatian and Se...
ReLDI-NormTag-hr 1.0 is a manually annotated corpus of Croatian tweets. It is meant as a gold-standa...
ReLDI-NormTag-sr 1.1 is a manually annotated corpus of Serbian tweets. It is meant as a gold-standar...
ReLDI-NormTag-hr 1.1 is a manually annotated corpus of Croatian tweets. It is meant as a gold-standa...
ReLDI-NormTag-sr 1.0 is a manually annotated corpus of Serbian tweets. It is meant as a gold-standar...
In this paper we carry out a cross-lingual comparison of nonstandard features in the language of soc...
In this paper, we investigate the spelling conventions on the Twitter micro-blogging pla...
This paper deals with non-standard spelling variants of Serbian words on Twitter. The analysis is b...
ReLDI-NormTagNER-sr 2.1 is a manually annotated corpus of Serbian tweets. It is meant as a gold-stan...
ReLDI-NormTagNER-sr 2.0 is a manually annotated corpus of Serbian tweets. It is meant as a gold-stan...
ReLDI-NormTagNER-hr 2.1 is a manually annotated corpus of Croatian tweets. It is meant as a gold-sta...
Given the growing need to quickly process texts and extract information from the data for various pu...
ReLDI-NormTagNER-hr 2.0 is a manually annotated corpus of Croatian tweets. It is meant as a gold-sta...
Data used in the experiments described in: Nikola Ljubešić, Katja Zupan, Darja Fišer and Tomaž Er...
The language used in social media is often characterized by the abundance of informal and non-standa...
In this paper we discuss the parallel manual normalisation of samples extracted from Croatian and Se...
ReLDI-NormTag-hr 1.0 is a manually annotated corpus of Croatian tweets. It is meant as a gold-standa...
ReLDI-NormTag-sr 1.1 is a manually annotated corpus of Serbian tweets. It is meant as a gold-standar...
ReLDI-NormTag-hr 1.1 is a manually annotated corpus of Croatian tweets. It is meant as a gold-standa...
ReLDI-NormTag-sr 1.0 is a manually annotated corpus of Serbian tweets. It is meant as a gold-standar...
In this paper we carry out a cross-lingual comparison of nonstandard features in the language of soc...
In this paper, we investigate the spelling conventions on the Twitter micro-blogging pla...
This paper deals with non-standard spelling variants of Serbian words on Twitter. The analysis is b...
ReLDI-NormTagNER-sr 2.1 is a manually annotated corpus of Serbian tweets. It is meant as a gold-stan...
ReLDI-NormTagNER-sr 2.0 is a manually annotated corpus of Serbian tweets. It is meant as a gold-stan...
ReLDI-NormTagNER-hr 2.1 is a manually annotated corpus of Croatian tweets. It is meant as a gold-sta...
Given the growing need to quickly process texts and extract information from the data for various pu...
ReLDI-NormTagNER-hr 2.0 is a manually annotated corpus of Croatian tweets. It is meant as a gold-sta...
Data used in the experiments described in: Nikola Ljubešić, Katja Zupan, Darja Fišer and Tomaž Er...
The language used in social media is often characterized by the abundance of informal and non-standa...