For the Authorship Attribution (AA) task, character n-grams are considered among the best predictive features. In the English language, it has also been shown that some types of character n-grams perform better than others. This paper tackles the AA task in Portuguese by examining the performance of different types of character n-grams, and various combinations of them. The paper also experiments with different feature representations and machine-learning algorithms. Moreover, the paper demonstrates that the performance of the character n-gram approach can be improved by fine-tuning the feature set and by appropriately selecting the length and type of character n-grams. This relatively simple and language-independent approach to the AA task...
WOS:000417412800051This paper reports comparative authorship attribution results obtained on the Int...
Prior research has considered the sequential order of function words, after the contextual words of ...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
For the Authorship Attribution (AA) task, character n-grams are considered among the best predictive...
We present a novel method for computer-assisted authorship attribution based on character-level n-gr...
Language embeddings are often used as black-box word-level tools that provide powerful language anal...
We present a novel method for computer-assisted authorship attribution based on characterlevel n-gra...
Automatic authorship attribution aims to train computers to identify the author of a disputed text b...
Abstract. Automatic authorship identification offers a valuable tool for supporting crime investigat...
The frequencies of individual words have been the mainstay of computer-assisted authorial attributio...
Authorship attribution (AA) is the task of identifying authors of disputed or anonymous texts. It ca...
International audienceWe describe here the technical details of our participation to PAN 2012's "tra...
This paper presents work on using continuous representations for authorship attribution. In contra...
The main objective of this dissertation is to evaluate the discriminatory capacity of n-grams - i.e....
Automatic Language Identification of written texts is a well-established area of research in Computa...
WOS:000417412800051This paper reports comparative authorship attribution results obtained on the Int...
Prior research has considered the sequential order of function words, after the contextual words of ...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
For the Authorship Attribution (AA) task, character n-grams are considered among the best predictive...
We present a novel method for computer-assisted authorship attribution based on character-level n-gr...
Language embeddings are often used as black-box word-level tools that provide powerful language anal...
We present a novel method for computer-assisted authorship attribution based on characterlevel n-gra...
Automatic authorship attribution aims to train computers to identify the author of a disputed text b...
Abstract. Automatic authorship identification offers a valuable tool for supporting crime investigat...
The frequencies of individual words have been the mainstay of computer-assisted authorial attributio...
Authorship attribution (AA) is the task of identifying authors of disputed or anonymous texts. It ca...
International audienceWe describe here the technical details of our participation to PAN 2012's "tra...
This paper presents work on using continuous representations for authorship attribution. In contra...
The main objective of this dissertation is to evaluate the discriminatory capacity of n-grams - i.e....
Automatic Language Identification of written texts is a well-established area of research in Computa...
WOS:000417412800051This paper reports comparative authorship attribution results obtained on the Int...
Prior research has considered the sequential order of function words, after the contextual words of ...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...