International audienceWe describe here the technical details of our participation to PAN 2012's "traditional" authorship attribution tasks. The main originality of our approach lies in the use of a large quantity of varied features to represent textual data, processed by a maximum entropy machine learning tool. Most of these features make an intensive use of natural language processing annotation techniques as well as generic language resources such as lexicons and other linguistic databases. Some of the features were even designed specifically for the target data type (contemporary fiction). Our belief is that richer features, that integrate external knowledge about language, have an advantage over knowledge-poorer ones (such as words and ...
This study investigates the problem of appropriate choice of texts for the training set in machine-l...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
International audienceWe describe here the technical details of our participation to PAN 2012's "tra...
Applications of authorship attribution ‘in the wild ’ [Koppel, M., Schler, J., and Argamon, S. (2010...
International audienceThis paper reports on the procedure and learning models we adopted for the 'PA...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
The frequencies of individual words have been the mainstay of computer-assisted authorial attributio...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
WOS:000417412800051This paper reports comparative authorship attribution results obtained on the Int...
Authorship attribution (AA) is the process of identifying the author of a given text and from the ma...
Translatorship attribution deals with accurately attributing a translation to its translator. The ta...
Language embeddings are often used as black-box word-level tools that provide powerful language anal...
We present a novel method for computer-assisted authorship attribution based on characterlevel n-gra...
In practice, training language models for individual authors is often expensive because of limited d...
This study investigates the problem of appropriate choice of texts for the training set in machine-l...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
International audienceWe describe here the technical details of our participation to PAN 2012's "tra...
Applications of authorship attribution ‘in the wild ’ [Koppel, M., Schler, J., and Argamon, S. (2010...
International audienceThis paper reports on the procedure and learning models we adopted for the 'PA...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
The frequencies of individual words have been the mainstay of computer-assisted authorial attributio...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
WOS:000417412800051This paper reports comparative authorship attribution results obtained on the Int...
Authorship attribution (AA) is the process of identifying the author of a given text and from the ma...
Translatorship attribution deals with accurately attributing a translation to its translator. The ta...
Language embeddings are often used as black-box word-level tools that provide powerful language anal...
We present a novel method for computer-assisted authorship attribution based on characterlevel n-gra...
In practice, training language models for individual authors is often expensive because of limited d...
This study investigates the problem of appropriate choice of texts for the training set in machine-l...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...