Being able to identify the author of an unknown text is crucial. Although it is a well-studied field, it is still an open problem, since a standard approach has yet to be found. In this notebook, we propose our model for the Authorship Attribution task of PAN 2019, that focuses on cross-domain setting covering 4 different languages: French, Italian, English, and Spanish. We use n-grams of characters, words, stemmed words, and distorted text. Our model has an SVM for each feature and an ensemble architecture. Our final results outperform the baseline given by PAN in almost every problem. With this model, we reach the second place in the task with an F1-score of 68%
International audienceThis paper reports on the procedure and learning models we adopted for the 'PA...
Abstract We adopt Koppel et al.’s unmasking approach [5] as the major frame-work of our authorship v...
We describe the approach that we submitted to the 2015 PAN competition for the author profiling task...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
Authorship attribution (AA) is a very well studied research subject and the most prominent subtask o...
Authorship attribution is an important problem in information retrieval and computational linguistic...
Abstract This paper presents our approach to the Author Profiling (AP) task at PAN 2016. The task ai...
Abstract. The proposed solution for authorship attribution combines a couple of the most important f...
Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is availab...
Abstract. Authorship verification is one of the most challenging tasks in stylebased text categoriza...
In this paper we describe our k-Nearest Neighbor (k-NN) based Authorship Verification method for the...
We provide a corpus which comprises a set of cross-domain authorship attribution problems in each of...
Authors writing documents imprint identifying information within their texts: vocabulary, registry, ...
Most previous research on authorship attribution (AA) assumes that the training and test data are dr...
International audienceThis paper reports on the procedure and learning models we adopted for the 'PA...
Abstract We adopt Koppel et al.’s unmasking approach [5] as the major frame-work of our authorship v...
We describe the approach that we submitted to the 2015 PAN competition for the author profiling task...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
Authorship attribution (AA) is a very well studied research subject and the most prominent subtask o...
Authorship attribution is an important problem in information retrieval and computational linguistic...
Abstract This paper presents our approach to the Author Profiling (AP) task at PAN 2016. The task ai...
Abstract. The proposed solution for authorship attribution combines a couple of the most important f...
Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is availab...
Abstract. Authorship verification is one of the most challenging tasks in stylebased text categoriza...
In this paper we describe our k-Nearest Neighbor (k-NN) based Authorship Verification method for the...
We provide a corpus which comprises a set of cross-domain authorship attribution problems in each of...
Authors writing documents imprint identifying information within their texts: vocabulary, registry, ...
Most previous research on authorship attribution (AA) assumes that the training and test data are dr...
International audienceThis paper reports on the procedure and learning models we adopted for the 'PA...
Abstract We adopt Koppel et al.’s unmasking approach [5] as the major frame-work of our authorship v...
We describe the approach that we submitted to the 2015 PAN competition for the author profiling task...