In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discriminant Analysis (MDA) and the other based on a Support Vector Machine (SVM). The classification features we exploit are based on word frequencies in the text. We adopt an approach of preprocessing each text by stripping it of all characters except a-z and space. This is in order to increase the portability of the software to different types of texts. We test the methodology on a corpus of undisputed English texts, and use leave-one-out cross validation to demonstrate classification accuracies in excess of 90%. We further test our methods on the Federalist Papers, which have a partly disputed authorship and a fair degree of scholarly consensus...
This paper covers a text classification problem: the identification of the author of a text. It is n...
Automatic authorship attribution is an umbrella term for methods trying to derive authorship from te...
Authorship attribution is a task to identify the writer of unknown text and categorize it to known w...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
In order to authorship attribution techniques, the Federalist Papers have been applied as a testing-...
Authorship attribution is the process of determining the writer of a document. In literature, there ...
Background: To recognize the authors of the texts by the use of statistical tools, one first needs t...
Techniques for identifying the author of an unattributed document can be applied to problems in info...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
Techniques that can effectively identify authors of texts are of great importance in scenarios such ...
Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is availab...
In recent years, Twitter has become a popular testing ground for techniques in authorship attributio...
With the rapid growth of internet usage, authorship authentication of online messages became challen...
This paper covers a text classification problem: the identification of the author of a text. It is n...
Automatic authorship attribution is an umbrella term for methods trying to derive authorship from te...
Authorship attribution is a task to identify the writer of unknown text and categorize it to known w...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
In order to authorship attribution techniques, the Federalist Papers have been applied as a testing-...
Authorship attribution is the process of determining the writer of a document. In literature, there ...
Background: To recognize the authors of the texts by the use of statistical tools, one first needs t...
Techniques for identifying the author of an unattributed document can be applied to problems in info...
Authorship attribution is a problem in information retrieval and computational linguistics that invo...
Techniques that can effectively identify authors of texts are of great importance in scenarios such ...
Existing research on Authorship Attribution (AA) focuses on texts for which a lot of data is availab...
In recent years, Twitter has become a popular testing ground for techniques in authorship attributio...
With the rapid growth of internet usage, authorship authentication of online messages became challen...
This paper covers a text classification problem: the identification of the author of a text. It is n...
Automatic authorship attribution is an umbrella term for methods trying to derive authorship from te...
Authorship attribution is a task to identify the writer of unknown text and categorize it to known w...