We present an investigation of recently proposed character and word sequence kernels for the task of authorship attribution based on relatively short texts. Performance is compared with two corresponding probabilistic approaches based on Markov chains. Several configurations of the sequence kernels are studied on a relatively large dataset (50 authors), where each author covered several topics. Utilising Moffat smoothing, the two probabilistic approaches obtain similar performance, which in turn is comparable to that of character sequence kernels and is better than that of word sequence kernels. The results further suggest that when using a realistic setup that takes into account the case of texts which are not written by any hypothesised a...
Authorship detection is a challenging task due to many design choices the user has to decide on. The...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
We present an investigation of recently proposed character and word sequence kernels for the task of...
We present an investigation of recently proposed character and word sequence kernels for the task of...
We investigate the use of recently proposed character and word sequence kernels for the task of auth...
We investigate the use of recently proposed character and word sequence kernels for the task of auth...
Authorship attribution (AA) is the task of identifying authors of disputed or anonymous texts. It ca...
Authorship attribution is a task to identify the writer of unknown text and categorize it to known w...
Background: To recognize the authors of the texts by the use of statistical tools, one first needs t...
Applications of authorship attribution ‘in the wild ’ [Koppel, M., Schler, J., and Argamon, S. (2010...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
We define a variable-order Markov model, representing a Probabilistic Context Free Grammar, built fr...
Conditional Complexity of Compression for Authorship Attribution Abstract: We introduce new stylomet...
In practice, training language models for individual authors is often expensive because of limited d...
Authorship detection is a challenging task due to many design choices the user has to decide on. The...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
We present an investigation of recently proposed character and word sequence kernels for the task of...
We present an investigation of recently proposed character and word sequence kernels for the task of...
We investigate the use of recently proposed character and word sequence kernels for the task of auth...
We investigate the use of recently proposed character and word sequence kernels for the task of auth...
Authorship attribution (AA) is the task of identifying authors of disputed or anonymous texts. It ca...
Authorship attribution is a task to identify the writer of unknown text and categorize it to known w...
Background: To recognize the authors of the texts by the use of statistical tools, one first needs t...
Applications of authorship attribution ‘in the wild ’ [Koppel, M., Schler, J., and Argamon, S. (2010...
In this paper, we develop two automated authorship attribution schemes, one based on Multiple Discri...
We define a variable-order Markov model, representing a Probabilistic Context Free Grammar, built fr...
Conditional Complexity of Compression for Authorship Attribution Abstract: We introduce new stylomet...
In practice, training language models for individual authors is often expensive because of limited d...
Authorship detection is a challenging task due to many design choices the user has to decide on. The...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...
The aim of this study is to find such a minimal size of text samples for authorship attribution that...