We study techniques for identifying an anonymous author via linguistic stylometry, i.e., by comparing the author's writing style against a corpus of texts of known authorship. We experimentally demonstrate the effectiveness of our techniques with as many as 100,000 candidate authors. Given the increasing availability of writing samples online, our result has serious implications for anonymity and free speech: an anonymous blogger or whistleblower may be unmasked unless they take steps to obfuscate their writing style. While there is a huge body of literature on authorship recognition based on writing style, almost none of it has studied corpora of more than a few hundred authors. The problem becomes qualitatively different at a large scale, as we show...
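As a rough illustration of what "comparing the writing style against a corpus of texts of known authorship" can mean in practice, the sketch below ranks candidate authors by the cosine similarity of function-word frequencies. This is not the method evaluated in the paper: the feature list `FUNCTION_WORDS`, the helper names `style_vector`, `cosine`, and `attribute`, and the choice of a nearest-neighbor comparison are all assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical feature set: a few English function words. Real stylometric
# systems typically use far richer features; this list is only an assumption.
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in",
                  "that", "is", "it", "but", "which", "however"]


def style_vector(text):
    """Relative frequency of each function word in the text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return [counts[w] / total for w in FUNCTION_WORDS]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def attribute(anonymous_text, known_corpus):
    """Rank candidate authors by style similarity, most similar first.

    known_corpus maps an author name to a list of that author's texts.
    """
    target = style_vector(anonymous_text)
    scores = {
        author: cosine(target, style_vector(" ".join(texts)))
        for author, texts in known_corpus.items()
    }
    return sorted(scores, key=scores.get, reverse=True)
```

With a corpus of this shape, `attribute(text, corpus)[0]` is the best-matching candidate. A comparison this simple would not be expected to hold up at the scale of 100,000 candidate authors discussed above; it only makes the basic idea concrete.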
Electronic text stylometry is concerned with analyzing the writing styles of input electronic texts ...
Authorship attribution is the task of identifying the writer of an unknown text and categorizing it to known w...
Authorship verification relies on identification of a given document to verify whether it is written b...
In the era of the internet, the use of online blogs, forums, social networks and email is very ...
Stylometry consists of the analysis of linguistic styles and writing characteristics of th...
In this paper, we use a blog corpus to demonstrate that we can often identify the author of an anony...
Enhancing information retrieval systems with the ability to take the writing style of people into ac...
Authorship analysis aims at studying writing styles to predict authorship of a...
In digital forensics, questions often arise about the authors of documents: their identity, demograp...
Widespread availability of free, public blog platforms has facilitated growth in the amount of indiv...
My paper was well received by the attendees. In my session, I was the only one whose paper attracted a g...
Establishing authorship of online texts is fundamental to combat cybercrimes. Unfortunately, text le...
In this paper, we introduce a new method of representation learning that aims ...
Stylometry is a form of authorship attribution that...