This chapter proposes an objective approach to the formal analysis of literary prose in English in order to investigate the relation between lexical density and judgments of canonicity. Based on the concepts of literariness proposed by the Russian Formalists and lexical variety, a mathematical index is designed, relating three variables which take the materiality of text into consideration: (a) relative frequency of lexical bundles, (b) lexical bundle type/token ratio, and (c) word type/token ratio. The index is described and illustrated with 46 canonical and non-canonical literary works. Statistical analysis shows no significant relation between lexical richness and decisions of what has been classified as canonical, indicating that these ...
We report an ongoing study on statistical characteristics of texts written in different genres. It h...
This study investigates global properties of three categories of English text: canonical fiction, no...
While the use of statistical physics methods to analyze large corpora has been useful to unveil many...
This paper compares “The Da Vinci Code” and its translation in Portuguese against the language of ca...
AbstractFor some time now research has been carried out in the field of lexicometry into the statist...
The possibility of challenging traditional practices is one of the advantages of carrying out comput...
We consider the task of predicting how literary a text is, with a gold standard from human ratings. ...
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribu...
Indices of lexical diversity have been used to estimate the size of a writer’s vocabulary and/or a w...
How the frequency of words may be interpreted in the context of an informational analysis of textual...
It is explained why value judgments may be tolerated and analyzed in an empirical study of literatur...
This thesis studies parsing and literature with the Data-Oriented Parsing framework, which assumes t...
This study deals with lexical density of short stories written by O. Henry. The objectives of the s...
From at least as early as William Empson’s The Structure of Complex Words (1951), complexity has com...
This thesis examines what automatic indexing and genre classification may bring to fiction. The thes...
We report an ongoing study on statistical characteristics of texts written in different genres. It h...
This study investigates global properties of three categories of English text: canonical fiction, no...
While the use of statistical physics methods to analyze large corpora has been useful to unveil many...
This paper compares “The Da Vinci Code” and its translation in Portuguese against the language of ca...
AbstractFor some time now research has been carried out in the field of lexicometry into the statist...
The possibility of challenging traditional practices is one of the advantages of carrying out comput...
We consider the task of predicting how literary a text is, with a gold standard from human ratings. ...
Copyright © 2020 for this paper by its authors. Use permitted under Creative Commons License Attribu...
Indices of lexical diversity have been used to estimate the size of a writer’s vocabulary and/or a w...
How the frequency of words may be interpreted in the context of an informational analysis of textual...
It is explained why value judgments may be tolerated and analyzed in an empirical study of literatur...
This thesis studies parsing and literature with the Data-Oriented Parsing framework, which assumes t...
This study deals with lexical density of short stories written by O. Henry. The objectives of the s...
From at least as early as William Empson’s The Structure of Complex Words (1951), complexity has com...
This thesis examines what automatic indexing and genre classification may bring to fiction. The thes...
We report an ongoing study on statistical characteristics of texts written in different genres. It h...
This study investigates global properties of three categories of English text: canonical fiction, no...
While the use of statistical physics methods to analyze large corpora has been useful to unveil many...