Many features of texts and languages can now be inferred from statistical analyses using concepts from complex networks and dynamical systems. In this paper, we quantify how topological properties of word co-occurrence networks and intermittency (or burstiness) in word distribution depend on the style of authors. Our database contains 40 books by eight authors who lived in the nineteenth and twentieth centuries, for which the following network measurements were obtained: the clustering coefficient, average shortest path lengths and betweenness. We found that the two factors with stronger dependence on authors were skewness in the distribution of word intermittency and the average shortest paths. Other factors such as betweenness and Zipf's ...
<div><p>In recent years, graph theory has been widely employed to probe several language properties....
Several characteristics of written texts have been inferred from statistical analysis derived from n...
The use of statistical methods to analyze large databases of text has been useful in unveiling patte...
Many features of texts and languages can now be inferred from statistical analyses using concepts fr...
Statistical methods have been widely employed in many practical natural language processing applicat...
Automatic identification of authorship in disputed documents has benefited from complex network theo...
Several characteristics of written texts have been inferred from statistical analysis derived from n...
In this paper we have quantified the consistency of word usage in written texts represented by compl...
In this paper we have quantified the consistency of word usage in written texts represented by compl...
AbstractThe classification of texts has become a major endeavor with so much electronic material ava...
In this paper we have quantified the consistency of word usage in written texts represented by compl...
Statistical methods have been widely employed in many practical natural language processing applicat...
<div><p>The authorship attribution is a problem of considerable practical and technical interest. Se...
This is the author accepted manuscript. The final version is available from Springer nature via the ...
The classification of texts has become a major endeavor with so much electronic material available, ...
<div><p>In recent years, graph theory has been widely employed to probe several language properties....
Several characteristics of written texts have been inferred from statistical analysis derived from n...
The use of statistical methods to analyze large databases of text has been useful in unveiling patte...
Many features of texts and languages can now be inferred from statistical analyses using concepts fr...
Statistical methods have been widely employed in many practical natural language processing applicat...
Automatic identification of authorship in disputed documents has benefited from complex network theo...
Several characteristics of written texts have been inferred from statistical analysis derived from n...
In this paper we have quantified the consistency of word usage in written texts represented by compl...
In this paper we have quantified the consistency of word usage in written texts represented by compl...
AbstractThe classification of texts has become a major endeavor with so much electronic material ava...
In this paper we have quantified the consistency of word usage in written texts represented by compl...
Statistical methods have been widely employed in many practical natural language processing applicat...
<div><p>The authorship attribution is a problem of considerable practical and technical interest. Se...
This is the author accepted manuscript. The final version is available from Springer nature via the ...
The classification of texts has become a major endeavor with so much electronic material available, ...
<div><p>In recent years, graph theory has been widely employed to probe several language properties....
Several characteristics of written texts have been inferred from statistical analysis derived from n...
The use of statistical methods to analyze large databases of text has been useful in unveiling patte...