It is tempting to treat frequency trends from the Google Books data sets as indicators of the "true" popularity of various words and phrases. Doing so allows us to draw quantitatively strong conclusions about the evolution of cultural perception of a given topic, such as time or gender. However, the Google Books corpus suffers from a number of limitations which make it an obscure mask of cultural popularity. A primary issue is that the corpus is in effect a library, containing one of each book. A single, prolific author is thereby able to noticeably insert new phrases into the Google Books lexicon, whether the author is widely read or not. With this understood, the Google Books corpus remains an important data set to be considered more lexi...
The surge of post-truth political argumentation suggests that we are living in a special historical ...
Although most ‘big data’ relate to the present and very recent past, advances in data processing pow...
Although most ‘big data’ relate to the present and very recent past, advances in data processing pow...
<div><p>It is tempting to treat frequency trends from the Google Books data sets as indicators of th...
It is tempting to treat frequency trends from the Google Books data sets as indicators of the “true ...
The Google Books corpus contains millions of books in a variety of languages. Due to this incredible...
Genügt wohl nicht, wenn ich es kritisiere? Pechenick EA, Danforth CM, Dodds PS (2015) Characterizing...
L'article, publié dans Science, sur une des premières utilisations analytiques de Google Books, fond...
In this Perspective Article we assess the usefulness of Google’s new word frequencies for word recog...
Google recently released ngram frequencies based on Google Books, a massive collection of digitized ...
The objective of this paper is to verify if Google Books Ngram Viewer, a new tool working on a datab...
An author's literary style is influenced by the cultural time period in which the author lives. The ...
"This new interface for Google Books allows you to search more than 155 billion (155,000,000,000) wo...
<p>In the English data sets, the capitalized term rapidly surpasses the uncapitalized term in the 19...
Written language provides a snapshot of linguistic, cultural, and current events information for a g...
The surge of post-truth political argumentation suggests that we are living in a special historical ...
Although most ‘big data’ relate to the present and very recent past, advances in data processing pow...
Although most ‘big data’ relate to the present and very recent past, advances in data processing pow...
<div><p>It is tempting to treat frequency trends from the Google Books data sets as indicators of th...
It is tempting to treat frequency trends from the Google Books data sets as indicators of the “true ...
The Google Books corpus contains millions of books in a variety of languages. Due to this incredible...
Genügt wohl nicht, wenn ich es kritisiere? Pechenick EA, Danforth CM, Dodds PS (2015) Characterizing...
L'article, publié dans Science, sur une des premières utilisations analytiques de Google Books, fond...
In this Perspective Article we assess the usefulness of Google’s new word frequencies for word recog...
Google recently released ngram frequencies based on Google Books, a massive collection of digitized ...
The objective of this paper is to verify if Google Books Ngram Viewer, a new tool working on a datab...
An author's literary style is influenced by the cultural time period in which the author lives. The ...
"This new interface for Google Books allows you to search more than 155 billion (155,000,000,000) wo...
<p>In the English data sets, the capitalized term rapidly surpasses the uncapitalized term in the 19...
Written language provides a snapshot of linguistic, cultural, and current events information for a g...
The surge of post-truth political argumentation suggests that we are living in a special historical ...
Although most ‘big data’ relate to the present and very recent past, advances in data processing pow...
Although most ‘big data’ relate to the present and very recent past, advances in data processing pow...