A zipped folder of files keyed to HathiTrust volume IDs, each representing a volume of English-language fiction. The files contain word counts used in Ted Underwood, "The Historical Significance of Textual Distances," LaTeCH-CLfL, Santa Fe, 2018. The data is drawn ultimately from HathiTrust Digital Library.Ope
A popular form of term weighting in texts is to use TF*IDF, which takes a text's term frequencies an...
Contains fulltext : 67952.pdf (publisher's version ) (Closed access
Metadata for 774 works of fiction referenced in "Mapping Mutable Genres in Structurally Complex Volu...
A zipped folder of files keyed to HathiTrust volume IDs, each representing a volume of English-langu...
Data to support calculations in chapter 2 of the book _Distant Horizons._ It includes word counts fo...
This workset is data in support of the article "Mapping Mutable Genres in Structurally Complex Volum...
Data to support calculations in chapter 1 of the book _Distant Horizons._ It includes word counts fo...
A topic model of 29,341 volumes of fiction, written in English and published between 1880 and 1999. ...
Tab-separated files containing wordcounts from volumes of fiction. The names of the files are keyed ...
Data to support calculations in chapter 3 of the book _Distant Horizons._ It includes word counts fo...
Metadata for English-language fiction in HathiTrust Digital Library, after 1922. These volumes were ...
Using regularized logistic regression and hidden Markov models, we predict genre at the page level i...
Corpus-level term statistics are valuable for numerous text analysis activities, such as term weight...
The HathiTrust Digital Library (HTDL) was founded in 2008 with just over 2 million volumes in the co...
Derived data on time features related to 1,069 English-language novels published between 1700-1900. ...
A popular form of term weighting in texts is to use TF*IDF, which takes a text's term frequencies an...
Contains fulltext : 67952.pdf (publisher's version ) (Closed access
Metadata for 774 works of fiction referenced in "Mapping Mutable Genres in Structurally Complex Volu...
A zipped folder of files keyed to HathiTrust volume IDs, each representing a volume of English-langu...
Data to support calculations in chapter 2 of the book _Distant Horizons._ It includes word counts fo...
This workset is data in support of the article "Mapping Mutable Genres in Structurally Complex Volum...
Data to support calculations in chapter 1 of the book _Distant Horizons._ It includes word counts fo...
A topic model of 29,341 volumes of fiction, written in English and published between 1880 and 1999. ...
Tab-separated files containing wordcounts from volumes of fiction. The names of the files are keyed ...
Data to support calculations in chapter 3 of the book _Distant Horizons._ It includes word counts fo...
Metadata for English-language fiction in HathiTrust Digital Library, after 1922. These volumes were ...
Using regularized logistic regression and hidden Markov models, we predict genre at the page level i...
Corpus-level term statistics are valuable for numerous text analysis activities, such as term weight...
The HathiTrust Digital Library (HTDL) was founded in 2008 with just over 2 million volumes in the co...
Derived data on time features related to 1,069 English-language novels published between 1700-1900. ...
A popular form of term weighting in texts is to use TF*IDF, which takes a text's term frequencies an...
Contains fulltext : 67952.pdf (publisher's version ) (Closed access
Metadata for 774 works of fiction referenced in "Mapping Mutable Genres in Structurally Complex Volu...