The dataset has been collected in the frame of the Prac1 of the subject Tipology and Data Life Cycle of the Master's Degree in Data Science of the Universitat Oberta de Catalunya (UOC). The dataset contains 25 variables and 52478 records corresponding to books on the GoodReads Best Books Ever list (the larges list on the site). Original code used to retrieve the dataset can be found on github repository: github.com/scostap/goodreads_bbe_dataset The data was retrieved in two sets, the first 30000 books and then the remainig 22478. Dates were not parsed and reformated on the second chunk so publishDate and firstPublishDate are representet in a mm/dd/yyyy format for the first 30000 records and Month Day Year for the rest. Book cover image...
The dataset 'Booktitles per capita world 1500-2010' originaly forms part of the collection of Biblio...
A list of the books and authors most read by the members of 50 large English-based Goodreads book gr...
ABSTRACT Books are one of the most widely used objects in daily life. With the development of the ti...
The books dataset includes information about all published books from Springer Nature. This dataset ...
Combining human expertise with information from book-consumer digital data may generate what it take...
Before defining the dataset, I would like to explain that this is a part of an university project, m...
Books dataset is a collection of book descriptions from the Goodreads (https://www.goodreads.com/) w...
This dataset contains the data from the books in the website https://books.toscrape.com/, which is a...
Este conjunto de datos fue obtenido a partir de la página web todostuslibros y cuenta con informació...
There’s a significant struggle in the literary publishing industry with gathering data. With thousan...
The core of this experiment is the use of the entity-fishing algorithm, as created and deployed by D...
For more than a decade, open access book platforms have been distributing titles in order to maximis...
Python script and dataset. The dataset contains the number of publications in French one can find i...
The BookSampo dataset provides information as linked data on fiction literature published in Finland...
None of the American edition published after part 4. Parts 5 and 6 have imprinted: London, G, Routle...
The dataset 'Booktitles per capita world 1500-2010' originaly forms part of the collection of Biblio...
A list of the books and authors most read by the members of 50 large English-based Goodreads book gr...
ABSTRACT Books are one of the most widely used objects in daily life. With the development of the ti...
The books dataset includes information about all published books from Springer Nature. This dataset ...
Combining human expertise with information from book-consumer digital data may generate what it take...
Before defining the dataset, I would like to explain that this is a part of an university project, m...
Books dataset is a collection of book descriptions from the Goodreads (https://www.goodreads.com/) w...
This dataset contains the data from the books in the website https://books.toscrape.com/, which is a...
Este conjunto de datos fue obtenido a partir de la página web todostuslibros y cuenta con informació...
There’s a significant struggle in the literary publishing industry with gathering data. With thousan...
The core of this experiment is the use of the entity-fishing algorithm, as created and deployed by D...
For more than a decade, open access book platforms have been distributing titles in order to maximis...
Python script and dataset. The dataset contains the number of publications in French one can find i...
The BookSampo dataset provides information as linked data on fiction literature published in Finland...
None of the American edition published after part 4. Parts 5 and 6 have imprinted: London, G, Routle...
The dataset 'Booktitles per capita world 1500-2010' originaly forms part of the collection of Biblio...
A list of the books and authors most read by the members of 50 large English-based Goodreads book gr...
ABSTRACT Books are one of the most widely used objects in daily life. With the development of the ti...