Overview

Project Gutenberg (PG) is a volunteer effort to digitize and archive cultural works and to 'encourage the creation and distribution of eBooks'. Founded in 1971 by Michael S. Hart, it is the oldest digital library. This dataset collects the 1000 most popular books on Project Gutenberg, ranked by number of downloads. Each book record includes its authorship, publication date, Library of Congress classification, and a few other fields. It also includes some simple computed statistics, such as sentiment analysis scores, Flesch-Kincaid reading level, and average sentence length.

https://www.gutenberg.org/ebooks/search/?sort_order=downloads
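
The computed statistics mentioned above can be illustrated with a minimal sketch. It assumes the standard Flesch-Kincaid grade-level formula, 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59, and uses a rough vowel-group heuristic for syllables. The function names and the heuristic are assumptions for illustration only; the dataset's own values may have been computed differently.

```python
import re


def count_syllables(word):
    """Rough heuristic (an assumption): one syllable per group of vowels."""
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def text_stats(text):
    """Return average sentence length and Flesch-Kincaid grade for a text."""
    # Naive sentence split on terminal punctuation; real texts need more care.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    syllables = sum(count_syllables(w) for w in words)
    avg_sentence_length = len(words) / len(sentences)
    avg_syllables_per_word = syllables / len(words)

    # Standard Flesch-Kincaid grade-level formula.
    fk_grade = 0.39 * avg_sentence_length + 11.8 * avg_syllables_per_word - 15.59

    return {
        "sentences": len(sentences),
        "words": len(words),
        "syllables": syllables,
        "average_sentence_length": avg_sentence_length,
        "flesch_kincaid_grade": fk_grade,
    }


if __name__ == "__main__":
    print(text_stats("The cat sat. The dog ran fast."))
```

Very short, simple texts can produce a negative grade level under this formula, so the scores are most meaningful for book-length input.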

Metrics

Classics Data File

Overall
  Length: 1006 rows
  Height: 5
  Size: 2027.138 kilobytes
  Indexes: 1
Atomics
  Total Count: 38
  Integers: 14 (37%)
  Floats: 14 (37%)
  Strings: 10 (26%)
  Booleans: 0 (0%)
  Longs: 0 (0%)
  Null/None Values: 0 (0%)
  Unknown Types: 0 (0%)
Dictionaries
  Total Count: 10
  Average Branching Factor: 4
  Levels: 3
  Inconsistent Field Count: 0
Lists
  Total Count: 5
  Complex Lists: 1
Unions
  Total Count: 0
Tags
  Total Count: 16
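
As a rough illustration of how counts like the ones above could be derived, the sketch below walks one nested record and tallies each value by type. The record and its field names are hypothetical and are not taken from the data file, and this is not necessarily how the published metrics were produced.

```python
from collections import Counter


def count_types(value, counts=None):
    """Recursively tally dictionaries, lists, and atomic values by type."""
    if counts is None:
        counts = Counter()
    if isinstance(value, dict):
        counts["dictionaries"] += 1
        for child in value.values():
            count_types(child, counts)
    elif isinstance(value, list):
        counts["lists"] += 1
        for child in value:
            count_types(child, counts)
    elif isinstance(value, bool):
        # bool is checked before int because bool is a subclass of int.
        counts["booleans"] += 1
    elif isinstance(value, int):
        counts["integers"] += 1
    elif isinstance(value, float):
        counts["floats"] += 1
    elif isinstance(value, str):
        counts["strings"] += 1
    elif value is None:
        counts["nulls"] += 1
    else:
        counts["unknown"] += 1
    return counts


# Hypothetical record; the field names are illustrative only.
record = {
    "title": "Example Title",
    "author": {"name": "A. Writer", "birth": 1800},
    "metrics": {"average sentence length": 18.5},
    "subjects": ["Fiction", "Satire"],
}

counts = count_types(record)
atomic_kinds = ("integers", "floats", "strings", "booleans", "nulls", "unknown")
atomic_total = sum(counts[kind] for kind in atomic_kinds)

if __name__ == "__main__":
    for kind in atomic_kinds:
        share = round(100 * counts[kind] / atomic_total)
        print(f"{kind}: {counts[kind]} ({share}%)")
```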

Development: