Overview

Project Gutenberg (PG) is a volunteer effort to digitize and archive cultural works, to 'encourage the creation and distribution of eBooks'. It was founded in 1971 by Michael S. Hart and is the oldest digital library. This dataset is a collection of the top 1000 most popular books on Project Gutenberg, as determined by downloads. Each book has information about its authorship, publication date, congressional classication, and a few other fields. It also has some simple, computed statistics based on common metrics such as sentiment analysis, Flesch Kincaid Reading level, and average sentence length.

https://www.gutenberg.org/ebooks/search/?sort_order=downloads

Downloads

Download all of the following files.

Usage

Begin by adding the above JAR file to your classpath. This library has 1 function you can use.
import corgis.classics.ClassicsLibrary;
import corgis.classics.domain.*;
import java.util.ArrayList;

public class ExampleClass {
    public static void main(String[] args) {
        // Get access to the library
        ClassicsLibrary classicsLibrary = new ClassicsLibrary();
        // Access data inside the library
        ArrayList<Book> list_of_book = classicsLibrary.getBooks(false);
    }
}
Additionally, some of the functions can optionally return Big Data using the last argument (setting it to true). If you use this Big Data, it may be much slower. Wait until you are sure your code is correct before using these examples.
import corgis.classics.ClassicsLibrary;
import corgis.classics.domain.*;
import java.util.ArrayList;

public class ExampleClass {
    public static void main(String[] args) {
        // Get access to the library
        ClassicsLibrary classicsLibrary = new ClassicsLibrary();
        // Access data inside the library
        ArrayList<Book> list_of_book = classicsLibrary.getBooks(true);
    }
}

Documentation

We provide full JavaDoc for this library. Alternatively, you may refer to the below:

 classicsLibrary.getBooks(test)

Returns books from the dataset.