LLMpediaThe first transparent, open encyclopedia generated by LLMs

Project Gutenberg

Generated by DeepSeek V3.2
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Memex Hop 2
Expansion Funnel Raw 44 → Dedup 34 → NER 26 → Enqueued 26
1. Extracted44
2. After dedup34 (None)
3. After NER26 (None)
Rejected: 8 (not NE: 8)
4. Enqueued26 (None)
Project Gutenberg
NameProject Gutenberg
FounderMichael S. Hart
Established1 December 1971
CountryUnited States
Websitewww.gutenberg.org

Project Gutenberg. It is a volunteer-driven digital library offering over 70,000 free e-books, primarily consisting of works for which U.S. copyright has expired. Founded by Michael S. Hart in 1971, it is the oldest digital library and pioneered the concept of free electronic texts. The project's mission is to encourage the creation and distribution of ebooks, making cultural works widely accessible in open formats.

History

The project was launched on December 1, 1971, when Michael S. Hart, a student at the University of Illinois at Urbana–Champaign, was given extensive computer time on a Xerox Sigma V mainframe. He typed the text of the United States Declaration of Independence, creating the first electronic document. Hart envisioned a future where vast computing resources would be available to the public and sought to make significant literary works freely available. Early milestones included the digitization of the Bible, the works of William Shakespeare, and Alice's Adventures in Wonderland by Lewis Carroll. For many years, operations were managed by Hart from Illinois, with key support from institutions like the Illinois Benedictine College. Following Hart's death in 2011, the project continues under the guidance of the Project Gutenberg Literary Archive Foundation, a non-profit corporation.

Content and scope

The collection focuses on culturally significant items, with a heavy emphasis on literature from the Western canon. This includes a vast array of fiction from authors like Jane Austen, Charles Dickens, and Fyodor Dostoevsky, as well as significant non-fiction, historical documents, and reference works like Roget's Thesaurus. While the majority of texts are in English, the library hosts works in over 60 languages, including German, French, and Portuguese. The scope is defined by copyright status; it primarily contains works published before 1929, ensuring they are in the public domain in the United States. Some items include audio books and sheet music.

The operation is strictly based on United States copyright law. It distributes works that are in the public domain in the United States, meaning their copyright has expired or they were never eligible. Due to variations in international law, such as the Berne Convention and differing copyright terms in the European Union, a work free in the U.S. may still be under copyright elsewhere, such as in the United Kingdom. The project does not accept works under modern copyright or those protected by trademark. All texts are released free of charge without DRM restrictions, under a philosophy aligned with the Free Software Movement.

Technology and formats

Texts are produced by volunteers who digitize, proofread, and format works. The project emphasizes simplicity and longevity, offering files in plain ASCII text as the primary, most durable format. It also provides multiple other formats for convenience, including HTML, EPUB, MOBI for Amazon Kindle, and PDF. The website and catalog are maintained by the distributed Project Gutenberg Consortia Center. The technical infrastructure has evolved from early mainframe storage to modern server farms, ensuring global accessibility.

Impact and reception

It is widely credited with democratizing access to literature and inspiring the entire ebook industry. Its model of volunteer-driven digitization directly influenced subsequent major digital library projects like Google Books and the Internet Archive. The project has received praise from educators, librarians, and accessibility advocates for providing free resources. It has been cited in numerous academic papers and was a forerunner to the open access movement. Criticisms have occasionally focused on the quality of early transcriptions or the predominance of older, canonical works.

Many derivative and sister projects have emerged. Distributed Proofreaders is a major web-based community that provides the majority of new texts. The Project Gutenberg Consortia Center aggregates collections from various sources. Internationally, projects like Project Gutenberg Australia and Project Gutenberg Canada handle works in the public domain under their respective national laws. Other notable related efforts include Wikisource, the Internet Archive, and the HathiTrust digital library.