The dataset genres.json contains (sub)genre classifications for novels published between 1770 and 1915. The genres covered are
gothic novels
"silver fork" novels
national tale novels
The project combines two sources of information. The word counts themselves come from the HathiTrust Research Center (HTRC), which has tabulated them at the page level in 4.8 million public-domain volumes. Information about genre comes from a parallel project led by Ted Underwood, and supported by the National Endowment for the Humanities and the American Council of Learned Societies.
Our flagship collection, under development since 1987, covers the history, literature and culture of the Greco-Roman world. We are applying what we have learned from Classics to other subjects within the humanities and beyond.
From the first book printed in English by William Caxton, through the age of Spenser and Shakespeare and the tumult of the English Civil War, Early English Books Online (EEBO) will contain over 125,000 titles listed in Pollard and Redgrave's Short-Title Catalogue (1475-1640), Wing's Short-Title Catalogue (1641-1700), the Thomason Tracts (1640-1661), and the Early English Tract Supplement - all in full digital facsimile from the Early English Books microfilm collection.
The Fabian Society collection includes: Pamphlets published as part of the Fabian Tracts series, 1884-2000, Minutes of Executive Committee meetings and other key committee meetings, 1884 to 1954, Pamphlets published as part of the Young Fabian pamphlet series, 1961-2009. The London School of Economics and Political Science
The English Short Title Catalogue (ESTC) lists over 460,000 items published between 1473 and 1800 mainly, but not exclusively, in English published mainly in the British Isles and North America from the collections of the British Library and over 2,000 other libraries
Deutscher Wortschatz contains data generated from newspapers and web resources that are publicly available. The data were collected per language and encompass statistics about co-occurrences of words in randomly selected sentences.
The Incunabula Short Title Catalogue is the international database of 15th-century European printing created by the British Library with contributions from institutions worldwide.