The Web Data Commons project extracts structured data from the Common Crawl, the largest web corpus available to the public, and provides the extracted data for public download in order to support researchers and companies in exploiting the wealth of information that is available on the Web.
RDFa is an extension to HTML5 that helps you mark up things like People, Places, Events, Recipes and Reviews. Search engines and web services use this markup to generate better search listings and give you better visibility on the Web, so that people can find your website more easily.
J. Frey and S. Hellmann. 13th International Conference on Semantic Systems Proceedings (SEMANTiCS 2017) - Posters & Demonstrations Track, (September 2017)
S. Staab, J. Lehmann, and R. Verborgh. Companion Proceedings of The Web Conference 2018, pages 885-886. Republic and Canton of Geneva, Switzerland, International World Wide Web Conferences Steering Committee, (2018)
V. Alexiev and G. Casamayor. Linked Data in Linguistics (LDL-2016): Managing, Building and Using Linked Language Resources, Portorož, Slovenia, (May 2016)
N. Ikonomov, B. Simeonov, J. Parvanova, and V. Alexiev. Digital Presentation and Preservation of Cultural and Scientific Heritage (DiPP 2013), Veliko Tarnovo, Bulgaria, (September 2013)