More and more websites have started to embed structured data describing products, people, organizations, places, and events into their HTML pages using markup standards such as Microdata, JSON-LD, RDFa, and Microformats. The Web Data Commons project extracts this data from several billion web pages. So far the project provides 11 different data set releases extracted from the Common Crawls 2010 to 2022. The project provides the extracted data for download and publishes statistics about the deployment of the different formats.
M. Atzmueller, и F. Puppe. Proc. 15th International Conference on Knowledge Engineering and Knowledge Management (EKAW 2006), 4248, стр. 318--325. (2006)
M. Atzmueller, J. Baumeister, и F. Puppe. Proc. 15th Intl. Conference on Applications of Declarative Programming and Knowledge Management (INAP 2004), стр. 203--213. Potsdam, Germany, (2004)
M. Atzmueller, J. Baumeister, и F. Puppe. Artificial Intelligence in Medicine. Special Issue on Intelligent Data Analysis in Medicine, 37 (1):
19--30(2006)
M. Atzmueller, F. Puppe, и H. Buscher. Proc. 19th International Joint Conference on Artificial Intelligence (IJCAI-05), стр. 647--652. Edinburgh, Scotland, (2005)
J. Baumeister, M. Atzmueller, и F. Puppe. Advances in Case-Based Reasoning, том 2416 из LNAI, стр. 28-42. (2002)Proc. 6th European Conference on Case-Based Reasoning (ECCBR 2002).
M. Atzmueller, F. Puppe, и H. Buscher. Proc. 10th International Workshop on Intelligent Data Analysis in Medicine and Pharmacology (IDAMAP-2005), стр. 46--51. Aberdeen, Scotland, (2005)
M. Atzmueller, J. Baumeister, и F. Puppe. Medical Data Analysis, Proc. 4th Intl. Symposium on Medical Data Analysis (ISMDA 2003), LNCS 2868, стр. 23-30. (2003)
R. Baeza-Yates, и A. Tiberi. KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, стр. 76--85. New York, NY, USA, ACM, (2007)