Description

  1. Metadata Statistics for a Large Web Corpus

ABSTRACT We provide an analysis of the adoption of metadata standards on the Web based on a large crawl of the Web. In particular, we look at which syntaxes and vocabularies publishers are using to mark up data inside HTML pages. We also describe the process we followed and the difficulties involved in web data extraction.

Users

  • @astrupp
