Abstract

This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages, and generate approximately 434 million automatically disambiguated semantic tags, published to the web as a label bureau providing metadata regarding the 434 million annotations. To our knowledge, this is the largest scale semantic tagging effort...

Description

stuff from citeyoulike

Links and resources

Tags

community

  • @gridinoc
  • @sam_chapman
  • @brightbyte
@brightbyte's tags highlighted