Аннотация

This paper describes Seeker, a platform for large-scale text analytics, and SemTag, an application written on the platform to perform automated semantic tagging of large corpora. We apply SemTag to a collection of approximately 264 million web pages, and generate approximately 434 million automatically disambiguated semantic tags, published to the web as a label bureau providing metadata regarding the 434 million annotations. To our knowledge, this is the largest scale semantic tagging effort...

Описание

stuff from citeyoulike

Линки и ресурсы

тэги

сообщество

  • @gridinoc
  • @sam_chapman
  • @brightbyte
@brightbyte- тэги данного пользователя выделены