
Cleaneval: a Competition for Cleaning Web Pages

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA), May 2008.

Abstract

Cleaneval is a shared task and competitive evaluation on the topic of cleaning arbitrary web pages, with the goal of preparing web data for use as a corpus for linguistic and language technology research and development. The first exercise took place in 2007. We describe how it was set up, the results, and the lessons learnt.

