Abstract

Wrapper Induction for Information Extraction by Nicholas Kushmerick Chairperson of Supervisory Committee: Professor Daniel S. Weld Department of Computer Science and Engineering The Internet presents numerous sources of useful information---telephone directories, product catalogs, stock quotes, weather forecasts, etc. Recently, many systems have been built that automatically gather and manipulate such information on a user's behalf. However, these resources are usually formatted for use by people (e.g., the relevant content is embedded in HTML pages), so extracting their content is difficult. Wrappers are often used for this purpose. A wrapper is a procedure for extracting a particular ...

Links and resources

Tags

community

  • @gridinoc
  • @bsmyth
  • @brefeld
  • @diego_ma
  • @dblp
  • @cbrewster
@diego_ma's tags highlighted