копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A brief survey of web data extraction tools

A. Laender, B. Ribeiro-Neto, A. da Silva, и J. Teixeira. SIGMOD Rec., 31 (2): 84--93 (июня 2002)
DOI: http://dx.doi.org/10.1145/565117.565137

Аннотация

In the last few years, several works in the literature have addressed the problem of data extraction from Web pages. The importance of this problem derives from the fact that, once extracted, the data can be handled in a way similar to instances of a traditional database. The approaches proposed in the literature to address the problem of Web data extraction use techniques borrowed from areas such as natural language processing, languages and grammars, machine learning, information retrieval, databases, and ontologies. As a consequence, they present very distinct features and capabilities which make a direct comparison difficult to be done. In this paper, we propose a taxonomy for characterizing Web data extraction fools, briefly survey major Web data extraction tools described in the literature, and provide a qualitative analysis of them. Hopefully, this work will stimulate other studies aimed at a more comprehensive analysis of data extraction approaches and tools for Web data.

Линки и ресурсы

ключ BibTeX: Laender2002
тип записи: article
адрес: New York, NY, USA
год: 2002
месяц: June
журнал: SIGMOD Rec.
номер: 2
страницы: 84--93
издательство: ACM Press
том: 31
issn: 0163-5808
posted-at: 2007-12-20 19:58:25
citeulike-article-id: 1111013
priority: 4
DOI: http://dx.doi.org/10.1145/565117.565137
url: http://dx.doi.org/10.1145/565117.565137

тэги

@lillejul- тэги данного пользователя выделены

Цитировать эту публикацию

искать в

Метаданные

Последнее изменение 16 лет назад
Создан 16 лет назад

Комментарии и рецензии
(0)

Комментарии, или рецензии отсутствуют. Вы можете их написать!