Article

Extraction of Most Relevant Data from Deep Web Mining

International Journal of Innovative Science and Modern Engineering (IJISME), 3 (1): 16-18 (December 2014)

Abstract

Extracting content from deep web pages is difficult because the relevant data depend on the programming language used to build each page. The challenge grows every day as web databases expand, which has led researchers to concentrate on deep web mining. When a user submits a query to a search engine, it returns a list of the best-matching web pages, each with a short summary such as the title and some text from the site. However, the information held in the underlying web databases remains locked away as the deep web (also called the hidden web or invisible web). In this paper, we propose an ontological technique that uses WordNet to extract data records from deep web pages. The technique discovers the best-matching words, eliminates unnecessary tags, and is able to extract a large variety of data records with different structures.
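
The abstract names three steps: matching words, eliminating unnecessary tags, and extracting data records. The following is only a minimal sketch of how such a pipeline might look, not the authors' method. It assumes that each data record sits in an `<li>` element, and it replaces WordNet with a small hypothetical synonym table (`SYNONYMS`) so that it runs with the Python standard library alone. All names in it (`RecordExtractor`, `extract_records`, `matches`, `relevant_records`) are illustrative.

```python
from html.parser import HTMLParser

# Hypothetical stand-in for WordNet synonym sets; a real system would
# look these up in WordNet instead of hard-coding them.
SYNONYMS = {
    "price": {"price", "cost", "amount"},
    "title": {"title", "name", "heading"},
}


class RecordExtractor(HTMLParser):
    """Collects the text of each <li> element, ignoring script/style content."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.records = []
        self._current = None   # text chunks of the record being read
        self._skip_depth = 0   # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag == "li":
            self._current = []

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1
        elif tag == "li" and self._current is not None:
            # Join the chunks and collapse runs of whitespace.
            text = " ".join(" ".join(self._current).split())
            if text:
                self.records.append(text)
            self._current = None

    def handle_data(self, data):
        if self._current is not None and not self._skip_depth:
            self._current.append(data)


def extract_records(html):
    """Return the tag-free text of every <li> record in the page."""
    parser = RecordExtractor()
    parser.feed(html)
    parser.close()
    return parser.records


def matches(record, query_word):
    """True if the record contains the query word or a listed synonym."""
    related = SYNONYMS.get(query_word, {query_word})
    words = {w.strip(".,:;").lower() for w in record.split()}
    return bool(words & related)


def relevant_records(html, query_word):
    """Keep only the records that match the query word."""
    return [r for r in extract_records(html) if matches(r, query_word)]
```

Calling `relevant_records(page, "price")` on a page whose list items read "Cost: 10 USD" and "Author: Smith" would keep only the first record, because "cost" is listed as a synonym of "price" in the assumed table.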
