Article

Search engine case study: searching the web using genetic programming and MPI

R. Walker.
Parallel Computing, 27 (1-2): 71--89 (January 2001)

Abstract

The generation of a Web page follows distinct sources for the incorporation of information. The earliest format of these sources was an organized display of known information determined by the page designers' interest and/or design parameters. The sources may have been published in books or other printed literature, or disseminated as general information about the page designer. Due to a growth in Web pages, several new search engines have been developed in addition to the refinement of the already existing ones. The use of the refined search engines, however, still produces an array of diverse information when the same set of keywords are used in a Web search. Some degree of consistency in the search results can be achieved over a period of time when the same search engine is used, yet, most initial Web searches on a given topic are treated as final after some form of refinement/adjustment of the keywords used in the search process. To determine the applicability of a genetic programming (GP) model for the diverse set of Web documents, search strategies behind the current search engines for the World Wide Web were studied. The development of a GP model resulted in a parallel implementation of a pseudo-search engine indexer simulator. The training sets used in this study provided a small snapshot of the computational effort required to index Web documents accurately and efficiently. Future results will be used to develop and implement Web crawler mechanisms that are capable of assessing the scope of this research effort. The GP model results were generated on a network of SUN workstations and an IBM SP2.

BibTeX key: Walker:2001:PC
entry type: article
year: 2001
month: January
journal: Parallel Computing
number: 1-2
pages: 71--89
volume: 27
url: http://www.sciencedirect.com/science/article/B6V12-42K5HNX-4/1/57eb870c72fb7768bb7d824557444b72

Cite this publication

%0 Journal Article
%1 Walker:2001:PC
%A Walker, Reginald L.
%D 2001
%J Parallel Computing
%K Distributed Information Search Web, Wide World algorithms, computing, engines genetic programming, retrieval,
%N 1-2
%P 71--89
%T Search engine case study: searching the web using genetic programming and MPI
%U http://www.sciencedirect.com/science/article/B6V12-42K5HNX-4/1/57eb870c72fb7768bb7d824557444b72
%V 27
%X The generation of a Web page follows distinct sources for the incorporation of information. The earliest format of these sources was an organized display of known information determined by the page designers' interest and/or design parameters. The sources may have been published in books or other printed literature, or disseminated as general information about the page designer. Due to a growth in Web pages, several new search engines have been developed in addition to the refinement of the already existing ones. The use of the refined search engines, however, still produces an array of diverse information when the same set of keywords are used in a Web search. Some degree of consistency in the search results can be achieved over a period of time when the same search engine is used, yet, most initial Web searches on a given topic are treated as final after some form of refinement/adjustment of the keywords used in the search process. To determine the applicability of a genetic programming (GP) model for the diverse set of Web documents, search strategies behind the current search engines for the World Wide Web were studied. The development of a GP model resulted in a parallel implementation of a pseudo-search engine indexer simulator. The training sets used in this study provided a small snapshot of the computational effort required to index Web documents accurately and efficiently. Future results will be used to develop and implement Web crawler mechanisms that are capable of assessing the scope of this research effort. The GP model results were generated on a network of SUN workstations and an IBM SP2.

@article{Walker:2001:PC,
  abstract = {The generation of a Web page follows distinct sources for the incorporation of information. The earliest format of these sources was an organized display of known information determined by the page designers' interest and/or design parameters. The sources may have been published in books or other printed literature, or disseminated as general information about the page designer. Due to a growth in Web pages, several new search engines have been developed in addition to the refinement of the already existing ones. The use of the refined search engines, however, still produces an array of diverse information when the same set of keywords are used in a Web search. Some degree of consistency in the search results can be achieved over a period of time when the same search engine is used, yet, most initial Web searches on a given topic are treated as final after some form of refinement/adjustment of the keywords used in the search process. To determine the applicability of a genetic programming (GP) model for the diverse set of Web documents, search strategies behind the current search engines for the World Wide Web were studied. The development of a GP model resulted in a parallel implementation of a pseudo-search engine indexer simulator. The training sets used in this study provided a small snapshot of the computational effort required to index Web documents accurately and efficiently. Future results will be used to develop and implement Web crawler mechanisms that are capable of assessing the scope of this research effort. The GP model results were generated on a network of SUN workstations and an IBM SP2.},
  added-at = {2008-06-19T17:46:40.000+0200},
  author = {Walker, Reginald L.},
  biburl = {https://www.bibsonomy.org/bibtex/2aa91084b0a9ea846c996e2701ecbe5d2/brazovayeye},
  interhash = {4eca357a8c67072f93979a2eadc204e7},
  intrahash = {aa91084b0a9ea846c996e2701ecbe5d2},
  journal = {Parallel Computing},
  keywords = {Distributed Information Search Web, Wide World algorithms, computing, engines genetic programming, retrieval,},
  month = {January},
  number = {1-2},
  pages = {71--89},
  timestamp = {2008-06-19T17:53:47.000+0200},
  title = {Search engine case study: searching the web using genetic programming and {MPI}},
  url = {http://www.sciencedirect.com/science/article/B6V12-42K5HNX-4/1/57eb870c72fb7768bb7d824557444b72},
  volume = 27,
  year = 2001
}
