Аннотация

The design of a Web spider entails many things, including a concern for reasonable behavior, as well as more technical concerns. The RBSE Spider is a mechanism for exploring World Wide Web structure and indexing useful material thereby discovered. We relate our experience in constructing and operating this spider. 1 -- Introduction As the World Wide Web 3 increases in complexity and number of users, it will be increasingly difficult for users to find information. Recent statistics posted by Fletcher 6, McBryan 12 and others indicate that there are more than 100,000 artifacts now Web-accessible. Relying solely upon browsing of hyperlinks or hand-crafted indices to gain access to specific topics is intractable. This paper describes our experience with constructing a spider as part of our work on the Repository Based Software Engineering (RBSE) project. Web spiders are programs that traverse the Web, acting in some manner upon the information thereby uncovered. The RBSE spider disc...

Линки и ресурсы

тэги

сообщество

  • @chato
  • @lysander07
@lysander07- тэги данного пользователя выделены