This data set contains WWW-pages collected from computer science departments of various universities in January 1997 by the World Wide Knowledge Base (Web->Kb) project of the CMU text learning group. The 8,282 pages were manually classified into the following categories:
* student (1641)
* faculty (1124)
* staff (137)
* department (182)
* course (930)
* project (504)
* other (3764)