@dmartins

Sparse matrix reordering schemes for browsing hypertext.

, , and . 32, page 99-123. (1996)The mathematics of numerical analysis. 1995 AMS-SIAM summer seminar in applied mathematics, July 17--August 11, 1995, Park City, UT, USA. Providence.

Abstract

Many approaches for retrieving documents from electronic databases depend on the literal matching of words in a user's query to the keywords defining database objects. Since there is great diversity in the words people use to describe the same object, literal- or lexical- based methods can often retrieve irrelevant documents. Another approach to exploit the implicit higher-order structure in the association of terms with text objects is to compute the singular value decomposition (SVD) of large sparse term by text-object matrices. Latent Semantic Indexing (LSI) is a conceptual indexing method which employs the SVD to represent terms and objects by dominant singular subspaces so that user queries can be matched in a lower-rank semantic space.\par This paper considers a third, intermediate approach to facilitate the immediate detection of document (or term) clusters. We demonstrate that both traditional sparse matrix reordering schemes (e.g., Reverse Cuthill-McKee) and spectral-based approaches (e.g., Correspondence Analysis or Fiedler vector-based spectral bisection) can be used to permute original term by document (hypertext) matrices to a narrow-banded form suitable for the detection of document (or term) clusters. Although this approach would not exploit the higher-order semantic structure in the database, it can be used to develop browsing tools for hypertext and on-line information at a reduced computational cost.

Description

robotica-bib

Links and resources

Tags

community

  • @keinstein
  • @ks-plugin-devel
  • @dmartins
@dmartins's tags highlighted