Zusammenfassung

In this paper we address the problem of discovering miss- ing hypertext links in Wikipedia. The method we propose consists of two steps: first, we compute a cluster of highly similar pages around a given page, and then we identify can- didate links from those similar pages that might be missing on the given page. The main innovation is in the algorithm that we use for identifying similar pages, LTRank, which ranks pages using co-citation and page title information. Both LTRank and the link discovery method are manually evaluated and show acceptable results, especially given the simplicity of the methods and conservativeness of the eval- uation criteria

Links und Ressourcen

Tags

Community

  • @asmelash
  • @chato
  • @dblp
  • @magnuslechner
  • @brightbyte
@chatos Tags hervorgehoben