Abstract

We define a community on the web as a set of sites that have more links (in either direction) to members of the community than to non-members. Members of such a community can be efficiently identified in a maximum flow / minimum cut framework, where the source is composed of known members, and the sink consists of well-known non-members. A focused crawler that crawls to a fixed depth can approximate community membership by augmenting the graph induced by the crawl with links to a virtual sink...

Links and resources

Tags

community

  • @wnpxrz
  • @ldietz
  • @hotho
  • @grahl
@hotho's tags highlighted