jaeschke > search web engine

bookmarks (hide)4
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1YaCy - Freie Suchmaschinensoftware und dezentrale Websuche
http://yacy.net/
10 years ago by @jaeschke
show all tags
engine
free
open
p2p
search
web
yacy
enginefreeopenp2psearchwebyacy
(0)
copydelete
- community post
- history of this post
7sitemaps.org - Home
Sitemaps are an easy way for webmasters to inform search engines about pages on their sites that are available for crawling. In its simplest form, a Sitemap is an XML file that lists URLs for a site along with additional metadata about each URL (when it was last updated, how often it usually changes, and how important it is, relative to other URLs in the site) so that search engines can more intelligently crawl the site. Web crawlers usually discover pages from links within the site and from other sites. Sitemaps supplement this data to allow crawlers that support Sitemaps to pick up all URLs in the Sitemap and learn about those URLs using the associated metadata. Using the Sitemap protocol does not guarantee that web pages are included in search engines, but provides hints for web crawlers to do a better job of crawling your site. Sitemap 0.90 is offered under the terms of the Attribution-ShareAlike Creative Commons License and has wide adoption, including support from Google, Yahoo!, and Microsoft.
15 years ago by @jaeschke
show all tags
crawl
engine
google
metadata
search
sitemap
web
crawlenginegooglemetadatasearchsitemapweb
(0)
copydelete
- community post
- history of this post
1Official Google Blog: Your slice of the web
Today, we're pleased to announce the launch of Web History
18 years ago by @jaeschke
show all tags
engine
google
history
search
web
enginegooglehistorysearchweb
(0)
copydelete
- community post
- history of this post
26Dr. Dirk Lewandowski: Web Information Retrieval
http://www.durchdenken.de/lewandowski/web-ir/
19 years ago by @jaeschke
show all tags
search
web
engine
retrieval
information
searchwebengineretrievalinformation
(0)
copydelete
- community post
- history of this post

⟨⟨
⟨
1
⟩
⟩⟩

publications (hide)10
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

3Dynamic reference sifting: a case study in the homepage domain
J. Shakes, M. Langheinrich, and O. Etzioni. Computer Networks and ISDN Systems, 29 (8-13): 1193--1204 (1997)
11 years ago by @jaeschke
show all tags
crawler
engine
gaw
homepage
search
web
crawlerenginegawhomepagesearchweb
(0)
copydeleteadd this publication to your clipboard
3User browsing behavior-driven web crawling
M. Liu, R. Cai, M. Zhang, and L. Zhang. Proceedings of the 20th ACM international conference on Information and knowledge management, page 87--92. New York, NY, USA, ACM, (2011)
12 years ago by @jaeschke
show all tags
crawling
crowd
engine
search
web
alexandria
crawlingcrowdenginesearchwebalexandria
(0)
copydeleteadd this publication to your clipboard
3PageRank on an evolving graph
B. Bahmani, R. Kumar, M. Mahdian, and E. Upfal. Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, page 24--32. New York, NY, USA, ACM, (2012)
12 years ago by @jaeschke
show all tags
engine
pagerank
search
time
web
alexandria
enginepageranksearchtimewebalexandria
(0)
copydeleteadd this publication to your clipboard
2RankMass crawler: a crawler with high personalized pagerank coverage guarantee
J. Cho, and U. Schonfeld. Proceedings of the 33rd international conference on Very large data bases, page 375--386. VLDB Endowment, (2007)
12 years ago by @jaeschke
show all tags
crawling
engine
pagerank
search
web
alexandria
crawlingenginepageranksearchwebalexandria
(0)
copydeleteadd this publication to your clipboard
4Recrawl scheduling based on information longevity
C. Olston, and S. Pandey. Proceedings of the 17th international conference on World Wide Web, page 437--446. New York, NY, USA, ACM, (2008)
12 years ago by @jaeschke
show all tags
crawling
engine
search
web
alexandria
crawlingenginesearchwebalexandria
(0)
copydeleteadd this publication to your clipboard
4User-centric Web crawling
S. Pandey, and C. Olston. Proceedings of the 14th international conference on World Wide Web, page 401--411. New York, NY, USA, ACM, (2005)
12 years ago by @jaeschke
show all tags
crawling
engine
search
web
alexandria
crawlingenginesearchwebalexandria
(0)
copydeleteadd this publication to your clipboard
3Effective Web Crawling
C. Castillo. School of Engineering, Santiago, Chile, (November 2004)
12 years ago by @jaeschke
show all tags
crawling
engine
search
web
alexandria
crawlingenginesearchwebalexandria
(0)
copydeleteadd this publication to your clipboard
8The anatomy of a large-scale social search engine
D. Horowitz, and S. Kamvar. Proceedings of the 19th international conference on World wide web, page 431--440. New York, NY, USA, ACM, (2010)
13 years ago by @jaeschke
show all tags
aardvark
computing
engine
search
social
web
aardvarkcomputingenginesearchsocialweb
(0)
copydeleteadd this publication to your clipboard
2A novel Web usage mining approach for search engines
D. Zhang, and Y. Dong. Computer Networks, 39 (3): 303--310 (June 2002)
16 years ago by @jaeschke
show all tags
engine
mining
search
usage
web
engineminingsearchusageweb
(0)
copydeleteadd this publication to your clipboard
12Swoogle: a search and metadata engine for the semantic web.
L. Ding, T. Finin, A. Joshi, R. Pan, R. Cost, Y. Peng, P. Reddivari, V. Doshi, and J. Sachs. CIKM, page 652-659. (2004)
19 years ago by @jaeschke
show all tags
search
web
engine
metadate
swoogle
semantic
seminar2006
searchwebenginemetadateswooglesemanticseminar2006
(0)
copydeleteadd this publication to your clipboard

⟨⟨
⟨
1
⟩
⟩⟩

BibSonomy

bookmarks (hide)4
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

1YaCy - Freie Suchmaschinensoftware und dezentrale Websuche

7sitemaps.org - Home

1Official Google Blog: Your slice of the web

26Dr. Dirk Lewandowski: Web Information Retrieval

publications (hide)10
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...

3Dynamic reference sifting: a case study in the homepage domain

3User browsing behavior-driven web crawling

3PageRank on an evolving graph

2RankMass crawler: a crawler with high personalized pagerank coverage guarantee

4Recrawl scheduling based on information longevity

4User-centric Web crawling

3Effective Web Crawling

8The anatomy of a large-scale social search engine

2A novel Web usage mining approach for search engines

12Swoogle: a search and metadata engine for the semantic web.

browse

related tags

concepts

tags

bookmarks (hide)4 displayallbookmarks onlybookmarks per page5102050100 sort byadded attitle RSSBibTeXXML

publications (hide)10 displayallpublications onlypublications per page5102050100 sort byadded attitleauthorpublication dateentry typehelp for advanced sorting... RSSBibTeXRDFmore...

browse

related tags

tags

bookmarks (hide)4
display
all
bookmarks only
bookmarks per page
5
10
20
50
100
sort by
added at
title
RSS
BibTeX
XML

publications (hide)10
display
all
publications only
publications per page
5
10
20
50
100
sort by
added at
title
author
publication date
entry type
help for advanced sorting...
RSS
BibTeX
RDF
more...