Observed Web Robot Behavior On Decaying Web Subsites
D-Lib Magazine. February 2006.
J.A. Smith, F. McCown, and M.L. Nelson.
No download available.
We describe the observed crawling patterns of various search engines (including
Google, Yahoo and MSN) as they traverse a series of web subsites whose contents
decay at predetermined rates. We plot the progress of the crawlers through the
subsites, and their behaviors regarding the various file types included in the
web subsites. We chose decaying subsites because we were originally interested
in tracking the implication of using search engine caches for digital
preservation. However, some of the crawling behaviors themselves proved to be
interesting and have implications on using a search engine as an interface to a
digital library.
@article{jas:behavior,
author = {Joan A. Smith and Frank McCown and Michael L. Nelson},
title = {Observed Web Robot Behavior on Decaying Web Subsites},
journal = {{D-Lib M}agazine},
volume = {12},
number = {2},
year = {2006},
month = {February},
note = {\url{http://www.dlib.org/dlib/february06/smith/02smith.html}}
}