Finding:
Freebase
searching
Factz
searching
Articles
searching
Web crawler
freebase
help| A Web crawler (also known as a Web spider, Web robot, or—especially in the FOAF community—Web scutter) is a program or automated script that browses the World Wide Web in a methodical, automated manner. Other less frequently used names for Web crawlers are ants, automatic indexers, bots, and worms. This process is called Web crawling or spidering. Many sites, in particular search engines, use... Read enhanced Wikipedia article |
-
close
Web crawler
A web crawler (also known as a web spider, web robot, or—especially in the FOAF community—web scutter) is a program or automated script that browses the World Wide Web in a methodical, automated manner. -
close
Focused crawler
A focused crawler or topical crawler is a web crawler that attempts to download only web pages that are relevant to a pre-defined topic or set of topics. -
close
Crawler
Web crawler, a computer program that gathers and categorizes information on the Internet -
close
Web search engine
These pages are retrieved by a Web crawler (sometimes also known as a spider) — an automated Web browser which follows every link it sees. -
close
Spider trap
dynamic pages like calendars that produce an infinite number of pages for a web crawler to follow. -
close
Heritrix
Heritrix is the Internet Archive’s web crawler which was specially designed for web archiving. -
close
HTTrack
HTTrack Developed by Xavier Roche Latest release 3.43 (28 September 2008) OS Cross-platform Type Web crawler License GNU General Public License Website http://www.httrack.com ... Free web crawlers -
close
Archive site
By using a web crawler the service will not depend on an active community for their content, thereby building a larger database faster, which usually results in the community growing larger as well. -
close
Deep Web
Ntoulas et al. (2005) created a hidden-Web crawler that automatically generated meaningful queries to issue against search forms. -
close
Web archiving
Some web servers may return a different page for a web crawler than it would for a regular browser request.
Explore the following pages on Powerset:
parse:article:Web\scrawler
Web crawler