Web Crawler
a minimal Java web crawler Java web crawler Web Site Features http crawler validate domains
Platforms: Mac
License: Freeware | Size: 10.24 KB | Download (47): Web Crawler Download |
It is basically a program that can make you a search engine. It is a web crawler, has all the web site source code (in ASP, soon to be PHP as well), and a mysql database.
Platforms: Windows, Mac, *nix, PHP, BSD Solaris
License: Freeware | Download (60): URL Web Crawler Download |
PyGalleryCrawler project is a Web crawler for online image galleries. Installation: tar -xzf pygallerycrawler.tar.gz cd pygallerycrawler Extra python modules psyco @ http://psyco.sourceforge.net - performance Python Imaging Library aka PIL @ http://www.pythonware.com/products/pil/...
Platforms: *nix
License: Freeware | Size: 15.36 KB | Download (117): PyGalleryCrawler Download |
YACY is a distributed Web crawler and also a caching HTTP/HTTPS proxy. Pages that pass through the proxy are indexed and can be searched using a built-in HTTP server. YACY peers connect each other and form a P2P-based index exchange network based on distributed hash tables. Explicit web crawls...
Platforms: *nix
License: Freeware | Size: 5.7 MB | Download (115): YACY Download |
ItSucks software is a java web spider (web crawler) with the ability to download (and resume) files. It is also highly customizable with regular expressions and download templates. The application also provides a swing GUI and a console interface. All backend functionalities are also available...
Platforms: *nix
License: Freeware | Size: 2 MB | Download (136): ItSucks 0.2.0 Beta Download |
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. Heritrix (sometimes spelled heretrix, or misspelled or missaid as heratrix/heritix/ heretix/heratix) is an archaic word for heiress (woman who inherits). Since this crawler seeks to...
Platforms: Windows, Mac, *nix, Java, BSD Solaris
License: Freeware | Download (61): Heritrix Download |
Larbin is a Web crawler intended to fetch a large number of Web pages. It should be able to fetch more than 100 millions pages on a standard PC with much u/d. This set of PHP and Perl scripts, called webtools4larbin, can handle the output of Larbin.
Platforms: Windows, Mac, *nix, PHP, BSD Solaris
License: Freeware | Download (59): Webtools 4 larbin Download |
The web crawler is a program that automatically traverses the web by downloading the pages and following the links from page to page. A general purpose of web crawler is to download any web page that can be accessed through the links. This process is called web crawling or spidering. Many sites,...
Platforms: Windows, Mac, Linux, Linux Console, Linux Gnome, Linux GPL, Linux Open Source, Java
License: Freeware | Size: 57.62 MB | Download (56): Vietspider Web Data Extractor Download |
KrawlSite is a web crawler/spider/ offline browser/download manager application. To integrate with Konqueror, open the file associations page in the configuration dialog, select text/html mime type and in the embedded viewers list choose KrawlSite_Part. Now when you right click on a web-page in...
Platforms: *nix
License: Freeware | Size: 634.88 KB | Download (87): KrawlSite Download |
free web crawler, if use together with Search engine builder -lexst-SEA, you can index 10 billion pages.Multi threads and distributed free web crawler, for both internet and interanet.You can control how frequency the spider should crawl your pages, you can save the pages locally or sent to a...
Platforms: Windows
License: Freeware | Size: 532.48 KB | Download (49): LexstwebSpider Download |
free web crawler, if use together with Search engine builder -lexst-SEA, you can index 10 billion pages.Multi threads and distributed free web crawler, for both internet and interanet.You can control how frequency the spider should crawl your pages, you can save the pages locally or sent to a...
Platforms: Windows
License: Freeware | Size: 535 KB | Download (51): LexstwebSpider Spider Download |
Bee-rain is a web crawler that harvest and index file over the network. It is a free PHP search engine software.Bee-rain helps you in your sleep by comparing, filtering, analyzing citations of brands, companies and personalities on the Internet. Bee-rain then produces statistics and visualization...
Platforms: PHP
License: Freeware | Size: 4.28 MB | Download (47): Bee-rain Download |
FoxySpider is your personal web crawler! It can crawl into a given web site and retrieve what you really want (video clips, images, music files, etc.). FoxySpider displays the located items in a well-structured thumbnail gallery for ease of use.
Platforms: Mac
License: Freeware | Size: 71.68 KB | Download (34): FoxySpider Download |
AWSpider is a web crawler for Amazon Web Services, written in Python. #md5=53f2f7be2f926c9f4b2c867b657ef59b
Platforms: Mac
License: Freeware | Size: 40.96 KB | Download (42): AWSpider Download |
AWSpider is a web crawler for Amazon Web Services, written in Python. #md5=edb1228e36df1a72e28bb07119e7a2e2
Platforms: *nix
License: Freeware | Size: 40.96 KB | Download (41): John Wehr Download |
AWSpider is a web crawler for Amazon Web Services, written in Python #md5=edb1228e36df1a72e28bb07119e7a2e2
Platforms: *nix
License: Freeware | Size: 40.96 KB | Download (34): AWSpider for Linux Download |
WebReaper is web crawler or spider, which can work its way through a website, downloading pages, pictures and objects that it finds so that they can be viewed locally, without needing to be connected to the Internet. The sites can be saved locally as a fully-browsable website which can be viewed...
Platforms: Windows
License: Freeware | Size: 1.2 MB | Download (261): WebReaper Download |
A stand-alone web crawler that generates xml sitemaps, finds broken links, and validates markup & css so you can maintain error free, fully indexed sites without sacrificing productivity.
- Added free offers.
- Made compatible with 32-bit Macs.
- Modified scanner to ignore content received with...
Platforms: Mac
License: Freeware | Size: 9 MB | Download (49): WebLight for Mac OS Download |
The CMS - Bandits is a set of PHP scripts that implement an online HTML editor, calendar, search engine, RSS reader and editor, image gallery, comment system,Web crawler and many more.
Platforms: Windows, Mac, *nix, PHP, BSD Solaris
License: Freeware | Download (51): CMS - Bandits Download |
SiteAnalyzer it is a web crawler to scan the websites and check their technical and of SEO-parameters for errors and correct them effectively.
Key features of the SiteAnalyzer
- Scanning of all pages of the site, as well as images, scripts and documents
- Getting server response codes for...
Platforms: Windows, Windows 8, Windows 7, Windows Server
License: Freeware | Size: 8.13 MB | Download (737): SiteAnalyzer Download |