Web Crawler
Astanda Directory Project [ADP] is directory, search engine, and spider/crawler software. Your own little breed of DMOZ and Google. ADP is the only software in its class ever to be written in PHP with MySQL database support. It supports a virtually unlimited number of links and pages. Powered by...
Platforms: Windows, Mac, *nix, PHP, BSD Solaris
License: Freeware | Download (98): Astanda Directory Project 1.3b Download |
The Web Curator Tool (WCT) is an open-source workflow management application for selective web archiving. It is designed for use in libraries and other collecting organisations, and supports collection by non-technical users while still allowing complete control of the web harvesting process. It...
Platforms: *nix
License: Freeware | Size: 138.63 MB | Download (38): Web Curator Tool Download |
Fresh WebSuction is designed for downloading web pages or web content (reference material, software files, online books, e-zines, news articles) for offline viewing, saving money on Internet connection fees. If you pay fees for the amount of time you are online, you can save money by quickly...
Platforms: Windows
License: Freeware | Size: 1.02 MB | Download (52): Fresh Websuction Download |
Octoparse is a powerful but easy-to-use and free web scraping tool that is being used by tens of thousands of individuals and companies all over the world. The name Octoparse is a combination of two words, “Octopus” and “Parse”...
Platforms: Windows, Windows 7
License: Freeware | Size: 666 KB | Download (61): Octoparse Download |
PHPCrawl is a set of classes written in PHP for crawling/spidering websites, so just call it a webcrawler-library for PHP. The crawler "spiders" websites and delivers information about all found pages, links, files and so on to users of the library. By overriding a special method of the...
Platforms: Windows, Mac, *nix, PHP, BSD Solaris
License: Freeware | Download (63): PHPCrawl Download |
Wayback Machine is an open source Java implementation of the Internet Archive Wayback Machine. The current production version of the Wayback Machine is implemented in Perl and lacks maintainability and extensibility. Also, the code is not open source. Primary motivation for the new...
Platforms: *nix
License: Freeware | Size: 1.8 MB | Download (104): Wayback Machine Download |
External Site Catalog allows you to index and search external sites in a Plone site. ExternalSiteCatalog is a web crawler that can index external sites and make them searchable in Plone. You can specify the sites to index in a Plone Configlet, and directly index them from Plone, or let a...
Platforms: *nix
License: Freeware | Size: 204.8 KB | Download (98): External Site Catalog Download |
Desktop IRIS is an easy-to-use search program that can be successfully downloaded and accessed by anyone. It allows you to intuitively find stored information from your desktop and network without imposing any restrictions on the number of files and folder locations indexed. The system uses the...
Platforms: Windows, Mac
License: Freeware | Size: 55.55 MB | Download (131): Desktop IRIS Download |
Netifera is a new modular open source platform for creating network security tools. This project provides many advantages for both security developers and researchers who want to implement new tools as well as the community of users of these tools. * TCP and UDP network scanning and service...
Platforms: Mac, Linux
License: Freeware | Size: 9.77 MB | Download (430): netifera Download |
This plugin was born because there was no other real alternative for enabling content separation and some CMS-like functionality in WordPress. The main goal was to enhance WordPress's functionality by hiding unwanted categories from defined parts of the blog. Today, ACE can override...
Platforms: PHP
License: Freeware | Size: 10 KB | Download (53): Advanced Category Excluder Download |
This module is under active development for a Drupal 6 upgrade; have a look at the Nutch Roadmap for progress. Nutch is a web crawler/indexer/search engine that is based on Lucene. It is a Java tool. This module allows you to have basic control over the Nutch crawl lifecycle through the Drupal...
Platforms: PHP
License: Freeware | Size: 10 KB | Download (46): Nutch search engine integration Download |
Xapian is an Open Source Probabilistic Information Retrieval library, released under the GPL. Xapian is written in C++, with bindings to allow use from other languages (Perl, Java, Python, PHP, and Tcl are currently supported; Guile and C# are being worked on). Xapian is designed to be a highly...
Platforms: *nix
License: Freeware | Size: 604.16 KB | Download (47): Xapian and Omega Download |
Majento SiteAnalyzer is a web crawler (web spider) that scans websites, checks their technical and SEO parameters for errors, and helps correct them effectively.
Key features
- Scanning of all pages of the site, as well as images, scripts and documents
- Obtaining server response codes for...
Platforms: Windows, Windows 8, Windows 7, Windows Server
License: Freeware | Size: 2.63 MB | Download (158): Majento SiteAnalyzer Download |
Schema Crawler project is a platform (database and OS) independent command-line tool to output your database schema and data in a readable form. The output is designed to be diff-ed with previous versions of your database schema. Schema Crawler is also an API that improves on standard JDBC...
Platforms: *nix
License: Freeware | Size: 747.52 KB | Download (128): Schema Crawler Download |
Crawl the web with this free ActiveX web bot component. Advanced features include caching, "avoid" patterns, and robots.txt compliance.
Platforms: Windows
License: Freeware | Size: 532.48 KB | Download (510): Web Bot Programming Library Download |
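The robots.txt compliance and "avoid" patterns mentioned for the Web Bot Programming Library are general crawler techniques. A minimal sketch of the idea follows, in Python rather than ActiveX; it is not the library's API. The function names `build_robots` and `may_fetch`, the user agent `example-bot`, and the sample URLs are all hypothetical.

```python
import re
from urllib.robotparser import RobotFileParser


def build_robots(robots_txt: str) -> RobotFileParser:
    """Parse the text of a robots.txt file (already downloaded) into a rule set."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_fetch(url: str, robots: RobotFileParser, avoid_patterns,
              user_agent: str = "example-bot") -> bool:
    """Return True only if the URL matches no avoid pattern and robots.txt allows it."""
    # "Avoid" patterns: regular expressions for URLs the crawler should skip.
    if any(re.search(pattern, url) for pattern in avoid_patterns):
        return False
    # robots.txt compliance: defer to the site's published rules.
    return robots.can_fetch(user_agent, url)


if __name__ == "__main__":
    rules = build_robots("User-agent: *\nDisallow: /private/\n")
    avoid = [r"\.pdf$"]  # hypothetical pattern: skip PDF files
    for candidate in ("http://example.com/index.html",
                      "http://example.com/private/a.html",
                      "http://example.com/doc.pdf"):
        print(candidate, may_fetch(candidate, rules, avoid))
```

Checking the avoid patterns first keeps the decision cheap for URLs the crawler would skip anyway.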
Audit your website security with Acunetix Web Vulnerability Scanner. As many as 70% of web sites have vulnerabilities that could lead to the theft of sensitive corporate data such as credit card information and customer lists. Hackers are concentrating their efforts on web-based applications -...
Platforms: Windows
License: Freeware | Size: 13.1 MB | Download (68): Acunetix Web Vulnerability Scanner FREE Download |
Ex-Crawler is divided into 3 subprojects (crawler daemon, distributed GUI client, and (web) search engine) which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
Platforms: Windows, Mac, Solaris, Linux
License: Freeware | Size: 9.52 MB | Download (51): Ex-Crawler Download |
Web Accounts Plus is a great utility that allows users to maintain their ever-growing list of web accounts, credit cards, and frequent traveler information for free! The information can be password protected and the application can be hidden in the system tray. You can drag and drop...
Platforms: Windows
License: Freeware | Size: 3.12 MB | Download (179): Web Accounts Plus Download |
A free WYSIWYG editor for the design of Web content using HTML and SVG (Scalable Vector Graphics). Features include an advanced Text Editor with spell checker and link editor, vector graphics editor, a Renderer with support for radial shading, linear shading and transparency, enhanced page layout...
License: Freeware | Size: 4.73 MB | Download (791): IMS Web Dwarf Download |
Web Roller records real user activity as HTTP requests and creates a simple script. You can modify the script using embedded test data generating capabilities. You can set various parameters for requests and the script, and then run the script or a batch of scripts. It creates a...
Platforms: Windows
License: Freeware | Size: 982 KB | Download (236): Web Roller Download |