Yelp Scraper
ScraperPOD is a framework for scraping results from search engines. SYNOPSIS use WWW::Scraper; # Name your Scraper module / search engine as the first parameter, use WWW::Scraper(eBay); # or in the new() method $scraper = new WWW::Scraper(eBay); Classic WWW::Search mode # Use a...
Platforms: *nix
License: Freeware | Size: 102.4 KB | Download (104): ScraperPOD Download |
It uses the Arin API to connect and return queries.
Platforms: Windows, Mac, *nix, Python, BSD Solaris
License: Freeware | Download (68): ARIN Whois Scraper Download |
like to keep my TV rips well-organized, and labeled with the correct episode title. Id-deOaove been using the excellent Bulk Rename Utility to quickly rename with proper series title, season and episode numbersd-deOCLbut the individual episode titles were always a pain. Begrudgingly, Id-deOaod...
Platforms: PHP
License: Freeware | Size: 10 KB | Download (52): IMDB Episode Scraper Download |
WWW::PDAScraper is a Perl class for scraping PDA-friendly content from websites. Synopsis use WWW::PDAScraper; my $scraper = WWW::PDAScraper->new qw ( NewScientist Yahoo::Entertainment ); $scraper->scrape(); or use WWW::PDAScraper; my $scraper = WWW::PDAScraper->new; $scraper->scrape...
Platforms: *nix
License: Freeware | Size: 70.66 KB | Download (89): WWW::PDAScraper Download |
libgtkhtml is a GNOME library for gtkhtml2 module. Installation: The simplest way to compile this package is: 1. `cd to the directory containing the packages source code and type `./configure to configure the package for your system. If youre using `csh on an old version of System V, you...
Platforms: *nix
License: Freeware | Size: 819.2 KB | Download (137): libgtkhtml Download |
openModeller project is a generic framework for carrying out fundamental niche modelling experiments - typically used to predict species distribution given a set of environmental raster layers. The openModeller Desktop application builds on the openModeller library by providing a user friendly...
Platforms: *nix
License: Freeware | Size: 645.12 KB | Download (101): openModeller Desktop Download |
Hirudov project is a Java Swing application for leeching Web content. It features a multipurpose workspace where links can be extracted from a URL, manipulated, and filtered. The workspace can also be used to generate filename ranges. Other features include editable Referer and User Agent...
Platforms: *nix
License: Freeware | Size: 82.94 KB | Download (95): Hirudo Download |
jsPro is a set of libraries that extend the core Javascript classes and provide a number of new classes. The aim is to provide a rich set of object-based functionality to the Javascript language.. Get hirudo at SourceForge.net. Fast, secure and free downloads from the largest Open Source...
Platforms: *nix
License: Freeware | Size: 51.2 KB | Download (88): jsPro Download |
The web crawler is a program that automatically traverses the web by downloading the pages and following the links from page to page. A general purpose of web crawler is to download any web page that can be accessed through the links. This process is called web crawling or spidering. Many sites,...
Platforms: Windows, Mac, Linux, Linux Console, Linux Gnome, Linux GPL, Linux Open Source, Java
License: Freeware | Size: 57.62 MB | Download (56): Vietspider Web Data Extractor Download |
OutWit is the Web collection engine for everyone. It runs on your Windows, MacOS or Linux machine and allows you to browse through and easily grab online information, collect media, contacts or files from the Internet, in a few clicks. Find pictures in galleries and search engines and download...
Platforms: Linux, Linux Gnome, Linux GPL, Linux Open Source
License: Freeware | Size: 3.06 MB | Download (57): OutWit Hub for Firefox Download |
Samsung AnyWeb Print is a simple utility which enables you to collect and to print various kinds of web contents on a customized layout.
You can add web contents to your Scrap Board via drag & drop feature after marking the area on your browser, and then rearrange them on a single page or...
Platforms: Windows
License: Freeware | Size: 17.8 MB | Download (54): Samsung AnyWeb Print Download |
HTML-to-XML is a .NET component that can help you transform a HTML file into a well-formed XML for parsing. If effect, it is designed to be an HTML parser / scraper.
Once HTML is converted to XHTML (i.e. well-formed XML), the plethora of existing XML parsing components and libraries can be...
Platforms: Windows, XP, 2003, Windows Vista
License: Freeware | Download (63): Chilkat .NET HTML-to-XML Download |
Java program to extract postings and comments from http://www.livejournal.com (blog) into DB and view/classify/process it. LJ loader. Components to reuse: perl-like, but efficient Web pages scraper, trees analyzer, concurrent scheduler.
Platforms: Windows, Mac, Linux
License: Freeware | Size: 63.19 KB | Download (50): LJLoader Download |
A .Net Library Implementation of an XML screen-scraper currently found in XBMC The intention of the library is to make integratation of XML scrapers easy, essentially converting a text file into working program logic.
Platforms: Windows, Mac, Linux
License: Freeware | Size: 796.5 KB | Download (52): ScraperXML Download |
Makes scraping sites, parsing XML, and following hyperlinks a snap. A number of other scraper-type modules depend on it, so its easier to update one place than in many. Saves developers a lot of time.The module can be extended to include features such as proxy rotation, delayed hits or useragent...
Platforms: Windows, Mac, *nix, PHP, BSD Solaris
License: Freeware | Download (61): DataMiner API Download |
Simple Real Estate Pack is a package of real estate tools and widgets designed specifically for real estate industry blogs and web sites. The plugin includes mortgage and home affordability calculators, closing cost estimator, live mortgage rates, Trulia and ALTOS statistical charts, local...
Platforms: PHP
License: Freeware | Size: 665.6 KB | Download (49): Simple Real Estate Pack Download |
This Plugin is only supported on PHP 5.2 or greater.RSS Injection allows you to modify the post for your RSS feed. You may be able think of you own reasons for doing this, but it was originally designed to add a copyright message and link to the feed to show the posts origins should a blog...
Platforms: PHP
License: Freeware | Size: 10 KB | Download (39): RSS Injection Download |
1. GatherProxy harvest thousands fresh proxies from over 200 websites (default) and from trust source (Hidemyass, Spys, Proxynova ...). You can also extract proxies by using your custom list.
2. SSL checking feature, Black List Checking feature, Google Passed proxies checking feaature 3. Dont...
Platforms: Windows, Other
License: Freeware | Size: 1.42 KB | Download (53): GatherProxy Download |
XML::RSS::SimpleGen is a Perl module for writing RSS files, simply. It transparently handles all the unpleasant details of RSS, like proper XML escaping, and also has a good number of Do-What-I-Mean features, like not changing the modtime on a written-out RSS file if the file content hasn't...
Platforms: *nix
License: Freeware | Size: 30.72 KB | Download (40): XML::RSS::SimpleGen Download |
gnome-doc-utils packages contains a collection of documentation utilities for the Gnome project. Notably, it contains utilities for building documentation and all auxiliary files in your source tree, and it contains the DocBook XSLT stylesheets that were once distributed with Yelp. Starting with...
Platforms: *nix
License: Freeware | Size: 512 KB | Download (39): gnome-doc-utils Download |