Download Shareware and Freeware Software for Windows, Linux, Macintosh, PDA

line Home  |  About Us  |  Link To Us  |  FAQ  |  Contact

Serving Software Downloads in 976 Categories, Downloaded 29.642.903 Times

WWW::PDAScraper 0.1

  Date Added: November 02, 2010  |  Visits: 615

WWW::PDAScraper

Report Broken Link
Printer Friendly Version


Product Homepage
Download (72 downloads)



WWW::PDAScraper is a Perl class for scraping PDA-friendly content from websites. Synopsis use WWW::PDAScraper; my $scraper = WWW::PDAScraper->new qw ( NewScientist Yahoo::Entertainment ); $scraper->scrape(); or use WWW::PDAScraper; my $scraper = WWW::PDAScraper->new; $scraper->scrape qw( NewScientist Yahoo::Entertainment ); or perl -MWWW::PDAScraper -e "scrape qw( NewScientist Yahoo::Entertainment )" Having written various kludgey scripts to download PDA-friendly content from various websites, I decided to try and write a generalised solution which would * parse out the section of a news page which contains the links we want * munge those links into the URL for the print-friendly version, if possible * download those pages and make an index page for them The moving of the pages to your PDA is not part of the scope of the module: the open-source browser and "distiller", Plucker, from http://plkr.org/ is recommended. Just get it to read the index.html file with a depth of 1 from disk, using a URL like file:///path/to/index.html The Sub-modules WWW::PDAScraper uses a set of rules for scraping a particular website from a second module, i.e. WWW::PDAScraper::Yahoo::Entertainment::TV contains the rules for scraping the Yahoo TV News website: package WWW::PDAScraper::Yahoo::Entertainment::TV; # WWW::PDAScraper.pm rules for scraping the # Yahoo TV website sub config { return { name => Yahoo TV, start_from => http://news.yahoo.com/i/763, chunk_spec => [ "_tag", "div", "id", "indexstories" ], url_regex => [ $, &printer=1 ] }; } 1; A more or less random selection of modules is included, as well as a full set for Yahoo, to demonstrate a logical set of modules in categories. Creating a new sub-module ought to be relatively simple, see the template provided, WWW::PDAScraper::Template.pm - you need name, start_from, then either chunk_spec or url_spec, then optionally a url_regex for transformation into the print-friendly URL. Then either move your new module to the same location as the other ones on your system, or make sure theyre available to your script with a line like use lib /path/to/local/modules/PDAScraper/.

Requirements: No special requirements
Platforms: Linux
Keyword: Content Libraries New Newscientist Yahoo Pdascraper Perl Programming Scraping Tv Url Www Wwwpdascraper Yahoo
Users rating: 0/10

License: Freeware Size: 70.66 KB
USER REVIEWS
More Reviews or Write Review


WWW::PDASCRAPER RELATED
Libraries  -  WWW::Search::Yahoo 2.405
WWW::Search::Yahoo is a Perl backend for searching www.yahoo.com SYNOPSIS use WWW::Search; my $oSearch = new WWW::Search(Yahoo); my $sQuery = WWW::Search::escape_query("sushi restaurant Columbus Ohio"); $oSearch->native_query($sQuery);...
27.65 KB  
Network & Internet  -  mod_perl 2.0.3
mod_perl project brings together the full power of the Perl programming language and the Apache HTTP server. You can use Perl to manage Apache, respond to requests for web pages and much more. mod_perl gives you a persistent Perl interpreter...
3.5 MB  
Libraries  -  Mail::Webmail::Yahoo 0.601
Mail::Webmail::Yahoo is a Perl module that enables bulk download of yahoo.com -based webmail. SYNOPSIS use Mail::Webmail::Yahoo; $yahoo = Mail::Webmail::Yahoo->new(%options); @folders = $yahoo->get_folder_list(); @messages =...
28.67 KB  
Libraries  -  Geo::Coder::Yahoo 0.02
Geo::Coder::Yahoo is a Perl module with geocode addresses with the Yahoo! API. SYNOPSIS Provides a thin Perl interface to the Yahoo! Geocoding API. use Geo::Coder::Yahoo; my $geocoder = Geo::Coder::Yahoo->new(appid => my_app ); my...
5.12 KB  
Libraries  -  Finance::Currency::Convert::Yahoo 0.2
Finance::Currency::Convert::Yahoo is Perl mdoule that convert currencies using Yahoo. SYNOPSIS use Finance::Currency::Convert::Yahoo; $Finance::Currency::Convert::Yahoo::CHAT = 1; $_ = Finance::Currency::Convert::Yahoo::convert(1,USD,GBP);...
6.14 KB  
Libraries  -  Finance::Quote::Yahoo::Base 1.12
Finance::Quote::Yahoo::Base is a Perl module with common functions for fetching Yahoo info. SYNOPSIS Base functions for use by the Finance::Quote::Yahoo::* modules. This module is not called directly, rather it provides a set of base...
92.16 KB  
Libraries  -  Net::YMSG 1.2
Net::YMSG is a Perl interface to the Yahoo! Messenger IM protocol. SYNOPSIS use Net::YMSG; my $yahoo = Net::YMSG->new( id => your_yahoo_id, password => your_password, ); $yahoo->login or die "Cant login Yahoo!Messenger";...
15.36 KB  
Libraries  -  Finance::YahooChart 0.01
Finance::YahooChart is a Perl module to get a chart from Yahoo! Finance. SYNOPSIS use Finance::YahooChart; %img = getchart(symbol => $symbol, size => $size, type => $type, include => $include); print width=$img{width} height=$img{height}>";...
9.22 KB  
Libraries  -  WWW::YahooMaps 0.3
WWW::YahooMaps is a Perl module which can create links to Yahoo! Maps. SYNOPSIS use WWW::YahooMaps; #first method: PASSING ADDRESS BIT-BY-BIT my %addr = ( "street" => "555 N Michigan Ave", "city" => "Chicago, IL 60611", "country" =>...
4.1 KB  
Code Management Tools  -  Virgil B-03
Virgil projetc is an new, object-oriented, light-weight programming language that is designed for building software for resource-constrained embedded systems at the lowest level. Microcontroller programmers can now develop complete software...
583.68 KB  
NEW DOWNLOADS IN PROGRAMMING, LIBRARIES
Programming  -  FLEX-db Digital Asset Manager 3.0.9
FLEX-db - an enterprise Digital Asset Manager (DAM). It ingests and links metadata with files, creates thumbnails, and processes files using business rules. FLEX-db has a JSP client, Java app server for file input and output and an EJB metadata...
21.57 MB  
Programming  -  Libicom 0.9.0
The libicom library is a character based dynamicly linked library for Linux. It is used to remotely control the Icom IC-R8500 wide band receiver via an RS232 link. All call and return parameters to the control functions are character string based....
20.48 KB  
Programming  -  dotdesktop 0.3
Dotdesktop library provides ability to parse desktop entry file and access the information in a convenient way. Desktop entry file format is defined by freedesktop.org, it is used to describe information about an application such as the name and...
327.68 KB  
Programming  -  Cedalion for Linux 0.2.6
Cedalion is a programming language that allows its users to add new abstractions and define (and use) internal DSLs. Its innovation is in the fact that it uses projectional editing to allow the new abstractions to have no syntactic limitations.
471.04 KB  
Programming  -  libyasl 0.2
Libyasl is a C++ class library to easily realize TCP/UDP/Multicast clientsand servers in IPv4 and IPv6 environments under GNU/Linux systems.
143.36 KB  
Libraries  -  wolfSSL 3.11.0
The wolfSSL embedded SSL/TLS library is a lightweight SSL library written in ANSI standard C and targeted for embedded and RTOS environments - primarily because of its small size, speed, and feature set. It is commonly used in standard operating...
2.73 MB  
Libraries  -  EuGTK 4.8.9
Makes it easy to develop good- looking, fast, cross-platform programs that run on Linux, OS X, and Windows. Euphoria is a very fast interpreted/compiled language with straight-forward syntax. EuGTK allows programming in a clean, object-oriented...
10.68 MB  
Libraries  -  Linux User Group Library Manager 1.0
The LUG Library Manager is a project to help Linux User Groups start their own library. A LUG library is helpful to the community at large because it increases access to information, and gives everyone the opportunity to become more knowledgeable.
5.35 KB  
Libraries  -  Module::MakefilePL::Parse 0.12
Module::MakefilePL::Parse is a Perl module to parse required modules from Makefile.PL. SYNOPSIS use Module::MakefilePL::Parse; open $fh, Makefile.PL; $parser = Module::MakefilePL::Parse->new( join("", ) ); $info = $parser->required;...
8.19 KB  
Libraries  -  sqlpp 0.06
sqlpp Perl package is a SQL preprocessor. sqlpp is a conventional cpp-alike preprocessor taught to understand SQL ( PgSQL, in particular) syntax specificities. In addition to the standard #define/#ifdef/#else/#endif cohort, provides also...
10.24 KB