Download Shareware and Freeware Software for Windows, Linux, Macintosh, PDA

line Home  |  About Us  |  Link To Us  |  FAQ  |  Contact

Serving Software Downloads in 956 Categories, Downloaded 50.321.304 Times

YAPE::HTML 1.11

  Date Added: April 25, 2010  |  Visits: 1.247

YAPE::HTML

Report Broken Link
Printer Friendly Version


Product Homepage
Download (382 downloads)



YAPE::HTML is Yet Another Parser/Extractor for HTML. SYNOPSIS use YAPE::HTML; use strict; my $content = "< html>...< /html>"; my $parser = YAPE::HTML->new($content); my ($extor,@fonts,@urls,@headings,@comments); # here is the tokenizing part while (my $chunk = $parser->next) { if ($chunk->type eq tag and $chunk->tag eq font) { if (my $face = $chunk->get_attr(face)) { push @fonts, $face; } } } # here we catch any errors unless ($parser->done) { die sprintf "bad HTML: %s (%s)", $parser->error, $parser->chunk; } # here is the extracting part # < A> tags with HREF attributes # < IMG> tags with SRC attributes $extor = $parser->extract(a => [href], img => [src]); while (my $chunk = $extor->()) { push @urls, $chunk->get_attr( $chunk->tag eq a ? href : src ); } # < H1>, < H2>, ..., < H6> tags $extor = $parser->extract(qr/^h[1-6]$/ => []); while (my $chunk = $extor->()) { push @headings, $chunk; } # all comments $extor = $parser->extract(-COMMENT => []); while (my $chunk = $extor->()) { push @comments, $chunk; } YAPE MODULES The YAPE hierarchy of modules is an attempt at a unified means of parsing and extracting content. It attempts to maintain a generic interface, to promote simplicity and reusability. The API is powerful, yet simple. The modules do tokenization (which can be intercepted) and build trees, so that extraction of specific nodes is doable. This module is yet another parser and tree-builder for HTML documents. It is designed to make extraction and modification of HTML documents simplistic. The API allows for easy custom additions to the document being parsed, and allows very specific tag, text, and comment extraction..

Requirements: No special requirements
Platforms: Linux
Keyword: Chunk Html Libraries Programming Tag Yape Yapehtml Yet Another Yet Another Parser
Users rating: 0/10

License: Freeware Size: 19.46 KB
USER REVIEWS
More Reviews or Write Review


YAPE::HTML RELATED
Libraries  -  HTML::LoL 1.3
HTML::LoL is a Perl module that can construct HTML from pleasing Perl data structures. SYNOPSIS use HTML::LoL; &hl(sub { print shift }, [body => {bgcolor => white}, [p => Document body, ...], ...]); This module allows you to use Perl...
5.12 KB  
Libraries  -  HTML::Chunks 1.53
HTML::Chunks is a simple nested template engine for HTML, XML and XHTML. This class implements a simple text-based template engine, originally intented to allow web applications to completely separate layout HTML from programming logic. However,...
10.24 KB  
Libraries  -  HTML::Manipulator 0.07
HTML::Manipulator is a Perl extension for manipulating HTML files. SYNOPSIS use HTML::Manipulator; my $html = New news);...
12.29 KB  
Libraries  -  HTML Objects 1.2.4
HTML Objects is a Perl module library for turning HTML tags into Perl objects. HTML Objects allows Web pages to be manipulated as a data structure rather than text. Once manipulation is done, the entire page is generated via depth-first...
25.6 KB  
Network & Internet  -  django-html-field 0.1.3
django-html-field provides a HTML sanitization and tag stripping either at the database level or at the form level, with no dependencies outside of the standard library.
10.24 KB  
Libraries  -  GtkHTML 3.15.90
GtkHTML is a HTML rendering/editing library. GtkHTML is not designed to be the ultimate HTML browser/editor: instead, it is designed to be easily embedded into applications that require lightweight HTML functionality. Whats New in This...
1.8 MB  
Development Editors  -  PPWIZARD 04.106
PPWIZARD is a free html preprocessor which is powerful and yet easy to use. If you know what server side includes are then you could think of ppwizard as a "super SSI". Preprocessors allow you to easily include external files (e.g. common page...
1.5 MB  
Web Server Tools  -  Get Going Server 1.00.1175
FREE - Website creator / HTTP Server, DIY, GUI-based, Portable, No html / script programming required. Quick, simple and easy website creation - Browser-based tick, type and save editing system.Built-in server for DIY webhosting from your own...
574 KB  
Libraries  -  Locale::Messages 1.16
Locale::Messages is a Perl module with Gettext Like Message Retrieval. SYNOPSIS use Locale::Messages (:locale_h :libintl_h); gettext $msgid; dgettext $textdomain, $msgid; dcgettext $textdomain, $msgid, LC_MESSAGES; ngettext $msgid,...
460.8 KB  
Libraries  -  PerlPoint::Parser 0.45
PerlPoint::Parser Perl module is a PerlPoint Parser. SYNOPSIS # load the module: use PerlPoint::Parser; # build the parser and run it # to get intermediate data in @stream my ($parser)=new PerlPoint::Parser; $parser->run( stream =>...
419.84 KB  
NEW DOWNLOADS IN PROGRAMMING, LIBRARIES
Programming  -  Cedalion for Linux 0.2.6
Cedalion is a programming language that allows its users to add new abstractions and define (and use) internal DSLs. Its innovation is in the fact that it uses projectional editing to allow the new abstractions to have no syntactic limitations.
471.04 KB  
Programming  -  Math::GMPf 0.29
Math::GMPf - perl interface to the GMP library's floating point (mpf) functions.
30.72 KB  
Programming  -  Net::Wire10 1.08
Net::Wire10 is a Pure Perl connector that talks to Sphinx, MySQL and Drizzle servers. Net::Wire10 implements the low-level network protocol, alias the MySQL wire protocol version 10, necessary for talking to one of the aforementioned...
30.72 KB  
Programming  -  logilab-common 0.56.2
a bunch of modules providing low level functionnalities shared among some python projects devel Please note that some of the modules have some extra dependencies. For instance, logilab.common.db will require a db-api 2.0 compliant...
174.08 KB  
Programming  -  OpenSSL for linux 1.0.0a
The OpenSSL Project is a collaborative effort to develop a robust, commercial-grade, full-featured, and Open Source toolkit implementing the Secure Sockets Layer (SSL v2/v3) and Transport Layer Security (TLS v1) protocols as well as a...
3.83 MB  
Libraries  -  wolfSSL 4.0.0
The wolfSSL embedded SSL/TLS library is a lightweight SSL library written in ANSI standard C and targeted for embedded and RTOS environments - primarily because of its small size, speed, and feature set. It is commonly used in standard operating...
3.88 MB  
Libraries  -  EuGTK 4.8.9
Makes it easy to develop good- looking, fast, cross-platform programs that run on Linux, OS X, and Windows. Euphoria is a very fast interpreted/compiled language with straight-forward syntax. EuGTK allows programming in a clean, object-oriented...
10.68 MB  
Libraries  -  Linux User Group Library Manager 1.0
The LUG Library Manager is a project to help Linux User Groups start their own library. A LUG library is helpful to the community at large because it increases access to information, and gives everyone the opportunity to become more knowledgeable.
5.35 KB  
Libraries  -  Module::MakefilePL::Parse 0.12
Module::MakefilePL::Parse is a Perl module to parse required modules from Makefile.PL. SYNOPSIS use Module::MakefilePL::Parse; open $fh, Makefile.PL; $parser = Module::MakefilePL::Parse->new( join("", ) ); $info = $parser->required;...
8.19 KB  
Libraries  -  sqlpp 0.06
sqlpp Perl package is a SQL preprocessor. sqlpp is a conventional cpp-alike preprocessor taught to understand SQL ( PgSQL, in particular) syntax specificities. In addition to the standard #define/#ifdef/#else/#endif cohort, provides also...
10.24 KB