Download Shareware and Freeware Software for Windows, Linux, Macintosh, PDA

line Home  |  About Us  |  Link To Us  |  FAQ  |  Contact

Serving Software Downloads in 976 Categories, Downloaded 29.990.270 Times

YAPE::HTML 1.11

  Date Added: April 25, 2010  |  Visits: 1.041

YAPE::HTML

Report Broken Link
Printer Friendly Version


Product Homepage
Download (359 downloads)

YAPE::HTML is Yet Another Parser/Extractor for HTML. SYNOPSIS use YAPE::HTML; use strict; my $content = "< html>...< /html>"; my $parser = YAPE::HTML->new($content); my ($extor,@fonts,@urls,@headings,@comments); # here is the tokenizing part while (my $chunk = $parser->next) { if ($chunk->type eq tag and $chunk->tag eq font) { if (my $face = $chunk->get_attr(face)) { push @fonts, $face; } } } # here we catch any errors unless ($parser->done) { die sprintf "bad HTML: %s (%s)", $parser->error, $parser->chunk; } # here is the extracting part # < A> tags with HREF attributes # < IMG> tags with SRC attributes $extor = $parser->extract(a => [href], img => [src]); while (my $chunk = $extor->()) { push @urls, $chunk->get_attr( $chunk->tag eq a ? href : src ); } # < H1>, < H2>, ..., < H6> tags $extor = $parser->extract(qr/^h[1-6]$/ => []); while (my $chunk = $extor->()) { push @headings, $chunk; } # all comments $extor = $parser->extract(-COMMENT => []); while (my $chunk = $extor->()) { push @comments, $chunk; } YAPE MODULES The YAPE hierarchy of modules is an attempt at a unified means of parsing and extracting content. It attempts to maintain a generic interface, to promote simplicity and reusability. The API is powerful, yet simple. The modules do tokenization (which can be intercepted) and build trees, so that extraction of specific nodes is doable. This module is yet another parser and tree-builder for HTML documents. It is designed to make extraction and modification of HTML documents simplistic. The API allows for easy custom additions to the document being parsed, and allows very specific tag, text, and comment extraction..

Requirements: No special requirements
Platforms: Linux
Keyword: Chunk Html Libraries Programming Tag Yape Yapehtml Yet Another Yet Another Parser
Users rating: 0/10

License: Freeware Size: 19.46 KB
YAPE::HTML RELATED
Libraries  -  HTML::YaTmpl 1.8
HTML::YaTmpl is yet another template processor. SYNOPSIS use HTML::YaTmpl; my $t=HTML::YaTmpl->new( file=>template.tmpl ); $t->evaluate( key1=>$value1, key2=>[$val21, $val22, ...] ... ); $t->evaluate_to_file( $outputfilename,...
41.98 KB  
Libraries  -  HTML::LoL 1.3
HTML::LoL is a Perl module that can construct HTML from pleasing Perl data structures. SYNOPSIS use HTML::LoL; &hl(sub { print shift }, [body => {bgcolor => white}, [p => Document body, ...], ...]); This module allows you to use Perl...
5.12 KB  
Libraries  -  HTML::Chunks 1.53
HTML::Chunks is a simple nested template engine for HTML, XML and XHTML. This class implements a simple text-based template engine, originally intented to allow web applications to completely separate layout HTML from programming logic. However,...
10.24 KB  
Libraries  -  HTML::Manipulator 0.07
HTML::Manipulator is a Perl extension for manipulating HTML files. SYNOPSIS use HTML::Manipulator; my $html = New news);...
12.29 KB  
Libraries  -  HTML Objects 1.2.4
HTML Objects is a Perl module library for turning HTML tags into Perl objects. HTML Objects allows Web pages to be manipulated as a data structure rather than text. Once manipulation is done, the entire page is generated via depth-first...
25.6 KB  
Libraries  -  GtkHTML 3.15.90
GtkHTML is a HTML rendering/editing library. GtkHTML is not designed to be the ultimate HTML browser/editor: instead, it is designed to be easily embedded into applications that require lightweight HTML functionality. Whats New in This...
1.8 MB  
Libraries  -  ePerl 2.2.13
ePerl is an Embedded Perl 5 Language. SYNOPSIS eperl [-d name=value] [-D name=value] [-B begin_delimiter] [-E end_delimiter] [-i] [-m mode] [-o outputfile] [-k] [-I directory] [-P] [-C] [-L] [-x] [-T] [-w] [-c] [inputfile] eperl [-r] [-l]...
194.56 KB  
Audio Tools  -  Jaikoz Tagger 4.5.0
The Jaikoz Audio Tag Editor is a powerful yet simple to use tool that allows you to organize, edit and correct thousands of songs with ease. Key Features are: -> Supports Mp4, M4p, M4a, Mp3, Wma, Flac and Ogg Audio Files -> Acoustic matching using...
16.06 MB  
Network & Internet  -  django-html-field 0.1.3
django-html-field provides a HTML sanitization and tag stripping either at the database level or at the form level, with no dependencies outside of the standard library.
10.24 KB  
Web Server Tools  -  Get Going Server 1.00.1175
FREE - Website creator / HTTP Server, DIY, GUI-based, Portable, No html / script programming required. Quick, simple and easy website creation - Browser-based tick, type and save editing system.Built-in server for DIY webhosting from your own...
574 KB  
NEW DOWNLOADS IN PROGRAMMING, LIBRARIES
Programming  -  FLEX-db Digital Asset Manager 3.0.9
FLEX-db - an enterprise Digital Asset Manager (DAM). It ingests and links metadata with files, creates thumbnails, and processes files using business rules. FLEX-db has a JSP client, Java app server for file input and output and an EJB metadata...
21.57 MB  
Programming  -  Libicom 0.9.0
The libicom library is a character based dynamicly linked library for Linux. It is used to remotely control the Icom IC-R8500 wide band receiver via an RS232 link. All call and return parameters to the control functions are character string based....
20.48 KB  
Programming  -  dotdesktop 0.3
Dotdesktop library provides ability to parse desktop entry file and access the information in a convenient way. Desktop entry file format is defined by freedesktop.org, it is used to describe information about an application such as the name and...
327.68 KB  
Programming  -  Cedalion for Linux 0.2.6
Cedalion is a programming language that allows its users to add new abstractions and define (and use) internal DSLs. Its innovation is in the fact that it uses projectional editing to allow the new abstractions to have no syntactic limitations.
471.04 KB  
Programming  -  libyasl 0.2
Libyasl is a C++ class library to easily realize TCP/UDP/Multicast clientsand servers in IPv4 and IPv6 environments under GNU/Linux systems.
143.36 KB  
Libraries  -  wolfSSL 3.12.0
The wolfSSL embedded SSL/TLS library is a lightweight SSL library written in ANSI standard C and targeted for embedded and RTOS environments - primarily because of its small size, speed, and feature set. It is commonly used in standard operating...
2.73 MB  
Libraries  -  EuGTK 4.8.9
Makes it easy to develop good- looking, fast, cross-platform programs that run on Linux, OS X, and Windows. Euphoria is a very fast interpreted/compiled language with straight-forward syntax. EuGTK allows programming in a clean, object-oriented...
10.68 MB  
Libraries  -  Linux User Group Library Manager 1.0
The LUG Library Manager is a project to help Linux User Groups start their own library. A LUG library is helpful to the community at large because it increases access to information, and gives everyone the opportunity to become more knowledgeable.
5.35 KB  
Libraries  -  Module::MakefilePL::Parse 0.12
Module::MakefilePL::Parse is a Perl module to parse required modules from Makefile.PL. SYNOPSIS use Module::MakefilePL::Parse; open $fh, Makefile.PL; $parser = Module::MakefilePL::Parse->new( join("", ) ); $info = $parser->required;...
8.19 KB  
Libraries  -  sqlpp 0.06
sqlpp Perl package is a SQL preprocessor. sqlpp is a conventional cpp-alike preprocessor taught to understand SQL ( PgSQL, in particular) syntax specificities. In addition to the standard #define/#ifdef/#else/#endif cohort, provides also...
10.24 KB