Download Shareware and Freeware Software for Windows, Linux, Macintosh, PDA

line Home  |  About Us  |  Link To Us  |  FAQ  |  Contact

Serving Software Downloads in 956 Categories, Downloaded 50.208.069 Times

Unicode::Map8 0.12

  Date Added: March 06, 2010  |  Visits: 1.017

Unicode::Map8

Report Broken Link
Printer Friendly Version


Product Homepage
Download (108 downloads)



Unicode::Map8 is a mapping table between 8-bit chars and Unicode. SYNOPSIS require Unicode::Map8; my $no_map = Unicode::Map8->new("ISO646-NO") || die; my $l1_map = Unicode::Map8->new("latin1") || die; my $ustr = $no_map->to16("V}re norske tegn b|r {resn"); my $lstr = $l1_map->to8($ustr); print $lstr; print $no_map->tou("V}re norske tegn b|r {resn")->utf8 The Unicode::Map8 class implement efficient mapping tables between 8-bit character sets and 16 bit character sets like Unicode. The tables are efficient both in terms of space allocated and translation speed. The 16-bit strings is assumed to use network byte order. The following methods are available: $m = Unicode::Map8->new( [$charset] ) The object constructor creates new instances of the Unicode::Map8 class. I takes an optional argument that specify then name of a 8-bit character set to initialize mappings from. The argument can also be a the name of a mapping file. If the charset/file can not be located, then the constructor returns undef. If you omit the argument, then an empty mapping table is constructed. You must then add mapping pairs to it using the addpair() method described below. $m->addpair( $u8, $u16 ); Adds a new mapping pair to the mapping object. It takes two arguments. The first is the code value in the 8-bit character set and the second is the corresponding code value in the 16-bit character set. The same codes can be used multiple times (but using the same pair has no effect). The first definition for a code is the one that is used. Consider the following example: $m->addpair(0x20, 0x0020); $m->addpair(0x20, 0x00A0); $m->addpair(0xA0, 0x00A0); It means that the character 0x20 and 0xA0 in the 8-bit charset maps to themselves in the 16-bit set, but in the 16-bit character set 0x0A0 maps to 0x20. $m->default_to8( $u8 ) Set the code of the default character to use when mapping from 16-bit to 8-bit strings. If there is no mapping pair defined for a character then this default is substituted by to8() and recode8(). $m->default_to16( $u16 ) Set the code of the default character to use when mapping from 8-bit to 16-bit strings. If there is no mapping pair defined for a character then this default is used by to16(), tou() and recode8(). $m->nostrict; All undefined mappings are replaced with the identity mapping. Undefined character are normally just removed (or replaced with the default if defined) when converting between character sets. $m->to8( $ustr ); Converts a 16-bit character string to the corresponding string in the 8-bit character set. $m->to16( $str ); Converts a 8-bit character string to the corresponding string in the 16-bit character set. $m->tou( $str ); Same an to16() but return a Unicode::String object instead of a plain UCS2 string. $m->recode8($m2, $str); Map the string $str from one 8-bit character set ($m) to another one ($m2). Since we assume we know the mappings towards the common 16-bit encoding we can use this to convert between any of the 8-bit character sets. $m->to_char16( $u8 ) Maps a single 8-bit character code to an 16-bit code. If the 8-bit character is unmapped then the constant NOCHAR is returned. The default is not used and the callback method is not invoked. $m->to_char8( $u16 ) Maps a single 16-bit character code to an 8-bit code. If the 16-bit character is unmapped then the constant NOCHAR is returned. The default is not used and the callback method is not invoked. The following callback methods are available. You can override these methods by creating a subclass of Unicode::Map8. $m->unmapped_to8 When mapping to 8-bit character string and there is no mapping defined (and no default either), then this method is called as the last resort. It is called with a single integer argument which is the code of the unmapped 16-bit character. It is expected to return a string that will be incorporated in the 8-bit string. The default version of this method always returns an empty string. Example: package MyMapper; @ISA=qw(Unicode::Map8); sub unmapped_to8 { my($self, $code) = @_; require Unicode::CharName; "<" . Unicode::CharName::uname($code) . ">"; } $m->unmapped_to16 Likewise when mapping to 16-bit character string and no mapping is defined then this method is called. It should return a 16-bit string with the bytes in network byte order. The default version of this method always returns an empty string..

Requirements: No special requirements
Platforms: Linux
Keyword: Character Character Set Character Sets Character String Libraries M- Mapping Mapping Table No Mapping Programming String
Users rating: 0/10

License: Freeware Size: 102.4 KB
USER REVIEWS
More Reviews or Write Review


UNICODE::MAP8 RELATED
Libraries  -  Set::Integer::Gapfillers 0.07
Set::Integer::Gapfillers is a Perl module that can fill in the gaps between integer ranges. SYNOPSIS use Set::Integer::Gapfillers; $gf = Set::Integer::Gapfillers->new( lower => -12, upper => 62, sets => [ [ 1, 17 ], # Note: Use comma, not...
17.41 KB  
Libraries  -  StrBio 1.0
StrBio is a set of Java classes and libraries useful for development of software for computational structural biology research. They are licenced under the LGPL. he strbio.org classes are the basis for several published research projects,...
12.7 MB  
Libraries  -  Classeine::Sdk 0.0.3.0-b0004
Classeine::Sdk project is a set of general purpose class libraries written in C++ for NetBSD..
51.2 KB  
Programming  -  Portable Object Compiler 0.2.2
Portable Object Compiler project consists of a set of Objective-C class libraries and a precompiler (translator) that generates plain C code..
839.68 KB  
Libraries  -  Chilkat Class Libraries for Java 9.2.1
The full suite of Chilkat components & libraries are now available for the Java programming language. Commercially licensed components include: Email (POP3 / SMTP) IMAP Zip, GZip, and Unix Compress Encryption MIME and S/MIME FTP HTTP MHT...
 
Utilities  -  armidale - simplified web development 1.0
Armidale is a set of Java programs and libraries used to radically simplify the development, deployment and use of web applications which have rich graphical user interfaces.
4.66 MB  
Programming  -  Open WS-Policy 0.01
Open WS-Policy,an Open Source Web Services Policy Framework implementation,is a set of open source Java libraries that implement the ws-policy specifications.
4.16 MB  
File Converters  -  File Touch 1.0.0.1
When developing or testing software, there are times when you need to reset a file's date and time created, last modified, and/or last accessed. FileTouch modifies file date/times on individual files, or sets of files. Simply run the single...
119 KB  
Shell & Desktop  -  Hashell 0.013a
Hashell project is a shell that is written in Haskell language and which it is intended to provide a set of abstractions that allows you to use Haskell as a shell programming language. So that way you can use haskell for your daily...
15.36 KB  
Graphics Viewers  -  Java Applet Picture Viewer using AWT 1.0
The Java Applet AWT Picture Browser allows anyone to set up and maintain an online gallery of images using no server scripting, and no JavaScript. The JPEG images are transferred to the server as folders of *.jpg files.
25.45 KB  
NEW DOWNLOADS IN PROGRAMMING, LIBRARIES
Programming  -  Cedalion for Linux 0.2.6
Cedalion is a programming language that allows its users to add new abstractions and define (and use) internal DSLs. Its innovation is in the fact that it uses projectional editing to allow the new abstractions to have no syntactic limitations.
471.04 KB  
Programming  -  Math::GMPf 0.29
Math::GMPf - perl interface to the GMP library's floating point (mpf) functions.
30.72 KB  
Programming  -  Net::Wire10 1.08
Net::Wire10 is a Pure Perl connector that talks to Sphinx, MySQL and Drizzle servers. Net::Wire10 implements the low-level network protocol, alias the MySQL wire protocol version 10, necessary for talking to one of the aforementioned...
30.72 KB  
Programming  -  logilab-common 0.56.2
a bunch of modules providing low level functionnalities shared among some python projects devel Please note that some of the modules have some extra dependencies. For instance, logilab.common.db will require a db-api 2.0 compliant...
174.08 KB  
Programming  -  OpenSSL for linux 1.0.0a
The OpenSSL Project is a collaborative effort to develop a robust, commercial-grade, full-featured, and Open Source toolkit implementing the Secure Sockets Layer (SSL v2/v3) and Transport Layer Security (TLS v1) protocols as well as a...
3.83 MB  
Libraries  -  wolfSSL 4.0.0
The wolfSSL embedded SSL/TLS library is a lightweight SSL library written in ANSI standard C and targeted for embedded and RTOS environments - primarily because of its small size, speed, and feature set. It is commonly used in standard operating...
3.88 MB  
Libraries  -  EuGTK 4.8.9
Makes it easy to develop good- looking, fast, cross-platform programs that run on Linux, OS X, and Windows. Euphoria is a very fast interpreted/compiled language with straight-forward syntax. EuGTK allows programming in a clean, object-oriented...
10.68 MB  
Libraries  -  Linux User Group Library Manager 1.0
The LUG Library Manager is a project to help Linux User Groups start their own library. A LUG library is helpful to the community at large because it increases access to information, and gives everyone the opportunity to become more knowledgeable.
5.35 KB  
Libraries  -  Module::MakefilePL::Parse 0.12
Module::MakefilePL::Parse is a Perl module to parse required modules from Makefile.PL. SYNOPSIS use Module::MakefilePL::Parse; open $fh, Makefile.PL; $parser = Module::MakefilePL::Parse->new( join("", ) ); $info = $parser->required;...
8.19 KB  
Libraries  -  sqlpp 0.06
sqlpp Perl package is a SQL preprocessor. sqlpp is a conventional cpp-alike preprocessor taught to understand SQL ( PgSQL, in particular) syntax specificities. In addition to the standard #define/#ifdef/#else/#endif cohort, provides also...
10.24 KB