Download Shareware and Freeware Software for Windows, Linux, Macintosh, PDA

line Home  |  About Us  |  Link To Us  |  FAQ  |  Contact

Serving Software Downloads in 976 Categories, Downloaded 29.540.630 Times

Sort::ArbBiLex 4.01

  Date Added: May 23, 2010  |  Visits: 675

Sort::ArbBiLex

Report Broken Link
Printer Friendly Version


Product Homepage
Download (72 downloads)



Sort::ArbBiLex is a Perl module that can make sort functions for arbitrary sort orders. SYNOPSIS use Sort::ArbBiLex ( fulani_sort, # ask for a &fulani_sort to be defined "a A c C ch Ch CH ch Ch CH e E l L lh Lh LH n N r R s S u U z Z " ); @words = <>; @stuff = fulani_sort(@words); foreach (@stuff) { print "<$_>n" } CONCEPTS Writing systems for different languages usually have specific sort orders for the glyphs (characters, or clusters of characters) that each writing system uses. For well-known national languages, these different sort orders (or someones idea of them) are formalized in the locale for each such language, on operating system flavors that support locales. However, there are problems with locales; cf. perllocale. Chief among the problems relevant here are: * The basic concept of "locale" conflates language/dialect, writing system, and character set -- and country/region, to a certain extent. This may be inappropriate for the text you want to sort. Notably, this assumes standardization where none may exist (whats THE sort order for a language that has five different Roman-letter-based writing systems in use?). * On many OS flavors, there is no locale support. * Even on many OS flavors that do suport locales, the user cannot create his own locales as needed. * The "scope" of a locale may not be what the user wants -- if you want, in a single program, to sort the array @foo by one locale, and an array @bar by another locale, this may prove difficult or impossible. In other words, locales (even if available) may not sort the way you want, and are not portable in any case. This module is meant to provide an alternative to locale-based sorting. This module makes functions for you that implement bi-level lexicographic sorting according to a sort order you specify. "Lexicographic sorting" means comparing the letters (or properly, "glyphs", as Ill call them here, when a single glyph can encompass several letters, as with digraphs) in strings, starting from the start of the string (so that "apple" comes after "apoplexy", say) -- as opposed to, say, sorting by numeric value. "Lexicographic sorting" is sometimes used to mean just "ASCIIbetical sorting", but I use it to mean the sort order used by lexicographers, in dictionaries (at least for alphabetic languages). Consider the words "resume" and "résumé" (the latter should display on your POD viewer with acute accents on the es). If you declare a sort order such that e-acute ("é") is a letter after e (no accent), then "résumé" (with accents) would sort after every word starting with "re" (no accent) -- so "résumé" (with accents) would come after "reward". If, however, you treated e (no accent) and e-acute as the same letter, the ordering of "resume" and "résumé" (with accents) would be unpredictable, since they would count as the same thing -- whereas "resume" should always come before "résumé" (with accents) in English dictionaries. What bi-level lexicographic sorting means is that you can stipulate that two letters like e (no accent) and e-acute ("é") generally count as the same letter (so that they both sort before "reward"), but that when theres a tie based on comparison that way (like the tie between "resume" and "résumé" (with accents)), the tie is broken by a stipulation that at a second level, e (no accent) does come before e-acute ("é"). (Some systems of sort order description allow for any number of levels in sort orders -- but I cant imagine a case where this gets you anything over a two-level sort.) Moreover, the units of sorting for a writing system may not be characters exactly. In some forms of Spanish, ch, while two characters, counts as one glyph -- a "letter" after c (at the first level, not just the second, like the e in the paragraph above). So "cuerno" comes before "chile". A character-based sort would not be able to see that "ch" should count as anything but "c" and "h". So this library doesnt assume that the units of comparison are necessarily individual characters..

Requirements: No special requirements
Platforms: Linux
Keyword: Arbbilex Ch Lexicographic Sorting Libraries Module Perl Module Programming Rsum Sort Sort Order Sortarbbilex Sorting
Users rating: 0/10

License: Freeware Size: 16.38 KB
USER REVIEWS
More Reviews or Write Review


SORT::ARBBILEX RELATED
Libraries  -  Alien Perl module 0.91
Alien Perl module package contains external libraries wrapped up for your viewing pleasure! SYNOPSIS perldoc Alien; Alien is a package that exists just to hold together an idea, the idea of Alien:: packages, so there is no code here, just...
10.24 KB  
Libraries  -  OpenGeoDB Perl module 0.4
OpenGeDB Perl module is a module to access the OpenGeoDB database and calculate all ZIP codes in a certain radius..
3.07 KB  
Network & Internet  -  MyCMS perl module 1.0
MyCMS perl module provides the MN::CMS Perl module used by the MyCMS. MyCMS perl module contains Perl object classes to manage the data of MyCMS (such as articles, links, and images). MN::CMS is a perl module that allows you to manage an...
16.38 KB  
Libraries  -  Syntax::Highlight::Perl 1.0
Syntax::Highlight::Perl is a Perl module that can highlight Perl Syntactical Structures. SYNOPSIS # simple procedural use Syntax::Highlight::Perl :BASIC; # or :FULL print format_string($my_string); # OO use Syntax::Highlight::Perl; my...
33.79 KB  
Libraries  -  Chart::Graph::Xrt2d 3.2
Chart::Graph::Xrt2d is a Perl module for creating graph charts. SYNOPSIS #Include module use Chart::Graph::Xrt2d qw(xrt2d); # Function call xrt2d(%options, [%data_options1, @data_set1], [%data_options2, @data_set2], . . ); This...
245.76 KB  
Libraries  -  Class::Adapter::Builder 1.02
Class::Adapter::Builder is a Perl module that can generate Class::Adapter classes. SYNOPSIS package My::Adapter; use strict; use Class::Adapter::Builder ISA => Specific::API, METHODS => [ qw{foo bar baz} ], method => different_method;...
24.58 KB  
Libraries  -  Apache::TestRun 1.28
Apache::TestRun is a Perl module to run the test suite. SYNOPSIS The Apache::TestRun package controls the configuration and running of the test suite. METHODS Several methods are sub-classable, if the default behavior should be changed....
143.36 KB  
Libraries  -  Locale::Maketext::Extract::Run 0.62
Locale::Maketext::Extract::Run is a Perl module interface to xgettext.pl. SYNOPSIS use Locale::Maketext::Extract::Run xgettext; xgettext(@ARGV);.
83.97 KB  
Libraries  -  PDF::FromHTML::Twig 0.23
PDF::FromHTML::Twig is a Perl module with PDF::FromHTML guts. SYNOPSIS (internal use only) No user-serviceable parts inside..
52.22 KB  
Libraries  -  Convert::EastAsianWidth 0.03
Convert::EastAsianWidth is a Perl module that can convert between full- and half-width characters. SYNOPSIS # Exports to_fullwidth() and to_halfwidth() by default use Convert::EastAsianWidth; my $u = to_fullwidth(ABC); # Full-width variant...
14.34 KB  
NEW DOWNLOADS IN PROGRAMMING, LIBRARIES
Programming  -  FLEX-db Digital Asset Manager 3.0.9
FLEX-db - an enterprise Digital Asset Manager (DAM). It ingests and links metadata with files, creates thumbnails, and processes files using business rules. FLEX-db has a JSP client, Java app server for file input and output and an EJB metadata...
21.57 MB  
Programming  -  Libicom 0.9.0
The libicom library is a character based dynamicly linked library for Linux. It is used to remotely control the Icom IC-R8500 wide band receiver via an RS232 link. All call and return parameters to the control functions are character string based....
20.48 KB  
Programming  -  dotdesktop 0.3
Dotdesktop library provides ability to parse desktop entry file and access the information in a convenient way. Desktop entry file format is defined by freedesktop.org, it is used to describe information about an application such as the name and...
327.68 KB  
Programming  -  Cedalion for Linux 0.2.6
Cedalion is a programming language that allows its users to add new abstractions and define (and use) internal DSLs. Its innovation is in the fact that it uses projectional editing to allow the new abstractions to have no syntactic limitations.
471.04 KB  
Programming  -  libyasl 0.2
Libyasl is a C++ class library to easily realize TCP/UDP/Multicast clientsand servers in IPv4 and IPv6 environments under GNU/Linux systems.
143.36 KB  
Libraries  -  EuGTK 4.8.9
Makes it easy to develop good- looking, fast, cross-platform programs that run on Linux, OS X, and Windows. Euphoria is a very fast interpreted/compiled language with straight-forward syntax. EuGTK allows programming in a clean, object-oriented...
10.68 MB  
Libraries  -  Linux User Group Library Manager 1.0
The LUG Library Manager is a project to help Linux User Groups start their own library. A LUG library is helpful to the community at large because it increases access to information, and gives everyone the opportunity to become more knowledgeable.
5.35 KB  
Libraries  -  Module::MakefilePL::Parse 0.12
Module::MakefilePL::Parse is a Perl module to parse required modules from Makefile.PL. SYNOPSIS use Module::MakefilePL::Parse; open $fh, Makefile.PL; $parser = Module::MakefilePL::Parse->new( join("", ) ); $info = $parser->required;...
8.19 KB  
Libraries  -  sqlpp 0.06
sqlpp Perl package is a SQL preprocessor. sqlpp is a conventional cpp-alike preprocessor taught to understand SQL ( PgSQL, in particular) syntax specificities. In addition to the standard #define/#ifdef/#else/#endif cohort, provides also...
10.24 KB  
Libraries  -  App::SimpleScan::Substitution::Line 2.02
App::SimpleScan::Substitution::Line is a line with optional fixed variable values. SYNOPSIS my $line = App::SimpleScan::Substitution::Line->new(" this "); # Use only this value when substituting " ". $line->fix(substituite =>...
54.27 KB