Download Shareware and Freeware Software for Windows, Linux, Macintosh, PDA



Plucene::Analysis::PorterStemFilter 1.25

  Date Added: September 15, 2010  |  Visits: 767




Plucene::Analysis::PorterStemFilter - Porter stemming on the token stream.

SYNOPSIS

    # isa Plucene::Analysis::TokenFilter
    my $token = $porter_stem_filter->next;

DESCRIPTION

This class transforms the token stream according to the Porter stemming algorithm.

Note: the input to the stemming filter must already be in lower case, so you need to apply LowerCaseFilter or LowerCaseTokenizer earlier in the tokenizer chain, before this filter, for it to work properly.

The Porter stemmer implements the Porter algorithm, which normalizes English words by stripping their suffixes and is used to generalize searches. For example, the Porter algorithm maps both search and searching (as well as searchnessing) to search, so a query for search also matches documents that contain the word searching.

The Porter algorithm is specific to the English language and may give unpredictable results for other languages. Also, make sure to use the same analyzer for both indexing and searching.

You can find more information on the Porter algorithm at www.tartarus.org/~martin/PorterStemmer. An online demonstration of the Porter algorithm is available at www.scs.carleton.ca/~dquesnel/java/stuff/PorterApplet.html.

METHODS

next

    my $token = $porter_stem_filter->next;

Returns the next input token after stemming.
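The filter-chain idea described above can be illustrated with a short, language-neutral sketch in Python. This is not the Plucene API: the class names LowerCaseTokenizer and StemFilter only mirror the roles named in the description, analyze is a hypothetical helper, and toy_stem is a deliberately simplified suffix stripper standing in for the real Porter algorithm. It shows two points from the text: lowercasing happens before stemming, and the same analysis is applied to both indexed text and queries.

```python
import re
from typing import List, Optional


def toy_stem(word: str) -> str:
    """Toy suffix stripper: a stand-in for Porter, NOT the real algorithm."""
    for suffix in ("ing", "es", "ed", "s"):
        # Keep at least three characters of stem so short words survive.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


class LowerCaseTokenizer:
    """Splits text into alphabetic tokens and lowercases each one."""

    def __init__(self, text: str) -> None:
        self._tokens = iter(re.findall(r"[A-Za-z]+", text))

    def next(self) -> Optional[str]:
        token = next(self._tokens, None)
        return token.lower() if token is not None else None


class StemFilter:
    """Wraps an upstream token source and stems each token it yields."""

    def __init__(self, source: LowerCaseTokenizer) -> None:
        self._source = source

    def next(self) -> Optional[str]:
        token = self._source.next()
        return toy_stem(token) if token is not None else None


def analyze(text: str) -> List[str]:
    """Runs the full chain: tokenize and lowercase first, then stem."""
    stream = StemFilter(LowerCaseTokenizer(text))
    tokens = []
    while True:
        token = stream.next()
        if token is None:
            return tokens
        tokens.append(token)


# Use the same analyzer for indexing and for querying.
index = set(analyze("Documents about Searching"))
query_terms = analyze("search")
matches = all(term in index for term in query_terms)
```

Because both the document text and the query pass through analyze, the query term search matches the indexed form of Searching. If the query skipped the stemming step while the index used it (or the reverse), terms such as searching would no longer line up.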

Requirements: No special requirements
Platforms: Linux
Keyword: Algorithm Libraries Pluceneanalysisporterstemfilter Porter Porterstemfilter Programming Stemming Token
Users rating: 0/10

License: Freeware Size: 327.68 KB


PLUCENE::ANALYSIS::PORTERSTEMFILTER RELATED
Utilities  -  PyStem 1.0.4
PyStem is a fast Python module with the Porter stemming algorithm (a process for removing the commoner morphological and inflexional endings from words in English; its main use is as part of a term normalisation process that is usually done...
10.24 KB  
Libraries  -  Lazarus x32 0.9.30.2
The Lazarus IDE application was designed to be a stable and feature rich visual programming environment for the FreePascal Compiler. It supports the creation of self-standing graphical and console applications. Lazarus is the class libraries for...
64.5 MB  
Libraries  -  Lazarus x64 0.9.30.2
The Lazarus IDE application was designed to be a stable and feature rich visual programming environment for the FreePascal Compiler. It supports the creation of self-standing graphical and console applications. Lazarus is the class libraries for...
75.5 MB  
Libraries  -  Lazarus for Mac OS X 0.9.30.2
The Lazarus IDE application was designed to be a stable and feature rich visual programming environment for the FreePascal Compiler. It supports the creation of self-standing graphical and console applications. Lazarus is the class libraries for...
121.3 MB  
Libraries  -  Lazarus for Linux 0.9.30.2
The Lazarus IDE application was designed to be a stable and feature rich visual programming environment for the FreePascal Compiler. It supports the creation of self-standing graphical and console applications. Lazarus is the class libraries for...
53.6 MB  
Utilities  -  Fuse4X 0.9.0
Fuse4X allows you to extend Mac OS X's native file handling capabilities via 3rd-party file systems. It can be used as a software building block for other products. As a user, installing the Fuse4X software package will let you use any...
481.28 KB  
Modules  -  Dutch Stemmer 5.x-1.x-dev 1.0
It reduces each word in the index to its basic root or stem so that variations on a word (e.g. "boom, bomen" or "opheffen, opheffing, opheffende") are considered equivalent when searching. The result of stemming is better search...
 
Teaching Tools  -  Grass GIS for Mac OS X 6.4.1-3
Commonly referred to as GRASS, this is a free Geographic Information System (GIS) used for geospatial data management and analysis, image processing, graphics/maps production, spatial modeling, and visualization. GRASS is currently used in...
59 MB  
Miscellaneous  -  Porter Stemming Algorithm 2.1
This is a fairly faithful implementation of the Porter stemming algorithm that reduces English words to their stems. There is a deviation in the way compound words are stemmed, such as hyphenated words and words starting with certain prefixes. For...
 
Libraries  -  Algorithm::NeedlemanWunsch 0.02
Algorithm::NeedlemanWunsch is a sequence alignment with configurable scoring. SYNOPSIS use Algorithm::NeedlemanWunsch; sub score_sub { if (!@_) { return -2; # gap penalty } return ($_[0] eq $_[1]) ? 1 : -1; } my $matcher =...
10.24 KB  
NEW DOWNLOADS IN PROGRAMMING, LIBRARIES
Programming  -  FLEX-db Digital Asset Manager 3.0.9
FLEX-db - an enterprise Digital Asset Manager (DAM). It ingests and links metadata with files, creates thumbnails, and processes files using business rules. FLEX-db has a JSP client, Java app server for file input and output and an EJB metadata...
21.57 MB  
Programming  -  Libicom 0.9.0
The libicom library is a character-based, dynamically linked library for Linux. It is used to remotely control the Icom IC-R8500 wide band receiver via an RS232 link. All call and return parameters to the control functions are character string based....
20.48 KB  
Programming  -  dotdesktop 0.3
Dotdesktop library provides ability to parse desktop entry file and access the information in a convenient way. Desktop entry file format is defined by freedesktop.org, it is used to describe information about an application such as the name and...
327.68 KB  
Programming  -  Cedalion for Linux 0.2.6
Cedalion is a programming language that allows its users to add new abstractions and define (and use) internal DSLs. Its innovation is in the fact that it uses projectional editing to allow the new abstractions to have no syntactic limitations.
471.04 KB  
Programming  -  libyasl 0.2
Libyasl is a C++ class library for easily implementing TCP/UDP/Multicast clients and servers in IPv4 and IPv6 environments under GNU/Linux systems.
143.36 KB  
Libraries  -  wolfSSL 3.11.0
The wolfSSL embedded SSL/TLS library is a lightweight SSL library written in ANSI standard C and targeted for embedded and RTOS environments - primarily because of its small size, speed, and feature set. It is commonly used in standard operating...
2.73 MB  
Libraries  -  EuGTK 4.8.9
Makes it easy to develop good-looking, fast, cross-platform programs that run on Linux, OS X, and Windows. Euphoria is a very fast interpreted/compiled language with straightforward syntax. EuGTK allows programming in a clean, object-oriented...
10.68 MB  
Libraries  -  Linux User Group Library Manager 1.0
The LUG Library Manager is a project to help Linux User Groups start their own library. A LUG library is helpful to the community at large because it increases access to information, and gives everyone the opportunity to become more knowledgeable.
5.35 KB  
Libraries  -  Module::MakefilePL::Parse 0.12
Module::MakefilePL::Parse is a Perl module to parse required modules from Makefile.PL. SYNOPSIS use Module::MakefilePL::Parse; open $fh, Makefile.PL; $parser = Module::MakefilePL::Parse->new( join("", ) ); $info = $parser->required;...
8.19 KB  
Libraries  -  sqlpp 0.06
sqlpp Perl package is a SQL preprocessor. sqlpp is a conventional cpp-alike preprocessor taught to understand SQL ( PgSQL, in particular) syntax specificities. In addition to the standard #define/#ifdef/#else/#endif cohort, provides also...
10.24 KB