Download Shareware and Freeware Software for Windows, Linux, Macintosh, PDA

line Home  |  About Us  |  Link To Us  |  FAQ  |  Contact

Serving Software Downloads in 976 Categories, Downloaded 30.156.334 Times

HTML Entity Based Codepage Inference 0.01

  Date Added: March 06, 2010  |  Visits: 787

HTML Entity Based Codepage Inference

Report Broken Link
Printer Friendly Version


Product Homepage
Download (73 downloads)

HEBCI is a technique that allows a web form handler to transparently detect the character set its data was encoded with. By using carefully-chosen character references, the browsers encoding can be inferred. Thus, it is possible to guarantee that data is in a standard encoding without relying on (often unreliable) webserver/browser encoding interactions. The ideal solution will be entirely browser-neutral and passive. Unfortunately, the HTML spec doesnt define any mechanism for this. We need to find some other, sneakier, way to extract the current character encoding from the browser. Luckily for us, there is a trick we can use for this: entity codes. Entity codes are strings like &, which were (are) used to encode specific characters without using Unicode. When the browser displays a page, it replaces these with the appropriate character from the current encoding. Thus, & becomes the character 0x26 in most codepages. By itself, this is merely implementation trivia. However, this translation process occurs whenever a user submits a form. That is, the browser parses any entities in the form variables and replaces them with the current encodings representation of those characters when the user clicks submit. Thus, any entity codes within the form fields are passed along as character values in the browsers current encoding. So, all we have to do is find an entity that is encoded differently in two different codepages. We slip that into a form field, and then look at its value when we get data. This allows us to differentiate between the two encodings. In fact, we could look at all entities in many codepages, and find the ones that allowed us to disambiguate between many codepages. This is what Ive done. We add hidden form elements with values containing various entity codes, such as °, ÷, and —. Then, when the user submits the form, we take each of those and compare them against a list of what character has what value in what codepage. That is, each codepage has a unique fingerprint for the values of °,÷,—. For MacRoman, its a1,d6,d1; for UTF-8, c2b0,c3b7,e28094. Thus, we only have to go through our table of codepage-to-fingerprint mappings, and see which fingerprint matches. Note that, once this table is discovered, the cost of fingerprinting a given form submission is very low. And, in the case of misses, you can assume whatever your pages default codepage is. This fallthrough case is equivalent to what the code would have done before adding this detection layer..

Requirements: No special requirements
Platforms: Linux
Keyword: Based Character Character Set Codepage Entity Entity Codes Form Form Handler Hebci Html Html Entity Html Entity Based Codepage Inference Inference Its Data Web Form
Users rating: 0/10

License: Freeware Size: 5.12 KB
HTML ENTITY BASED CODEPAGE INFERENCE RELATED
Libraries  -  Cal3D 0.11.0
Cal3d project is a skeletal based 3d character animation library written in C++ in a platform-/graphic API-independent way. Originally designed to be used in a 3d client for Worldforge, it evolved into a stand-alone product which can be used in...
2.9 MB  
Games  -  DnD - The Next Generator 1.0
This project want to create a Perl and XML based DnD Character Generator
2.42 KB  
Utilities  -  Evil Clipboard 1.0
Copy and paste special characters to your clipboard Evil Clipboard is a free and easy to use Java software that allows you to copy and paste a control character into any application or web form!
10.24 KB  
Games  -  Dragon's Blade II FX 1.0.3.5
The sequel to the popular Windows Phone RPG, Dragon's Blade has arrived! Dragon's Blade II continues the saga with classic turn-based RPG gameplay set in a massive new world. Enjoy the story at your own pace or play online with...
24 MB  
Entertainment  -  RPG Kit Lite 1.0.0.0
Have you ever had difficulty naming a NPC in a role playing game? This app solves the NPC naming problem by providing a random name generator based on character class. So if you are in a bind and need to come up with a great name for that NPC...
4 MB  
WYSIWYG Editors  -  JXHTMLEDIT 4.0.005
JXHTMLEDIT is a free Open Source browser-based HTML/XHTML content authoring tool based on the Java 2 Platform that allows WYSIWYG editing on multiple platforms (require the Sun JavaTM Plug-in 1.4 or higher installed on client).JXHTMLEdit is a...
 
Language  -  ZC Trigram Generator 1.01
The ZC Trigram Generator is a free program that generates new words based on a set of input words. It analyzes the input words and uses them to create random words with similar arrangements of letters. These words are typically pronounceable and...
1.37 MB  
Libraries  -  STL AVL Map 1.2
STL AVL Map is an implementation of AVL-tree-based map, multimap, set and multiset containers for g++ 4.1.1. Almost everything on this library is based on libstdc++ v3 code, so the same license applies. The AVL operations (in lib/tree.cpp) were...
102.4 KB  
Libraries  -  HTML::Template::Set 1.009
HTML::Template::Set is a HTML::Template extension that adds set support. SYNOPSIS in your HTML: apples_to_oranges Apples Are Green <...
9.22 KB  
Web Browsers  -  YSlow 3.0.0
YSlow analyzes web pages and suggests ways to improve their performance based on a set of rules for high performance web pages. YSlow is a Firefox add-on integrated with the Firebug web development tool. YSlow grades web page based on one of three...
 
NEW DOWNLOADS IN LINUX SOFTWARE, NETWORK & INTERNET
Linux Software  -  Polling Autodialer Software 3.4
ICTBroadcast Auto Dialer software has a survey campaign for telephone surveys and polls. This auto dialer software automatically dials a list of numbers and asks them a set of questions that they can respond to, by using their telephone keypad....
488 B  
Linux Software  -  Total Video Converter Mac Free 3.5.5
Total Video Converter Mac Free developed by EffectMatrix Ltd is the official legal version of Total Video Converter which was a globally recognized brand since 2006. Total Video Converter Mac Free is a free but powerful all-in-one video...
17.7 MB  
Linux Software  -  Skeith mod_log_sql Analyzer 2.10beta2
Skeith is a php based front end for analyzing logs for Apache using mod_log_sql.
47.5 KB  
Linux Software  -  SLAX 6.0+
Slax is a modern, portable, small and fast Linux operating system with a modular approach and outstanding design. Despite its small size, Slax provides a wide collection of pre-installed software for daily use, including a well organized graphical...
190 KB  
Linux Software  -  GTK+ 2.5
GTK+, which stands for the GIMP Toolkit, is a library for creating graphical user interfaces for the X Window System. It is designed to be small, efficient, and flexible. GTK+ is written in C with a very object-oriented approach. Language bindings...
60 MB  
Network & Internet  -  Free WiFi Hotspot 3.3.1
Free WiFi Hotspot is a super easy solution to turn your laptop or notebook into a portable Wi-Fi hotspot, wirelessly sharing your internet connections like DSL, Cable, Bluetooth, Mobile Broadband Card, Dial-Up, etc. through the built-in wireless...
1.04 MB  
Network & Internet  -  Easy Uploads 1.8
Easy uploads is a file storage media streaming application designed by Filestreamers that allows you to upload, store, and stream your files from their virtually unlimited file storage server. Easy Uploads can backup,share, and stream your files...
615.97 KB  
Network & Internet  -  IPv6 CARE 3.2b
IPv6 CARE, "IPv6 Compliant Automatic Runtime Environment", is a Linux tool able to patch ipv6-agnostic programs on-the-fly ('patch' mode). It can also generate a diagnosis about the IPv6 compliance of an application ('check' mode).
409.6 KB  
Network & Internet  -  PacketFence ZEN 3.1.0
PacketFence is a fully supported, trusted, Free and Open Source network access control (NAC) system. Boosting an impressive feature set including a captive-portal for registration and remediation, centralized wired and wireless management, 802.1X...
1024 MB  
Network & Internet  -  django-dbstorage 1.3
A Django file storage backend for files in the database.
10.24 KB