Encode: Encoding character data
The Encode library provides a unified interface for
converting strings from different encodings into a common
representation, and vice versa. This representation is
isomorphic to the Unicode character set, and the encodings
can be both standard and user-defined. For this purpose,
the Encode module defines the Encode.UPoint
data type
and the Encode.Encoding
type class with the encode
and
decode
methods.
The Encode library is being proposed as a Haskell analogy to the Encode extension in Perl, http://search.cpan.org/dist/Encode/.
The Exec.Encode.Main and Exec.Decode.Main programs
mimick the fuction calls to encode
and decode
,
respectively, with the following usage of the executables:
decode ArabTeX < decode.d | encode Buckwalter > encode.d decode MacArabic < data.MacArabic > data.UTF8 encode WinArabic < data.UTF8 > data.WinArabic
The installation instructions are given in INSTALL. For the list of supported encoding schemes, please refer to Encode and the source files of the programs. With the executables, the naming of encodings is case-insensitive and some further options are allowed:
encode|decode [--OPTIONS] [ENCODING] -h --help show usage information -l --lines use line-oriented mode -p text --prefix=text prefix input with text -s text --suffix=text suffix input with text -v --version show program's version
The PureFP library is an edited excerpt from the Functional Parsing library developed by Peter Ljunglöf in his licenciate thesis /Pure Functional Parsing an advanced tutorial/, Göteborg University and Chalmers University of Technology, April 2002, http://www.ling.gu.se/~peb/software.html.
The Parsek library is included for experimental purposes and subsumes PureFP.Parsers.Stream. Parsek was developed by Koen Claessen in his functional pearl article /Parallel Parsing Processes/, Journal of Functional Programming, 14(6), 741757, Cambridge University Press, 2004, http://www.cs.chalmers.se/~koen/pubs/entry-jfp04-parser.html
The Version library is just a simple support for working with the CVS/SVN revision keyword.
This software is published under the GNU General Public License. Only the Encode.hs and Version.hs files are instead subject to the Revised BSD License. Note the copyright and license details in the headers of the files, and see LICENSE, LicenseBSD and LicenseGPL distributed with this package.
Downloads
- Encode-1.3.1.tar.gz [browse] (Cabal source package)
- Package description (as included in the package)
Maintainer's Corner
For package maintainers and hackage trustees
Candidates
- No Candidates
Versions [RSS] | 0.7, 1.1, 1.2, 1.3.1, 1.3.2, 1.3.3, 1.3.4, 1.3.5, 1.3.6, 1.3.7, 1.3.8 |
---|---|
Dependencies | base (<=4.2), Cabal, mtl [details] |
License | LicenseRef-GPL |
Copyright | 2010 |
Author | Otakar Smrz |
Maintainer | otakar.smrz mff.cuni.cz |
Category | Text, Codec |
Home page | http://ufal.mff.cuni.cz/~smrz/ |
Uploaded | by OtakarSmrz at 2010-02-13T17:28:10Z |
Distributions | |
Reverse Dependencies | 2 direct, 0 indirect [details] |
Executables | decode, encode |
Downloads | 10230 total (12 in the last 30 days) |
Rating | (no votes yet) [estimated by Bayesian average] |
Your Rating | |
Status | Docs uploaded by user Build status unknown [no reports yet] |