From mboxrd@z Thu Jan 1 00:00:00 1970 Received: (from majordomo@localhost) by pauillac.inria.fr (8.7.6/8.7.3) id LAA18354; Fri, 21 Mar 2003 11:03:13 +0100 (MET) X-Authentication-Warning: pauillac.inria.fr: majordomo set sender to owner-caml-list@pauillac.inria.fr using -f Received: from concorde.inria.fr (concorde.inria.fr [192.93.2.39]) by pauillac.inria.fr (8.7.6/8.7.3) with ESMTP id LAA18924 for ; Fri, 21 Mar 2003 11:03:12 +0100 (MET) Received: from mbg.sphere.ne.jp (mbg.sphere.ne.jp [210.150.254.179]) by concorde.inria.fr (8.11.1/8.11.1) with ESMTP id h2LA3Af02329 for ; Fri, 21 Mar 2003 11:03:11 +0100 (MET) Received: from localhost (pl159.nas926.o-tokyo.nttpc.ne.jp [210.165.109.159]) by mbg.sphere.ne.jp (Postfix) with ESMTP id D022A3A8F8 for ; Fri, 21 Mar 2003 19:03:08 +0900 (JST) Date: Fri, 21 Mar 2003 18:54:52 +0900 (JST) Message-Id: <20030321.185452.100218459.yoriyuki@mbg.sphere.ne.jp> To: caml-list@inria.fr Subject: [Caml-list] camomile-0.3.0 is relased. From: Yamagata Yoriyuki X-Mailer: Mew version 2.2 on Emacs 21.2 / Mule 5.0 (SAKAKI) Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit X-Spam: no; 0.00; yamagata:01 yoriyuki:01 sphere:01 camomile:01 implemented:01 api:01 functor:01 encodings:01 mappings:01 locale:01 collation:01 functors:01 prdownloads:01 sourceforge:01 normaliser:01 Sender: owner-caml-list@pauillac.inria.fr Precedence: bulk camomile-0.3.0 is released. camomile is a Unicode library for ocaml. Currently camomile provides * Unicode data types. (Base, UTF8) the Unicode character type and several Unicode string implementations. (abstract type internally implemented as int array, resizable character array, and UTF8 encoded normal string.) Most string API of camomile is a functor over string implementation. Also it should be easy to add new implementation as OS wch. * Character info. (UCharInfo) * Code converter. (CharEncodings) about 200 encodings are supported. Conversion is either string to string, Unicode string to/from string, OO-channel, and Stream. * Unicode normalforms, NFD NFKD NFC NFKC. (Normalform) Due to the presence of combined characters such as accents, Unicode strings have several equivalent representations. Unicode standard defines several "normal forms" for such cases. This module provides conversion to such normal forms. * Case mappings (CaseMap) Internationalised case mapping. German eszet, Greek sigma, Turkish i (for Turkish locale) are correctly handled. (Azeri and Lithuanian supports are implemented but not tested) Behaviour of case mapping can be adjusted by locale. * String Comparison (UCol) Behaviour of comparison also can be adjusted by locale. By default, it uses Unicode collation algorithm. (I believe it is the same as Java.) Changes from 0.2.x are * Functor design: OO-design is abandoned. API taking unicode strings now becomes functors over unicode string implementation. * The locale can be specified for case mapping and string comparison. Still quality of the code is alpha, I think. Download : http://prdownloads.sourceforge.net/camomile/camomile-0.3.0.tar.bz2 Homepage : http://camomile.sourceforge.net camomile comes with a test suite, which includes, among other tests, conformance tests of Unicode standard for collator and normaliser. test suite : http://prdownloads.sourceforge.net/camomile/blender-0.3.0.tar.bz2 -- Yamagata Yoriyuki ------------------- To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/ Beginner's list: http://groups.yahoo.com/group/ocaml_beginners