From: "Bünzli Daniel" <daniel.buenzli@erratique.ch>
To: caml-list List <caml-list@inria.fr>
Subject: ANNOUNCE: Xmlm 1.0.0
Date: Tue, 18 Mar 2008 00:26:20 +0100 [thread overview]
Message-ID: <B1DD8118-F70B-48DF-9A9F-6645160BFB79@erratique.ch> (raw)
Hello,
A new version of Xmlm is available. It is incompatible with the
previous one. The callback interface was dropped in favour of a
streaming (pull) interface. The tree interface was dropped in
favour of a custom tree construction interface tightly integrated
with the streaming interface. This has the following advantages :
* The input and output processes become symmetric.
* Reclaim your recursion ! There no inversion of control, the client
drives
the parsing process.
* Makes it easy to create tag level parsers and use parser combinator
approaches.
* Allows to use the custom tree api on specific parts of the
document only.
You can get a feel of the new interface by looking at the examples at
http://erratique.ch/software/xmlm/doc/Xmlm#ex
Other significant changes are :
* Namespace support, all names are now expanded names (attributes
pertaining
to namespaces are preserved in the result).
* Whitespace stripping respects the xml:space attribute.
* Xmlm is functorized on string and internal buffer types. Among
other things this can be used to perform hash-consing, to
overcome caml string limitations or to process the character
stream, e.g. to normalize unicode characters or to convert to a
custom encoding.
* UTF-8 documents can start with an UTF-8 encoded BOM.
* The file test/xhtml.ml couples each XHTML character entity with
its corresponding UTF-8 encoded character string. You can use
it to program a suitable entity callback to parse xhtml.
(New) project homepage : http://erratique.ch/software/xmlm
As always, your (possibly negative yet constructive) feedback is
welcome,
Daniel
reply other threads:[~2008-03-17 23:26 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=B1DD8118-F70B-48DF-9A9F-6645160BFB79@erratique.ch \
--to=daniel.buenzli@erratique.ch \
--cc=caml-list@inria.fr \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox