From: Richard Jones <rich@annexia.org>
To: Luca de Alfaro <luca@dealfaro.org>
Cc: caml-list@yquem.inria.fr
Subject: Re: [Caml-list] Fast XML parser
Date: Thu, 19 Jul 2007 12:38:54 +0100 [thread overview]
Message-ID: <20070719113854.GA19281@furbychan.cocan.org> (raw)
In-Reply-To: <28fa90930707181458p26eac6e6y7b45018b7c91ca65@mail.gmail.com>
On Wed, Jul 18, 2007 at 02:58:35PM -0700, Luca de Alfaro wrote:
> I am interested in parsing Wiki markup language that has a few tags, like
> <pre>...</pre>, <math>...,</math>.
> These tags are sparse, meaning that the ratio of number of tags / number of
> bytes is low.
> I would like, given a string (or a stream) with such tags, to parse it as
> fast as possible. Efficiency is a primary consideration, and so is
> simplicity of the implementation.
> Do you have any advice about the library I should be using?
There's some code in COCANWIKI which does exactly this:
http://sandbox.merjis.com/release
Look at the file scripts/lib/wikilib.ml.
It's not a particularly clever implementation, but it has a great deal
of testing in the real world.
As well as <xml>-like syntax it also does a lot of standard wiki
syntax like '* ' for bullet points, paragraphs, indents for
preformatted sections and so on. And it outputs pure unadulterated
XHTML.
Rich.
--
Richard Jones
Red Hat
next prev parent reply other threads:[~2007-07-19 11:38 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-07-18 21:58 Luca de Alfaro
2007-07-18 22:11 ` [Caml-list] " Gabriel Kerneis
2007-07-18 22:48 ` Till Varoquaux
2007-07-19 6:24 ` Gabriel Kerneis
2007-07-19 9:02 ` Till Varoquaux
2007-07-19 11:38 ` Richard Jones [this message]
2007-07-20 7:01 ` Jon Harrop
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20070719113854.GA19281@furbychan.cocan.org \
--to=rich@annexia.org \
--cc=caml-list@yquem.inria.fr \
--cc=luca@dealfaro.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox