Mailing list for all users of the OCaml language and system.
 help / color / mirror / Atom feed
From: Mike Lin <nilekim@gmail.com>
To: caml-list@inria.fr
Subject: Re: [Caml-list] Parallelized parsing
Date: Mon, 20 Apr 2009 17:35:32 -0400	[thread overview]
Message-ID: <2a1a1a0c0904201435y12e36603t60fb40fd1a7d8260@mail.gmail.com> (raw)
In-Reply-To: <200904202215.27735.jon@ffconsultancy.com>

There is certainly a reasonable body of basic CS research on
parallelizing CFG algorithms such as CYK, the Earley parser, and to a
lesser extent the more practical LALR strategy used by yacc etc. (In
the latter case it seems to get easier if you're willing to trade off
determinism when parsing ambiguous grammars.)

I know some people who use some of this stuff in very specific
contexts (RNA folding), but I haven't seen any practical
general-purpose tools like a parallel yacc...

Overall, I don't actually know much more than you could figure out
from Google Scholar in an hour but hopefully these were some useful
search terms.

On Mon, Apr 20, 2009 at 5:15 PM, Jon Harrop <jon@ffconsultancy.com> wrote:
>
> I'm desperately trying to prepare for the imminent drop of a rock-solid
> multicore-friendly OCaml implementation and was wondering what work has been
> done on parallelized parsers and/or parallel-friendly grammars?
>
> For example, Mathematica syntax for nested lists of integers looks like:
>
>  {{{1, 2}}, {{3, 4}, {4, 5}}, ..}
>
> and there are obvious divide-and-conquer approaches to lexing and parsing that
> grammar. You can recursively subdivide the string (e.g. memory mapped from a
> file) to build a tree of where the tokens { , and } appear by index and then
> recursively convert the tree into an AST.
>
> What other grammars can be lexed and/or parsed efficiently in parallel?
>
> --
> Dr Jon Harrop, Flying Frog Consultancy Ltd.
> http://www.ffconsultancy.com/?e
>
> _______________________________________________
> Caml-list mailing list. Subscription management:
> http://yquem.inria.fr/cgi-bin/mailman/listinfo/caml-list
> Archives: http://caml.inria.fr
> Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
> Bug reports: http://caml.inria.fr/bin/caml-bugs
>


  reply	other threads:[~2009-04-20 21:35 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-04-20 21:15 Jon Harrop
2009-04-20 21:35 ` Mike Lin [this message]
2009-04-21  0:52   ` [Caml-list] " Yitzhak Mandelbaum
2009-04-21 15:55     ` Jon Harrop
2009-04-21  1:44 ` Polymorphism problem Eliot Handelman
2009-04-21  8:50   ` [Caml-list] " Mauricio Fernandez
2009-04-21  7:19 ` [Caml-list] Parallelized parsing David MENTRE
2009-04-21 16:04   ` Jon Harrop

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=2a1a1a0c0904201435y12e36603t60fb40fd1a7d8260@mail.gmail.com \
    --to=nilekim@gmail.com \
    --cc=caml-list@inria.fr \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox