From: Gerd Stolpmann <info@gerd-stolpmann.de>
To: Jon Harrop <jon@ffconsultancy.com>
Cc: Caml List <caml-list@inria.fr>
Subject: Re: [Caml-list] Parallel n-queens solver
Date: Mon, 25 Apr 2011 12:36:08 +0200 [thread overview]
Message-ID: <1303727768.3782.66.camel@thinkpad> (raw)
In-Reply-To: <013e01cc02d7$e23b8f00$a6b2ad00$@ffconsultancy.com>
Am Montag, den 25.04.2011, 00:32 +0100 schrieb Jon Harrop:
> Gerd recently posted this article about parallelizing an n-queens solver in
> OCaml:
>
> http://blog.camlcity.org/blog/multicore3.html
>
> The idea is to reuse the same basic definitions and parallelize the program
> simply by changing the "run" function. I have tried to translate this OCaml
> to F# without benefit of being able to run the original OCaml code but I
> have checked the results against the known solutions to this problem.
>
> Here is Gerd's sequential OCaml:
>
> module Sequential = struct
> let run n =
> let t0 = Unix.gettimeofday() in
> let ht = Hashtbl.create 91 in
> for k = 0 to n-1 do
> solve k n
> (fun b ->
> if not (Hashtbl.mem ht b) then (
> let b = Array.copy b in
> List.iter
> (fun b' ->
> Hashtbl.add ht b' ()
> )
> (transformations b);
> print b
> )
> )
> done;
> let t1 = Unix.gettimeofday() in
> printf "Number solutions: %n\n%!" (Hashtbl.length ht / 8);
> printf "Time: %.3f\n%!" (t1-.t0)
> end
>
> My equivalent F#:
>
> module Sequential =
> let run n =
> let timer = System.Diagnostics.Stopwatch.StartNew()
> let solve k =
> let sols = ResizeArray()
> solve k n (transformations >> Seq.min >> sols.Add)
> sols.ToArray()
> let sols =
> Array.init n solve
> |> Seq.concat
> |> Seq.distinct
> |> Seq.length
> printfn "Number solutions: %n" sols
> printfn "Time: %.3f" timer.Elapsed.TotalSeconds
>
> Gerd's sequential OCaml run on his 4-core Opteron and my sequential F# run
> on a 4-core W3520 have very similar performance characteristics.
>
> Now for parallelization. Gerd's fastest parallel OCaml solution (aka MP2) is
> 287 lines long. In contrast, the F# can be parallelized by adding just 12
> characters to the source code of the sequential implementation:
>
> module Parallel =
> let run n =
> let timer = System.Diagnostics.Stopwatch.StartNew()
> let solve k =
> let sols = ResizeArray()
> solve k n (transformations >> Seq.min >> sols.Add)
> sols.ToArray()
> let sols =
> Array.Parallel.init n solve
> |> PSeq.concat
> |> PSeq.distinct
> |> PSeq.length
> printfn "Number solutions: %n" sols
> printfn "Time: %.3f" timer.Elapsed.TotalSeconds
>
> I just replaced Array.init with Array.Parallel.init and Seq with PSeq.
>
> Moreover, this trivially parallelized F# implementation also achieves
> performance on this machine comparable to Gerd's parallel OCaml on his
> machine. In particular, the absolute performance results for my parallel F#
> are better in every single case. However, it is not clear how scalable these
> parallel solutions are. On an 8-core E5405 Xeon I get only 5x speedup
> compared to 3.8x speedup on 4 cores.
>
> There can be little doubt that this solution is vastly more readable and
> maintainable though.
Well admitted. Netmulticore is an add-on to an existing compiler and
runtime, which explains a lot of the additional verbosity. Also, it is
right now focusing on the core mechanisms for parallelization, so there
is nothing like a parallel array initializer. Actually, I'm quite happy
that a quite complex approach like MP2 (using 3 processes communicating
with each other, and a bit over-engineered) can be worked out in only
287 lines.
Without having checked in detail, I'm quite sure a number of parallel
design patterns can be supported by higher-level constructs. Because of
the "fork-style" process creation, not everything is well-suited for
this, e.g. a parallel Array.init makes only sense when the array lives
in shared memory (which exists as Netmcore_array). In some sense, the
challenge is how well three ideas can be brought together:
- fork-style multi-processing
- value heaps in shared memory that support mutation
- the right mix of functional and imperative programming that is best
for the problem to solve
Generally, I think FP data structures are easier to support in this
context, e.g. with parallel list or stream operations (imagine
parallelized map and filter operations). Let's see how this evolves -
Netmulticore is at its beginning, and there is still a lot of room for
improvement.
Gerd
>
> --
> Dr Jon Harrop, Flying Frog Consultancy Ltd.
> http://www.ffconsultancy.com
>
>
--
------------------------------------------------------------
Gerd Stolpmann, Bad Nauheimer Str.3, 64289 Darmstadt,Germany
gerd@gerd-stolpmann.de http://www.gerd-stolpmann.de
Phone: +49-6151-153855 Fax: +49-6151-997714
------------------------------------------------------------
next prev parent reply other threads:[~2011-04-25 10:36 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-04-24 23:32 Jon Harrop
2011-04-25 10:36 ` Gerd Stolpmann [this message]
2011-04-25 16:36 ` Frédéric Gava
2011-04-25 19:30 ` Jon Harrop
2011-04-25 23:33 ` Eray Ozkural
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1303727768.3782.66.camel@thinkpad \
--to=info@gerd-stolpmann.de \
--cc=caml-list@inria.fr \
--cc=jon@ffconsultancy.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox