* [Caml-list] Bigarray access speed
@ 2002-08-15 22:50 Richard Nyberg
2002-08-16 10:27 ` malc
0 siblings, 1 reply; 8+ messages in thread
From: Richard Nyberg @ 2002-08-15 22:50 UTC (permalink / raw)
To: caml-list
In the following small programs 3) is faster than 1) and 2), which run
equally fast. However, 4 is significantly slower than the rest. If you
change the line "a.{i} <- a.{i} + i" to "a.{i} <- i" the execution time
is halfed but it's still much slower.
Are access to Bigarrays slower when passed to functions? If so, is it
fixable? or is there some workaround?
I stumbled upon this while coding on a school assignment ((not too ;) fast
multiplication of large integers).
1)
let a = Array.make 1000000 0 in
for i = 0 to 999999 do
a.(i) <- a.(i) + i;
done;;
2)
let a = Array.make 1000000 0 in
let rec loop a i =
if i <= 999999 then begin
a.(i) <- a.(i) + i;
loop a (i + 1)
end in
loop a 0;;
3)
open Bigarray;;
let a = Array1.create int c_layout 1000000 in
Array1.fill a 0;
for i = 0 to 999999 do
a.{i} <- a.{i} + i
done;;
4)
open Bigarray;;
let a = Array1.create int c_layout 1000000 in
Array1.fill a 0;
let rec loop a i =
if i <= 999999 then begin
a.{i} <- a.{i} + i;
loop a (i + 1)
end in
loop a 0;;
-Richard
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Bigarray access speed
2002-08-15 22:50 [Caml-list] Bigarray access speed Richard Nyberg
@ 2002-08-16 10:27 ` malc
2002-08-16 10:40 ` Markus Mottl
` (2 more replies)
0 siblings, 3 replies; 8+ messages in thread
From: malc @ 2002-08-16 10:27 UTC (permalink / raw)
To: Richard Nyberg; +Cc: caml-list
On Fri, 16 Aug 2002, Richard Nyberg wrote:
> In the following small programs 3) is faster than 1) and 2), which run
> equally fast. However, 4 is significantly slower than the rest. If you
> change the line "a.{i} <- a.{i} + i" to "a.{i} <- i" the execution time
> is halfed but it's still much slower.
>
> Are access to Bigarrays slower when passed to functions? If so, is it
> fixable? or is there some workaround?
>
> I stumbled upon this while coding on a school assignment ((not too ;) fast
> multiplication of large integers).
>
> 1)
> let a = Array.make 1000000 0 in
> for i = 0 to 999999 do
> a.(i) <- a.(i) + i;
> done;;
>
> 2)
> let a = Array.make 1000000 0 in
> let rec loop a i =
> if i <= 999999 then begin
> a.(i) <- a.(i) + i;
> loop a (i + 1)
> end in
> loop a 0;;
>
> 3)
> open Bigarray;;
> let a = Array1.create int c_layout 1000000 in
> Array1.fill a 0;
> for i = 0 to 999999 do
> a.{i} <- a.{i} + i
> done;;
>
> 4)
> open Bigarray;;
> let a = Array1.create int c_layout 1000000 in
> Array1.fill a 0;
> let rec loop a i =
> if i <= 999999 then begin
> a.{i} <- a.{i} + i;
> loop a (i + 1)
> end in
> loop a 0;;
http://caml.inria.fr/archives/200110/msg00148.html
An aside(not all facts are cheked):
Bigarrays(of at least floats) can have a slight edge over normals arrays.
To get maximal speed of the inner loops data needs to be naturally
aligned. OCaml does nothing to enforce it for non-big arrays. Bigarrays on
the other hand are mmaped(4k on IA32) and you get perfectly aligned data
for free. I was thinking that maybe Array can be extended with
make[create]_aligned, for speed/space tradeoff.
--
mailto:malc@pulsesoft.com
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Bigarray access speed
2002-08-16 10:27 ` malc
@ 2002-08-16 10:40 ` Markus Mottl
2002-08-16 11:08 ` float array alignment; was " William Chesters
2002-08-16 18:46 ` Richard Nyberg
2 siblings, 0 replies; 8+ messages in thread
From: Markus Mottl @ 2002-08-16 10:40 UTC (permalink / raw)
To: malc; +Cc: Richard Nyberg, caml-list
On Fri, 16 Aug 2002, malc wrote:
> Bigarrays(of at least floats) can have a slight edge over normals arrays.
> To get maximal speed of the inner loops data needs to be naturally
> aligned. OCaml does nothing to enforce it for non-big arrays. Bigarrays on
> the other hand are mmaped(4k on IA32) and you get perfectly aligned data
> for free. I was thinking that maybe Array can be extended with
> make[create]_aligned, for speed/space tradeoff.
Additionally, it would also be nice to have a specialized "create"
function for (naturally unboxed) float arrays such that they need not
be initialized with a given float value. This may be beneficial for
algorithms that allocate work space whose contents is not necessarily
fully needed but filled on demand.
Regards,
Markus Mottl
--
Markus Mottl markus@oefai.at
Austrian Research Institute
for Artificial Intelligence http://www.oefai.at/~markus
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
* float array alignment; was Re: [Caml-list] Bigarray access speed
2002-08-16 10:27 ` malc
2002-08-16 10:40 ` Markus Mottl
@ 2002-08-16 11:08 ` William Chesters
2002-08-19 12:56 ` Xavier Leroy
2002-08-16 18:46 ` Richard Nyberg
2 siblings, 1 reply; 8+ messages in thread
From: William Chesters @ 2002-08-16 11:08 UTC (permalink / raw)
To: caml-list
malc writes:
> To get maximal speed of the inner loops data needs to be naturally
> aligned. OCaml does nothing to enforce it for non-big arrays. Bigarrays on
> the other hand are mmaped(4k on IA32) and you get perfectly aligned data
> for free. I was thinking that maybe Array can be extended with
> make[create]_aligned, for speed/space tradeoff.
I did this once to be able to interface with Fortran libs on Sparc32,
and I still have a patch (against ocaml-2.01) lying around. It was
actually quite thoroughly tested, but it's probably not very tidy. The
main gotcha iirc was getting output_value/input_value to preserve
alignment :).
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: float array alignment; was Re: [Caml-list] Bigarray access speed
2002-08-16 11:08 ` float array alignment; was " William Chesters
@ 2002-08-19 12:56 ` Xavier Leroy
2002-08-19 13:07 ` malc
0 siblings, 1 reply; 8+ messages in thread
From: Xavier Leroy @ 2002-08-19 12:56 UTC (permalink / raw)
To: William Chesters; +Cc: caml-list
malc writes:
> To get maximal speed of the inner loops data needs to be naturally
> aligned. OCaml does nothing to enforce it for non-big arrays. Bigarrays on
> the other hand are mmaped(4k on IA32) and you get perfectly aligned data
> for free. I was thinking that maybe Array can be extended with
> make[create]_aligned, for speed/space tradeoff.
As William Chester said, allocating 8-aligned arrays isn't really
hard, but keeping them 8-aligned across copying collection,
compaction, and structured I/O is quite a pain.
My experiments indicate that the lack of alignment on float arrays
(or more precisely the fact that they are 4-aligned instead of
8-aligned) has negligible impact on performance for the IA32 (Pentium)
and PowerPC processors, but non-negligible for SPARC and MIPS.
And of course on a 64-bit architecture the problem goes away because
everything in the Caml heap is then 8-aligned. Since I expect IA32
and PowerPC to remain dominant until we massively switch to 64-bit
processors, there's no urgent need to do something about float array
alignment.
- Xavier Leroy
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: float array alignment; was Re: [Caml-list] Bigarray access speed
2002-08-19 12:56 ` Xavier Leroy
@ 2002-08-19 13:07 ` malc
0 siblings, 0 replies; 8+ messages in thread
From: malc @ 2002-08-19 13:07 UTC (permalink / raw)
To: Xavier Leroy; +Cc: William Chesters, caml-list
On Mon, 19 Aug 2002, Xavier Leroy wrote:
> malc writes:
> > To get maximal speed of the inner loops data needs to be naturally
> > aligned. OCaml does nothing to enforce it for non-big arrays. Bigarrays on
> > the other hand are mmaped(4k on IA32) and you get perfectly aligned data
> > for free. I was thinking that maybe Array can be extended with
> > make[create]_aligned, for speed/space tradeoff.
>
> As William Chester said, allocating 8-aligned arrays isn't really
> hard, but keeping them 8-aligned across copying collection,
> compaction, and structured I/O is quite a pain.
>
> My experiments indicate that the lack of alignment on float arrays
> (or more precisely the fact that they are 4-aligned instead of
> 8-aligned) has negligible impact on performance for the IA32 (Pentium)
> and PowerPC processors, but non-negligible for SPARC and MIPS.
> And of course on a 64-bit architecture the problem goes away because
> everything in the Caml heap is then 8-aligned. Since I expect IA32
> and PowerPC to remain dominant until we massively switch to 64-bit
> processors, there's no urgent need to do something about float array
> alignment.
IA32 is now much bigger family, and unlucky owners of AMD 7th generation
machines, such as myself, do pay a price for unaligned double precission
float accesses.
--
mailto:malc@pulsesoft.com
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Bigarray access speed
2002-08-16 10:27 ` malc
2002-08-16 10:40 ` Markus Mottl
2002-08-16 11:08 ` float array alignment; was " William Chesters
@ 2002-08-16 18:46 ` Richard Nyberg
2 siblings, 0 replies; 8+ messages in thread
From: Richard Nyberg @ 2002-08-16 18:46 UTC (permalink / raw)
To: malc; +Cc: caml-list
> http://caml.inria.fr/archives/200110/msg00148.html
Yes. That message explained it very well :)
Thanks!
-Richard
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [Caml-list] Bigarray access speed
@ 2002-08-16 6:19 Joe HELL
0 siblings, 0 replies; 8+ messages in thread
From: Joe HELL @ 2002-08-16 6:19 UTC (permalink / raw)
To: rnyberg, caml-list
Bigarray is designed to facilitate operation on large numerical array when
used with external C functions.
Pure caml use of bigarray is generally slower.
_________________________________________________________________
Chat with friends online, try MSN Messenger: http://messenger.msn.com
-------------------
To unsubscribe, mail caml-list-request@inria.fr Archives: http://caml.inria.fr
Bug reports: http://caml.inria.fr/bin/caml-bugs FAQ: http://caml.inria.fr/FAQ/
Beginner's list: http://groups.yahoo.com/group/ocaml_beginners
^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2002-08-19 13:08 UTC | newest]
Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2002-08-15 22:50 [Caml-list] Bigarray access speed Richard Nyberg
2002-08-16 10:27 ` malc
2002-08-16 10:40 ` Markus Mottl
2002-08-16 11:08 ` float array alignment; was " William Chesters
2002-08-19 12:56 ` Xavier Leroy
2002-08-19 13:07 ` malc
2002-08-16 18:46 ` Richard Nyberg
2002-08-16 6:19 Joe HELL
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox