| Date: | -- (:) |
| From: | Hal Daume III <hdaume@I...> |
| Subject: | bigarrays much slower than normal ones |
I've been hitting the limiting size of normal float arrays and was having
a look at the Bigarray module. Unfortunately, it seems roughly 3-4 times
*slower* than the standard array, which is pretty much unacceptable for
me. Am I doing something naively wrong, or are the Bigarrays truly this
slow? The timing results I get (i686, redhat) are along the lines of:
stdarray, safe:
12.000u 0.030s 0:12.18 98.7% 0+0k 0+0io 107pf+0w
12.060u 0.030s 0:12.22 98.9% 0+0k 0+0io 107pf+0w
stdarray, unsafe:
11.990u 0.070s 0:12.21 98.7% 0+0k 0+0io 107pf+0w
12.130u 0.040s 0:12.31 98.8% 0+0k 0+0io 107pf+0w
bigarray, 64 bit:
39.760u 0.040s 0:40.35 98.6% 0+0k 0+0io 110pf+0w
39.750u 0.030s 0:40.09 99.2% 0+0k 0+0io 110pf+0w
bigarray, 32 bit:
41.950u 0.050s 0:42.60 98.5% 0+0k 0+0io 110pf+0w
42.070u 0.040s 0:42.53 99.0% 0+0k 0+0io 110pf+0w
("safe" means compiled normally and "unsafe" means compiled with -unsafe;
"64 bit" vs. "32 bit" is the 'kind' used for the bigarrays.)
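To make the 'kind' distinction concrete, here is a minimal sketch (not the benchmark itself). The names a64 and a32 and the tiny array size are made up for illustration. As far as I understand the Bigarray documentation, float32 elements are stored in single precision and converted to and from OCaml's double-precision float on every access, which shows up as rounding:

```ocaml
open Bigarray

(* Hypothetical names a64/a32; size 4 is chosen only for illustration. *)
let a64 = Array1.create float64 c_layout 4  (* elements stored as 64-bit doubles *)
let a32 = Array1.create float32 c_layout 4  (* elements stored as 32-bit floats *)

let () =
  Array1.set a64 0 0.1;
  Array1.set a32 0 0.1;
  (* The float64 element round-trips exactly; the float32 element is rounded
     to single precision on store and widened back to a double on load. *)
  Printf.printf "float64 round-trip exact: %b\n" (Array1.get a64 0 = 0.1);
  Printf.printf "float32 round-trip exact: %b\n" (Array1.get a32 0 = 0.1)
```

Apart from the kind passed to Array1.create, the two variants are used identically, which is how the 64 bit and 32 bit timings above were produced from the same program.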
I'm also really shocked that the 32 bit float bigarrays are slower than
the 64 bit ones!
Can someone explain this to me?
The code is:
<standard array>
open Array

let normalize a =
  let s = fold_left (+.) 0. a in
  for i = 0 to length a - 1 do
    a.(i) <- a.(i) /. s;
  done;
  ()

let _ =
  let a = make 1000000 0. in
  for iter = 1 to 100 do
    for i = 0 to 999999 do
      let i' = float_of_int i in
      a.(i) <- log (0.01 *. i' *. i' +. 3. *. i' +. 4.);
    done;
    normalize a;
  done;
  ()
<big array>
open Bigarray

let normalize a =
  let _N = Array1.dim a in
  let rec sum n acc =
    if n >= _N then acc
    else sum (n+1) (acc +. Array1.get a n) in
  let s = sum 0 0. in
  for i = 0 to _N - 1 do
    Array1.set a i (Array1.get a i /. s);
  done;
  ()

let _ =
  let a = Array1.create float32 c_layout 1000000 in
  for iter = 1 to 100 do
    for i = 0 to 999999 do
      let i' = float_of_int i in
      Array1.set a i (log (0.01 *. i' *. i' +. 3. *. i' +. 4.));
    done;
    normalize a;
  done;
  ()
If you put the array allocation inside the iter loop, the relative
timings barely change.
- Hal
--
Hal Daume III | hdaume@isi.edu
"Arrest this man, he talks in maths." | www.isi.edu/~hdaume