Hi,
On Mon, Oct 19, 2009 at 7:59 AM, harto <[email protected]>wrote:
> I've just started learning Clojure, so I'm trying to figure out the
> correct way of doing things. I've been trying to create and 'modify' a
> large vector for an online programming exercise, but I'm running into
> some performance issues.
>
> Any general tips would be appreciated!
>
> Firstly, I'm creating a vector of booleans like this:
>
> (defn vector-of [n value]
> (vec (for [_ (range n)] value)))
>
> It takes quite a long time for large values of n, though:
>
> user=> (time (dorun (vector-of 1e7 true)))
> "Elapsed time: 6734.509528 msecs"
>
You don't need dorun here: dorun is sometimes used in benchmarks to force
the computation of lazy-seqs but here you are using a vector, not a lazy
seq. So calling dorun on it causes the allocation of a seq of 1e7 items
which takes time. If you used dorun to prevent the printing of the vector at
the REPL, just di (tcime (count (vector-of 1e7 true))). See:
user=> (time (dorun (vector-of 1e7 nil)))
"Elapsed time: 4155.616985 msecs"
nil
user=> (time (count (vector-of 1e7 nil)))
"Elapsed time: 2384.609217 msecs"
10000000
If you study your function:
(defn vector-of [n value]
(vec (for [_ (range n)] value)))
You'll see that to create a vector of n items you use two intermediate seqs
of size n items (range and the result of for).
If you get rid of some allocations, you'll get huge speed-ups.
(defn vector-of [n value]
(loop [n (int n) v []]
(if (zero? n)
v
(recur (dec n) (conj v value)))))
user=> (time (count (vector-of 1e7 nil)))
"Elapsed time: 1170.371793 msecs"
10000000
Since you don't care about intermediate values of v, you can use transients:
(defn vector-of [n value]
(loop [n (int n) v (transient [])]
(if (zero? n)
(persistent! v)
(recur (dec n) (conj! v value)))))
user=> (time (count (vector-of 1e7 nil)))
"Elapsed time: 228.272239 msecs"
10000000
> Secondly, I'm iterating across one of these large vectors using
> something like the following (contrived) function:
>
> (defn set-flags [v]
> (loop [i 0
> v v]
> (if (< i (count v))
> (recur (inc i) (assoc v i false))
> v)))
>
> user=> (let [v (vector-of 1e7 true)] (time (dorun (set-flags v))))
> "Elapsed time: 15563.916114 msecs"
>
>
For reference (using my fastest vector-of):
user=> (let [v (vector-of 1e7 true)] (time (count (set-flags v))))
"Elapsed time: 5653.562953 msecs"
10000000
Here too you don't care about intermediate values of v:
(defn set-flags [v]
(loop [i 0 v (transient v)]
(if (< i (count v))
(recur (inc i) (assoc! v i false))
(persistent! v))))
user=> (let [v (vector-of 1e7 true)] (time (count (set-flags v))))
"Elapsed time: 1423.473584 msecs"
10000000
And if you store (count n) in a local:
(defn set-flags [v]
(let [n (count v)]
(loop [i 0 v (transient v)]
(if (< i n)
(recur (inc i) (assoc! v i false))
(persistent! v)))))
user=> (let [v (vector-of 1e7 true)] (time (count (set-flags v))))
"Elapsed time: 931.911668 msecs"
10000000
hth,
Christophe
--
Professional: http://cgrand.net/ (fr)
On Clojure: http://clj-me.blogspot.com/ (en)
--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google
Groups "Clojure" group.
To post to this group, send email to [email protected]
Note that posts from new members are moderated - please be patient with your
first post.
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/clojure?hl=en
-~----------~----~----~----~------~----~------~--~---