JP Moresmau's Programming Blog

You might find that at least some of the performance issue you're having are directly because of the DeepSeq analog you've created. Running a recursive seq over a list every time it's passed from one function to another is actually quite slow.

You would probably benefit *heavily* from using strict data types, instead of type aliases for lazy types.

Float's already strict, so you can stick with the same declaration for it. But the other declarations should be something like:

type Value = Float
data Neuron = Neuron !(StrictList Value) !Value
data Network = Network !(StrictList Neuron) !(StrictList Neuron)
data StrictList a = Nil | Cons !a !(StrictList a)

The use of ! in data types is part of haskell 98, so doesn't require an extension. The semantics of it are that when the constructor is evaluated to WHNF, it also evaluates each element with a ! to WHNF.

The advantage to using strict data structures over something like deepseq is that the strictness is applied just once - when the expression resulting in the constructor is first evaluated. After that, no use of seq is required, meaning you can avoid repeated traversals of your data structures looking for values not in WHNF.

Thank you Carl. Insightful comments, but I don't think in my case I'm losing too much: I only run deepSeq once per training iteration, not through each function call. But yes, ensuring the lists are always strict, and not just at regular points in the program, may improve performance. Can I get a strict list with the normal list API (fold, zip, etc) somewhere, or do I have to roll out my own?

Looks like the answer is... Maybe. If you can use the vector library, it's certain to be faster than any sort of list. But I'm not sure if its api is appropriate for what you're doing.

See the documentation on hackage. It doesn't look like there's a strict singly-linked list type on hackage, though. So it will depend a lot on your use pattern.

I agree with Carl albeit more forcefully. Using deepSeq (or equivalents) is like smashing your door in with a sledgehammer because your key sticks in the lock. It gets the door open, but it lacks finesse and has some unpleasant consequences. Usually these types of problems can be solved with a few well placed seqs, often only one. However, it is much easier to design your code not to have these problems in the first place than to try to patch them up after the fact. While there are common patterns of good and bad behavior, usually you can figure out what you should do when defining each function by just giving a little thought to the behavior and usage patterns. Using strict data types, as Carl suggests, is an important way of getting good performance by construction. However, unlike Carl's data types suggests, the appropriate solution is usually not "make everything in sight strict." So for example, what is usually desired is not Carl's StrictList type, but a head strict list, data L a = Nil | Cons !a (L a). In general, you usually want the "spines" of your structures lazy and often the elements strict.

	type Value = Float
	type Neuron= ([Value],Value)
	type Network = ([Neuron],[Neuron])

	class DeepSeq a where
	deepSeq :: a -> a
	deepSeq x=x `seq` x

	instance DeepSeq Value

	instance DeepSeq a => DeepSeq [a] where
	deepSeq []=[]
	deepSeq (x:xs)=deepSeq x `seq` deepSeq xs `seq` (x:xs)

	instance DeepSeq Neuron where
	deepSeq (vals,t)=
	let v2=deepSeq vals
	in v2 `seq` (v2,deepSeq t)

	instance DeepSeq Network where
	deepSeq (is,os)=
	let is2=deepSeq is
	os2=deepSeq os
	in is2 `seq` os2 `seq` (is2, os2)

JP Moresmau's Programming Blog

Friday, September 24, 2010

Haskell Neural Network: plugging a space leak

4 comments:

Friend of Eclipse

About Me