List manipulation performance in Haskell

Question

I am currently learning Haskell and I am curious about the following:

If I add an element to a list in Haskell, Haskell returns a (completely?) new list, and doesn't manipulate the original one.

Now let's say I have a list of a million elements and I append one element at the end. Does Haskell "copy" the whole list (1 million elements) and adds the element to that copy? Or is there a neat "trick" going on behind the scenes to avoid copying the whole list?

And if there isn't a "trick", is the process of copying large lists not as expensive as I think it is?

Luis Casillas · Answer 1 · 2015-05-15T22:38:47.973

This is a surprisingly complex question, because of two features of Haskell and GHC:

Lazy evaluation
List fusion

List fusion means that in some situations, GHC can rewrite list processing code into a loop that doesn't allocate list cells. So depending on the context where it is used, the same code could incur no additional cost.

Lazy evaluation means that if the result of an operation is not consumed, then you don't pay the cost of computing it. So for example, this is cheap, because you only have to construct the first ten elements of the list:

example = take 10 ([1..1000000] ++ [1000001])

In fact, in that code the take 10 can fuse with the list append, so it's the same as just [1..10].

But let's just assume that we're consuming all of the elements of all of the lists that we make, and that the compiler isn't fusing our list operations. Now to your questions:

If I add an element to a List in Haskell, Haskell returns a (completly?) new list, and doesn't manipulate the original one. Now let's say I have a List of a million elements and I append one element at the end. Does Haskell "copy" the whole list (1 million elements) and adds the element to that copy? Or is there a neat "trick" going on behind the scenes to avoid copying the whole list?

There exist tricks to avoid copying the whole list, but by appending to its end you defeat them. The thing to understand is that functional data structures are normally designed so that operations that "modify" them will exploit structure-sharing to reuse as much of the old structure as possible. So for example, appending two lists can be defined like this:

(++) :: [a] -> [a] -> [a]
[] ++ ys = ys
(x:xs) ++ ys = x : xs ++ ys

Looking at this definition, you can tell that the list ys will be reused in the result. So if we have xs = [1..3], ys = [4..5] and xs ++ ys, all fully evaluated and retained in memory at once, it will look something like this memory-wise:

           +---+---+    +---+---+    +---+---+
      xs = | 1 | -----> | 2 | -----> | 3 | -----> []
           +---+---+    +---+---+    +---+---+

           +---+---+    +---+---+ 
      ys = | 4 | -----> | 5 | -----> []
           +---+---+    +---+---+    
             ^
             |
             +------------------------------------+
                                                  |
           +---+---+    +---+---+    +---+---+    |
xs ++ ys = | 1 | -----> | 2 | -----> | 3 | -------+
           +---+---+    +---+---+    +---+---+

That is the long way of saying this: if you do xs ++ ys, and it doesn't fuse, and you consume the whole list, then that will create a copy of xs but reuse the memory for ys.

But now let's look again at this bit of your question:

Now let's say I have a List of a million elements and I append one element at the end. Does Haskell "copy" the whole list (1 million elements) and adds the element to that copy?

That would be something like [1..1000000] ++ [1000001], and yes, it would copy the whole million elements. But on the other hand, [0] ++ [1..1000000] would only copy the [0]. The rule of thumb is this:

Adding elements at the beginning of a list is most efficient.
Adding elements at the end of a list is often inefficient, particularly if you do it over and over.

The general solutions to this sort of problem are:

Modify your algorithm so that you use lists in an access pattern they support efficiently.
Don't use lists; use some other sequence data structure that efficiently supports the access pattern you need for the problem at hand. Another answer mentioned difference lists, but others worth mentioning are:

Nice! I didn't know about structure sharing. – Robin May 15 '15 at 21:54 — Robin, May 15 '15 at 21:54

score 13 · Accepted Answer · edited Jun 22 '20 at 07:24

It depends on the data structure you're using. If you're using normal Haskell lists, these would be analogous to a typical linked list implementation in C or C++. With this structure, appends and indexing (worst-case) are O(n) complexity, while prepends are O(1) complexity. If you are appending frequently and your list is growing linearly this will effectively be O(n^2). For large lists this is a problem. This is regardless of what language you're using, Haskell, C, C++, Python, Java, C#, or even Assembler.

However, if you were to use a structure like Data.Sequence.Seq, then it uses the proper structure internally to provide O(1) prepends and appends, but the cost is that it can take up a bit more RAM. All data structures have tradeoffs, though, it's up to you which one you want to use.

Alternatively, you can also use Data.Vector.Vector or Data.Array.Array, which both provide fixed-length, contiguous memory arrays, but appending and prepending is expensive because you have to copy the entire array to a new location in RAM. Indexing is O(1), though, and mapping or folding over one of these structures would be much faster because chunks of the array can fit into your CPU cache at a time, as opposed to linked lists or sequences that have elements scattered all over your RAM.

Does Haskell "copy" the whole list (1 million elements) and adds the element to that copy?

Not necessarily, the compiler can determine if it's safe to just have the last value's next pointer change to point at the new value instead of the empty list, or if it's unsafe it may be necessary to copy the entire list. These problems are inherent to the data structure, not the language, though. In general, I would say that Haskell's lists are better than C linked lists because the compiler is more capable of analyzing when this is safe than a programmer is, and C compiler won't do this sort of analysis, they just do exactly as they're told.

I agree to what you say but the your Big O notation is not correct. O(500000500000) == O(1) == constant time (see http://en.wikipedia.org/wiki/Big_O_notation#Multiplication_by_a_constant ). Sure, you can argue that if you try to "append a million elements" then it always runs in O(1) as there's no variable left and the operation "append a million times" does indeed run in constant time. But I don't think that's what you want to say. — Johannes Weiss, May 15 '15 at 14:16
I know you didn't exactly claim otherwise, but GHC never changes the value of an existing heap object in the way you describe in the last paragraph. (What GHC is fairly good at is avoiding constructing the intermediate value on the heap in the first place.) — Reid Barton, May 15 '15 at 16:16
Hey @ReidBarton could you tell me more (or post a resource) about how GHC avoids constructing the intermediate value on the heap? — Robin, May 15 '15 at 18:52
@jackrandom, it's a complicated subject. Look for information on inlining, unboxing and list fusion. — Reid Barton, May 15 '15 at 23:33

score 3 · Answer 3 · answered May 15 '15 at 14:08

When using lists, appending is expensive and the list has to be copied, though not the elements. Also, prepending is cheap as the new value is just pointing to the original list.

Take appending "third" to ["first", "second"]: the new list is (:) "first" ((:) "second" ((:) "third" [])). Thus the first constructor must be a new one as the second argument must be a new value as the ... The strings are not duplicated though. The new list points to the same strings in memory.

Note that in the case where the old value is discarded, the compiler might decide to reuse it instead of allocating memory for new values and garbage collecting the old ones. In any case, appending will be done in O(n) as it needs to find the end of it.

Now if your program is appending a lot to lists, you might want to use a different data structures to be able to append in O(1) such as DList form the package dlist. (https://hackage.haskell.org/package/dlist-0.5/docs/Data-DList.html)

the appendings are not the problem. nothing precludes lists being implemented with their elements stored in a big pre-allocated array, plus `start` and `end` position. both `xs` and `xs ++ [a]` can use the same array. even prependings are not a problem if we start in the middle, or use lists (/ arrays) of (pointers to) array blocks. it is the *insertions* that are problematic. `case xs of (a:as) ...` would just create `as = (start+1,end,array)` from `xs = (start,end,array)`, behind the scenes. — Will Ness, Mar 08 '18 at 10:52

List manipulation performance in Haskell

3 Answers3