Python: Finding differences between elements of a list

Question

Given a list of numbers, how does one find differences between every (i)-th elements and its (i+1)-th?

Is it better to use a lambda expression or maybe a list comprehension?

For example:

Given a list t=[1,3,6,...], the goal is to find a list v=[2,3,...] because 3-1=2, 6-3=3, etc.

score 196 · Accepted Answer · answered Mar 08 '10 at 11:23

196

>>> t
[1, 3, 6]
>>> [j-i for i, j in zip(t[:-1], t[1:])]  # or use itertools.izip in py2k
[2, 3]

answered Mar 08 '10 at 11:23

SilentGhost

307,395
66
306
293

17

In case you need absolute differences, `[abs(j-i) for i,j in zip(t, t[1:])]` – Anil Jul 14 '15 at 00:09
In case you want to make it more efficient: `list(itertools.starmap(operator.sub, zip(t[1:], t)))` (after importing `itertools` and `operator`). – blhsing Jun 24 '18 at 18:31
7

Actually simply `list(map(operator.sub, t[1:], t[:-1]))` will do. – blhsing Aug 06 '18 at 15:24

score 148 · Answer 2 · answered Mar 08 '10 at 14:26

148

The other answers are correct but if you're doing numerical work, you might want to consider numpy. Using numpy, the answer is:

v = numpy.diff(t)

answered Mar 08 '10 at 14:26

Christian Alis

6,556
5
31
29

1

Very helpful! Thanks! `np.diff([2,4,9])` would be `[2,5]` – TravelTrader Oct 07 '19 at 20:38
4

Would this be more efficient than the `zip` version? – user760900 Jul 28 '20 at 23:29

Omer Dagan · Answer 3 · 2019-01-10T08:48:02.523

43

If you don't want to use numpy nor zip, you can use the following solution:

>>> t = [1, 3, 6]
>>> v = [t[i+1]-t[i] for i in range(len(t)-1)]
>>> v
[2, 3]

edited Jan 10 '19 at 08:48

answered May 23 '13 at 12:51

Omer Dagan

14,868
16
44
60

Xavier Guihot · Answer 4 · 2021-11-13T13:13:16.583

21

Starting in Python 3.10, with the new pairwise function it's possible to slide through pairs of elements and thus map on rolling pairs:

from itertools import pairwise

[y-x for (x, y) in pairwise([1, 3, 6, 7])]
# [2, 3, 1]

The intermediate result being:

pairwise([1, 3, 6, 7])
# [(1, 3), (3, 6), (6, 7)]

edited Nov 13 '21 at 13:13

answered Dec 07 '20 at 23:12

Xavier Guihot

54,987
21
291
190

Lol, I've had something similar in my common functions for ages now. I called my implementation "chunker", but pairwise has a nice sound to it. Very useful. – SurpriseDog Jun 27 '21 at 21:55

Bakuriu · Answer 5 · 2018-05-14T16:52:49.160

You can use itertools.tee and zip to efficiently build the result:

from itertools import tee
# python2 only:
#from itertools import izip as zip

def differences(seq):
    iterable, copied = tee(seq)
    next(copied)
    for x, y in zip(iterable, copied):
        yield y - x

Or using itertools.islice instead:

from itertools import islice

def differences(seq):
    nexts = islice(seq, 1, None)
    for x, y in zip(seq, nexts):
        yield y - x

You can also avoid using the itertools module:

def differences(seq):
    iterable = iter(seq)
    prev = next(iterable)
    for element in iterable:
        yield element - prev
        prev = element

All these solution work in constant space if you don't need to store all the results and support infinite iterables.

Here are some micro-benchmarks of the solutions:

In [12]: L = range(10**6)

In [13]: from collections import deque
In [15]: %timeit deque(differences_tee(L), maxlen=0)
10 loops, best of 3: 122 ms per loop

In [16]: %timeit deque(differences_islice(L), maxlen=0)
10 loops, best of 3: 127 ms per loop

In [17]: %timeit deque(differences_no_it(L), maxlen=0)
10 loops, best of 3: 89.9 ms per loop

And the other proposed solutions:

In [18]: %timeit [x[1] - x[0] for x in zip(L[1:], L)]
10 loops, best of 3: 163 ms per loop

In [19]: %timeit [L[i+1]-L[i] for i in range(len(L)-1)]
1 loops, best of 3: 395 ms per loop

In [20]: import numpy as np

In [21]: %timeit np.diff(L)
1 loops, best of 3: 479 ms per loop

In [35]: %%timeit
    ...: res = []
    ...: for i in range(len(L) - 1):
    ...:     res.append(L[i+1] - L[i])
    ...: 
1 loops, best of 3: 234 ms per loop

Note that:

zip(L[1:], L) is equivalent to zip(L[1:], L[:-1]) since zip already terminates on the shortest input, however it avoids a whole copy of L.
Accessing the single elements by index is very slow because every index access is a method call in python
numpy.diff is slow because it has to first convert the list to a ndarray. Obviously if you start with an ndarray it will be much faster:
```
In [22]: arr = np.array(L)

In [23]: %timeit np.diff(arr)
100 loops, best of 3: 3.02 ms per loop
```

in the second solution, `islice(seq, 1, None)` instead of `islice(seq, 1, len(seq))` makes it work with infinite iterables — Braham Snyder, May 12 '18 at 17:12

score 8 · Answer 6 · edited Apr 23 '20 at 08:39

8

I would suggest using

v = np.diff(t)

this is simple and easy to read.

But if you want v to have the same length as t then

v = np.diff([t[0]] + t) # for python 3.x

or

v = np.diff(t + [t[-1]])

FYI: this will only work for lists.

for numpy arrays

v = np.diff(np.append(t[0], t))

edited Apr 23 '20 at 08:39

ClimateUnboxed

7,106
3
41
86

answered Jun 05 '19 at 17:59

Chaitanya Kesanapalli

163
1
8

1

nice answer, you can also use the prepend keyword though to ensure the same length, see answer below, which I think is a little neater – ClimateUnboxed Apr 23 '20 at 08:41

score 6 · Answer 7 · answered Nov 30 '19 at 16:40

6

Using the := walrus operator available in Python 3.8+:

>>> t = [1, 3, 6]
>>> prev = t[0]; [-prev + (prev := x) for x in t[1:]]
[2, 3]

answered Nov 30 '19 at 16:40

Eugene Yarmash

142,882
41
325
378

score 4 · Answer 8 · edited Jan 21 '21 at 13:19

4

Ok. I think I found the proper solution:

v = [x[0]-x[1] for x in zip(t[1:],t[:-1])]

edited Jan 21 '21 at 13:19

ClimateUnboxed

7,106
3
41
86

answered Mar 08 '10 at 11:25

psihodelia

29,566
35
108
157

2

ya its good, but I think it should have been v = [x[0]-x[1] for x in zip(t[1:], t[:-1])] for sorted list! – Amitkumar Karnik Jan 31 '16 at 10:38

Ola Fosheim Grøstad · Answer 9 · 2017-10-27T18:43:52.047

4

A functional approach:

>>> import operator
>>> a = [1,3,5,7,11,13,17,21]
>>> map(operator.sub, a[1:], a[:-1])
[2, 2, 2, 4, 2, 4, 4]

Using generator:

>>> import operator, itertools
>>> g1,g2 = itertools.tee((x*x for x in xrange(5)),2)
>>> list(itertools.imap(operator.sub, itertools.islice(g1,1,None), g2))
[1, 3, 5, 7]

Using indices:

>>> [a[i+1]-a[i] for i in xrange(len(a)-1)]
[2, 2, 2, 4, 2, 4, 4]

edited Oct 27 '17 at 18:43

answered Oct 23 '17 at 12:36

Ola Fosheim Grøstad

259
1
6

ClimateUnboxed · Answer 10 · 2021-01-21T13:18:35.573

I suspect this is what the numpy diff command does anyway, but just for completeness you can simply difference the sub-vectors:

from numpy import array as a
a(x[1:])-a(x[:-1])

In addition, I wanted to add these solutions to generalizations of the question:

Solution with periodic boundaries

Sometimes with numerical integration you will want to difference a list with periodic boundary conditions (so the first element calculates the difference to the last. In this case the numpy.roll function is helpful:

v-np.roll(v,1)

Solutions with zero prepended

Another numpy solution (just for completeness) is to use

numpy.ediff1d(v)

This works as numpy.diff, but only on a vector (it flattens the input array). It offers the ability to prepend or append numbers to the resulting vector. This is useful when handling accumulated fields that is often the case fluxes in meteorological variables (e.g. rain, latent heat etc), as you want a resulting list of the same length as the input variable, with the first entry untouched.

Then you would write

np.ediff1d(v,to_begin=v[0])

Of course, you can also do this with the np.diff command, in this case though you need to prepend zero to the series with the prepend keyword:

np.diff(v,prepend=0.0)

All the above solutions return a vector that is the same length as the input.

score 0 · Answer 11 · edited Nov 13 '21 at 04:32

0

You can also convert the difference into an easily readable transition matrix using

v = t.reshape((c,r)).T - t.T

Where c = number of items in the list and r = 1 since a list is basically a vector or a 1d array.

edited Nov 13 '21 at 04:32

tdy

36,675
19
86
83

answered Nov 12 '21 at 22:58

John

1

score -1 · Answer 12 · answered Apr 15 '16 at 07:09

-1

My way

>>>v = [1,2,3,4,5]
>>>[v[i] - v[i-1] for i, value in enumerate(v[1:], 1)]
[1, 1, 1, 1]

answered Apr 15 '16 at 07:09

Arindam Roychowdhury

5,927
5
55
63

2

Using `enumerate` is wasteful because you're not using `value`. See https://stackoverflow.com/a/16714453/832230 – Asclepius Nov 25 '16 at 08:01

Python: Finding differences between elements of a list

12 Answers12

Linked

Related