Pythonic way to split a list after elements for which a given predicate is true

Question

Assume you have a list of arbitrary elements like

['monkey', 'deer', 'lion', 'giraffe', 'lion', 'eagle', 'lion', 'fish']

which should be split into sublists after each element for which a given predicate, e.g.

is_lion(element)

returns True. The above example should become

[['monkey', 'deer', 'lion'], ['giraffe', 'lion'], ['eagle', 'lion'], ['fish']]

Is there a pythonic way of doing it?

@Aशwiniचhaudhary Correct, I missed this. However I think the question asked this way is more generic. — Mario Konschake, Mar 25 '14 at 08:48
@Aशwiniचhaudhary, The question you posted is very close to this one. Not exactly, but enough to make me uncomfortable about this one. — batbrat, Mar 25 '14 at 08:50
@jonrsharpe `['snake','lion','lion']` returns `[['snake','lion'],['lion']]` — Mario Konschake, Mar 25 '14 at 08:56

jonrsharpe · Accepted Answer · 2014-03-25T09:00:20.997

5

The easiest way is probably:

out = [[]]
for element in lst:
    out[-1].append(element)
    if predicate(element):
        out.append([])

Note that this would leave an empty list at the end of out, if predicate(element): for the last element. You can remove this by adding:

out = [l for l in out if l]

edited Mar 25 '14 at 09:00

answered Mar 25 '14 at 08:47

jonrsharpe

115,751
26
228
437

score 2 · Answer 2 · answered Mar 25 '14 at 09:12

2

Just because we can, a functional one-liner:

from functools import reduce

reduce(lambda out, x: out[:-1] + [out[-1] + [x]] if not predicate(x) else out + [[x]], x, [[]])

answered Mar 25 '14 at 09:12

filmor

30,840
6
50
48

score 1 · Answer 3 · answered Mar 25 '14 at 09:27

I rather like this solution:

def f(outs, x):
    if outs[-1][-1:] == ["lion"]:
        outs.append([])
    outs[-1].append(x)
    return outs

def splitAfterLion(xs):
    return reduce(f,xs,[[]])

It might not be very pythonic, more functional. But it's short and does not suffer from trailing empty lists in the result.

utdemir · Answer 4 · 2014-03-25T08:55:27.307

0

>>> import itertools
>>> l = ['monkey', 'deer', 'lion', 'giraffe', 'lion', 'eagle', 'lion', 'fish']
>>> f = lambda i: i == "lion"
>>> a = [list(j) for i, j in itertools.groupby(l, f)]
>>> a
[['monkey', 'deer'], ['lion'], ['giraffe'], ['lion'], ['eagle'], ['lion'], ['fish']]
>>> [i+j for i, j in zip(a[::2], a[1::2])]
[['monkey', 'deer', 'lion'], ['giraffe', 'lion'], ['eagle', 'lion']]

Edit:

>>> [i+j for i, j in itertools.zip_longest(a[::2], a[1::2], fillvalue=[])]
[['monkey', 'deer', 'lion'], ['giraffe', 'lion'], ['eagle', 'lion'], ['fish']]

edited Mar 25 '14 at 08:55

answered Mar 25 '14 at 08:49

utdemir

26,532
10
62
81

2

You dropped the `fish`... – Tim Pietzcker Mar 25 '14 at 08:53
Thanks for noticing, fixed. – utdemir Mar 25 '14 at 08:55
1

Also, this will group consecutive `lion`s into one list – jonrsharpe Mar 25 '14 at 08:59

score 0 · Answer 5 · answered Mar 25 '14 at 09:07

Just another way of doing it by getting the index without using itertool, please let me know if that works for you:

#!/usr/bin/python

ls = ['monkey', 'deer', 'lion', 'giraffe', 'lion', 'eagle', 'lion', 'fish', 'fish']

def is_lion(elm):
    return elm in ls

def mark_it(nm):
    ind = [ x+1 for x,y in enumerate(ls) if y == nm ]
    if ind[-1] < len(ls):
        ind.append(len(ls))
    return ind

def merge_it(ind):
    return [list(ls[x[0]:x[1]]) for x in zip(ind[::], ind[1::])]

name = 'lion'
if is_lion(name):
    index = [0]
    index.extend(mark_it(name))
    print merge_it(index)
else:
    print 'not found'

Output:

[['monkey', 'deer', 'lion'], ['giraffe', 'lion'], ['eagle', 'lion'], ['fish', 'fish']]

score 0 · Answer 6 · answered Mar 25 '14 at 09:09

Here is a solution:

def is_lion(a, element):
    start = 0
    for key,value in enumerate(a):
        if value == element:
            yield a[start:key+1]
            start = key+1

    # print out the last sub-list
    if value != 'lion':
        yield a[start:key+1]


a = ['monkey', 'deer', 'lion', 'giraffe', 'lion', 'eagle', 'lion', 'fish']

print [x for x in is_lion(a, 'lion')]

Pythonic way to split a list after elements for which a given predicate is true

6 Answers6

Linked

Related