Sort a list of tuples by 2nd item (integer value)

Question

I have a list of tuples that looks something like this:

[('abc', 121),('abc', 231),('abc', 148), ('abc',221)]

I want to sort this list in ascending order by the integer value inside the tuples. Is it possible?

score 832 · Accepted Answer · edited Apr 14 '23 at 21:32

832

Try using the key keyword argument of sorted(), which sorts in increasing order by default:

sorted(
    [('abc', 121), ('abc', 231), ('abc', 148), ('abc', 221)], 
    key=lambda x: x[1]
)

key should be a function that identifies how to retrieve the comparable element from your data structure. In your case, it is the second element of the tuple, so we access [1].

For optimization, see jamylak's response using operator.itemgetter(1), which is essentially a faster version of lambda x: x[1].

edited Apr 14 '23 at 21:32

Intrastellar Explorer

3,005
9
52
119

answered May 22 '12 at 02:51

cheeken

33,663
4
35
42

26

While obvious. Sorted does not sort in place so: sorted_list = sorted([('abc', 121),('abc', 231),('abc', 148), ('abc',221)], key=lambda x: x[1]) – Vesanto Apr 18 '18 at 16:46
28

,reverse=True for biggest to smallest. – jonincanada Sep 30 '18 at 15:01
4

This still works well with Python 3.7. – jftuga Oct 30 '18 at 20:44
5

You can also add multiple keys as a tuple, if you want one as reversed you can add a negative sign, this will sort using the first element first and then second element: `sorted(some_list, lambda x: (x[0], -x[1],))` – Seraf Mar 10 '19 at 17:34
What's gonna happen in above case if we don't provide any key? – Hemanth Bakaya Mar 27 '20 at 12:12
3

I just wanted to say this is my most visited stackoverflow page of all time; i've been here like easily 500 times by now. Thank you cheeken, if only i could memorize this one line of code. – negfrequency Aug 19 '20 at 03:08

jamylak · Answer 2 · 2012-05-22T04:19:26.463

236

>>> from operator import itemgetter
>>> data = [('abc', 121),('abc', 231),('abc', 148), ('abc',221)]
>>> sorted(data,key=itemgetter(1))
[('abc', 121), ('abc', 148), ('abc', 221), ('abc', 231)]

IMO using itemgetter is more readable in this case than the solution by @cheeken. It is also faster since almost all of the computation will be done on the c side (no pun intended) rather than through the use of lambda.

>python -m timeit -s "from operator import itemgetter; data = [('abc', 121),('abc', 231),('abc', 148), ('abc',221)]" "sorted(data,key=itemgetter(1))"
1000000 loops, best of 3: 1.22 usec per loop

>python -m timeit -s "data = [('abc', 121),('abc', 231),('abc', 148), ('abc',221)]" "sorted(data,key=lambda x: x[1])"
1000000 loops, best of 3: 1.4 usec per loop

edited May 22 '12 at 04:19

answered May 22 '12 at 02:51

jamylak

128,818
30
231
230

15

+1 I agree that `itemgetter()` is a better solution. However, I thought a lambda expression would make it clearer how `key` functions. – cheeken May 22 '12 at 04:45
1

+1 However, When I ran your testing of the speed I noticed 'human-eye' that the one that is supposed to be faster.. and measured faster, actually was noticeably slower. I scratched my head on this for a bit, then took the python timeout module out of play and just used linux time. i.e. `time \`python -c "the code"\`` then I got 'human-eye' results that you spell out, as well as sys clock times that were faster. Still not sure why this is, but it was reproducible. I gather it has something to do with the overhead of loading in the module's, but still does not quite make since to me, just yet. – Jeff Sheffield Jul 23 '14 at 17:38
2

@JeffSheffield: Notice that jamylak is doing the import in the setup code (outside the timing), not the tested code. That's perfectly reasonable, because most programs will need to sort more than once, or need to sort much larger collections, but they'll only do the import once. (And for those programs that only need to do one smallish sort ever… well, you're talking about a difference of under a microsecond, so who cares either way?) – abarnert Sep 04 '14 at 02:24
@abarnert FYI: jamylak is doing the import inside of the `python -m timeit -s` but yea I think you are on point to say that in a production scenario you only pay that lib load penalty once. and... as for who cares about that microsecond... you care because the assumption is that your sorting data is going to get quite large and that microsecond is going to turn into real seconds once the data set grows. – Jeff Sheffield Sep 04 '14 at 14:05
@JeffSheffield: That's exactly the point: the cost of the import will not grow with the data, so even if it seems like a large part of the 1us you're paying for one smallish sort, it's going to be an irrelevant part of the 500ms you pay for a big sort, or a bunch of small sorts. – abarnert Sep 04 '14 at 17:37
x = [[[5,3],1.0345],[[5,6],5.098],[[5,4],4.89],[[5,1],5.97]] With a list like this is can we sort using itemgetter() with respect to elements in x[0][1] ? – nidHi Dec 02 '16 at 09:49
@nidHi I'm not sure if that can be done but I'm pretty sure that even if itemgetter could, lambda (solution above) would be clearer and hence more pythonic in that case. In your case though, (x[0] being the same for x's all elements) a simple `sorted(x)` will give you desired order. So, that, probably with a comment would be the most pythonic statement. – 0xc0de Mar 22 '17 at 04:49
I extend the `data` to 48 elements and do it in Jupyter. The results are `5.19 µs ± 42.6 ns per loop` for `%timeit sorted(data, key=itemgetter(1))` and `6.7 µs ± 63.6 ns per loop` for `%timeit sorted(data, key=lambda x: x[1])`. So `itemgetter` is still faster. – Louis Yang Feb 10 '19 at 20:55
@LouisYang Thanks for sharing those results. It confirms what I expected – jamylak Feb 10 '19 at 23:28

score 51 · Answer 3 · answered Mar 29 '15 at 17:50

51

Adding to Cheeken's answer, This is how you sort a list of tuples by the 2nd item in descending order.

sorted([('abc', 121),('abc', 231),('abc', 148), ('abc',221)],key=lambda x: x[1], reverse=True)

answered Mar 29 '15 at 17:50

Vignesh Jayavel

984
11
11

1

Note that the original list will not be changed. the `sorted` function just produce a new list which is sorted for you. – ZhaoGang Oct 30 '19 at 02:50

score 45 · Answer 4 · edited Nov 20 '13 at 23:07

45

As a python neophyte, I just wanted to mention that if the data did actually look like this:

data = [('abc', 121),('abc', 231),('abc', 148), ('abc',221)]

then sorted() would automatically sort by the second element in the tuple, as the first elements are all identical.

edited Nov 20 '13 at 23:07

kero

10,647
5
41
51

answered Nov 20 '13 at 22:49

Angus

451
4
2

score 29 · Answer 5 · answered Jun 30 '17 at 18:13

29

For an in-place sort, use

foo = [(list of tuples)]
foo.sort(key=lambda x:x[0]) #To sort by first element of the tuple

answered Jun 30 '17 at 18:13

Shivank Tibrewal

291
3
2

2

Although this answer may be correct, it is better to explain why this answer is correct instead of providing code only. Additionally, this is almost an exact answer of one that already exists and was accepted 5 years ago, so this doesn't really add anything to the site. Take a look at newer questions to help people! – JNYRanger Jun 30 '17 at 18:49
14

actually this helps people looking for an in-place sort – leoschet May 19 '18 at 00:16
While this is helpful it would likely be more appropriate as a comment to the suggested answer indicating how one would use the same method as the one provided in that answer to accomplish the same task in-place. – Michael DiStefano Mar 14 '19 at 23:06

score 15 · Answer 6 · edited May 22 '12 at 02:57

15

From python wiki:

>>> from operator import itemgetter, attrgetter    
>>> sorted(student_tuples, key=itemgetter(2))
[('dave', 'B', 10), ('jane', 'B', 12), ('john', 'A', 15)]    
>>> sorted(student_objects, key=attrgetter('age'))
[('dave', 'B', 10), ('jane', 'B', 12), ('john', 'A', 15)]

edited May 22 '12 at 02:57

jamylak

128,818
30
231
230

answered May 22 '12 at 02:54

Dmitry Zagorulkin

8,370
4
37
60

x = [[[5,3],1.0345],[[5,6],5.098],[[5,4],4.89],[[5,1],5.97]] With a list like this is can we sort using itemgetter() with respect to elements in x[0][1] ? – nidHi Dec 02 '16 at 09:50

score 8 · Answer 7 · answered Feb 16 '16 at 02:58

8

For a lambda-avoiding method, first define your own function:

def MyFn(a):
    return a[1]

then:

sorted([('abc', 121),('abc', 231),('abc', 148), ('abc',221)], key=MyFn)

answered Feb 16 '16 at 02:58

paulm

107
1
5

2

What are the benefits of this? – dromtrund May 03 '16 at 08:42
6

One benefit would be to have a defined function that you could use anywhere without having to put `lambda x: x[1]` in multiple areas of code. – Tom Myddeltyn Jul 13 '16 at 14:59
2

Another benefit is that you can document / comment better if it is a separate function. – uli42 Dec 07 '17 at 11:26

score 5 · Answer 8 · answered Feb 05 '17 at 11:01

5

For Python 2.7+, this works which makes the accepted answer slightly more readable:

sorted([('abc', 121),('abc', 231),('abc', 148), ('abc',221)], key=lambda (k, val): val)

answered Feb 05 '17 at 11:01

Neil

7,042
9
43
78

score 0 · Answer 9 · answered Jun 01 '17 at 15:15

The fact that the sort values in the OP are integers isn't relevant to the question per se. In other words, the accepted answer would work if the sort value was text. I bring this up to also point out that the sort can be modified during the sort (for example, to account for upper and lower case).

>>> sorted([(121, 'abc'), (231, 'def'), (148, 'ABC'), (221, 'DEF')], key=lambda x: x[1])
[(148, 'ABC'), (221, 'DEF'), (121, 'abc'), (231, 'def')]
>>> sorted([(121, 'abc'), (231, 'def'), (148, 'ABC'), (221, 'DEF')], key=lambda x: str.lower(x[1]))
[(121, 'abc'), (148, 'ABC'), (231, 'def'), (221, 'DEF')]

Sort a list of tuples by 2nd item (integer value)

9 Answers9

Linked

Related