192

I know that this sounds trivial, but I did not realize that the sort() function of Python was weird. I have a list of "numbers" that are actually in string form, so I first convert them to ints, then attempt a sort.

list1=["1","10","3","22","23","4","2","200"]
for item in list1:
    item=int(item)

list1.sort()
print list1

Gives me:

['1', '10', '2', '200', '22', '23', '3', '4']

I want

['1','2','3','4','10','22','23','200']

I've looked around for some of the algorithms associated with sorting numeric sets, but the ones I found all involved sorting alphanumeric sets.

I know this is probably a no-brainer problem, but Google and my textbook don't offer anything more or less useful than the .sort() function.

Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
Brian
  • 4,023
  • 8
  • 29
  • 36
  • 14
    Note that your for loop does not do what I suspect that you think it does. – deinst Aug 06 '10 at 17:21
  • 1
    At no time did you update `list1`. What made you think `list` was being updated? – S.Lott Aug 06 '10 at 17:23
  • The similar problem raise when list1 = ['1', '1.10', '1.11', '1.1', '1.2'] is provided as input. Instead of getting output as ['1', '1.1', '1.2', '1.10', '1.11'], I am getting ['1', '1.1', '1.10', '1.11', '1.2'] – sathish May 20 '16 at 03:46
  • 2
    in python 3 you may want to use `sorted(mylist)` – Akin Hwan Dec 23 '18 at 15:02
  • Related: *[Sort a list of numerical strings in ascending order](https://stackoverflow.com/questions/9758959/)* – Peter Mortensen Jul 31 '23 at 20:14

14 Answers14

228

You haven't actually converted your strings to ints. Or rather, you did, but then you didn't do anything with the results. What you want is:

list1 = ["1","10","3","22","23","4","2","200"]
list1 = [int(x) for x in list1]
list1.sort()

If for some reason you need to keep strings instead of ints (usually a bad idea, but maybe you need to preserve leading zeros or something), you can use a key function. sort takes a named parameter, key, which is a function that is called on each element before it is compared. The key function's return values are compared instead of comparing the list elements directly:

list1 = ["1","10","3","22","23","4","2","200"]
# call int(x) on each element before comparing it
list1.sort(key=int)
# or if you want to do it all in the same line
list1 = sorted([int(x) for x in list1]) 
rkachach
  • 16,517
  • 6
  • 42
  • 66
Seamus Campbell
  • 17,816
  • 3
  • 52
  • 60
  • 8
    when I try key=int in 2.7 I get None – KI4JGT Jan 28 '13 at 05:48
  • 1
    This works if the list element is stored as "integer", how shall be handled in case of float values? Eg., list1 = [1, 1.10, 1.11, 1.1, 1.2] – sathish May 19 '16 at 09:01
  • 3
    @KI4JGT the sort method modifies the list and returns None. So instead of `list1 = list1.sort(key=int)`, use just `list1.sort(key=int)` and list1 will already be sorted. – Josiah Yoder Jun 20 '18 at 16:41
  • 4
    @KI4JGT .sort() is an in place operator, it returns None, it sorts the list, you may want to use sorted() – sherpya Nov 14 '18 at 15:31
82

I approached the same problem yesterday and found a module called natsort, which solves your problem. Use:

from natsort import natsorted # pip install natsort

# Example list of strings
a = ['1', '10', '2', '3', '11']

[In]  sorted(a)
[Out] ['1', '10', '11', '2', '3']

[In]  natsorted(a)
[Out] ['1', '2', '3', '10', '11']

# Your array may contain strings
[In]  natsorted(['string11', 'string3', 'string1', 'string10', 'string100'])
[Out] ['string1', 'string3', 'string10', 'string11', 'string100']

It also works for dictionaries as an equivalent of sorted.

Alice Purcell
  • 12,622
  • 6
  • 51
  • 57
lrsp
  • 1,213
  • 10
  • 18
44

You could pass a function to the key parameter to the .sort method. With this, the system will sort by key(x) instead of x.

list1.sort(key=int)

BTW, to convert the list to integers permanently, use the map function

list1 = list(map(int, list1))   # you don't need to call list() in Python 2.x

or list comprehension

list1 = [int(x) for x in list1]
Claude
  • 8,806
  • 4
  • 41
  • 56
kennytm
  • 510,854
  • 105
  • 1,084
  • 1,005
33

In case you want to use sorted() function: sorted(list1, key=int)

It returns a new sorted list.

Cees Timmerman
  • 17,623
  • 11
  • 91
  • 124
syam
  • 562
  • 6
  • 15
21

You can also use:

import re

def sort_human(l):
    convert = lambda text: float(text) if text.isdigit() else text
    alphanum = lambda key: [convert(c) for c in re.split('([-+]?[0-9]*\.?[0-9]*)', key)]
    l.sort(key=alphanum)
    return l

This is very similar to other stuff that you can find on the internet but also works for alphanumericals like [abc0.1, abc0.2, ...].

Georgy
  • 12,464
  • 7
  • 65
  • 73
Julian
  • 385
  • 2
  • 9
  • You should probably either return a new list or modify the list, not both. The above code modifies the list and then returns it. Use `sorted()` to make a new list instead. – Victor Nordam Suadicani Nov 17 '20 at 19:15
  • 2
    Unfortunately this only works when letters and numbers don't appear in the same order; e.g. `["abc123", "123abc"]`: `TypeError: '<' not supported between instances of 'float' and 'str'`. Solution: replace the covert function with `(float(text), "") if text.isdigit() else (float("inf"), text)`. It will always return a (float, str) tuple, so comparison will always work. – Claude Feb 26 '23 at 10:55
14

Python's sort isn't weird. It's just that this code:

for item in list1:
   item=int(item)

isn't doing what you think it is - item is not replaced back into the list, it is simply thrown away.

Anyway, the correct solution is to use key=int as others have shown you.

Daniel Roseman
  • 588,541
  • 66
  • 880
  • 895
9

Seamus Campbell's answer doesn't work on Python 2.x.

list1 = sorted(list1, key=lambda e: int(e)) using lambda function works well.

Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
3

Try this. It’ll sort the list in-place in descending order (there isn’t any need to specify a key in this case):

Process

listB = [24, 13, -15, -36, 8, 22, 48, 25, 46, -9]
listC = sorted(listB, reverse=True) # listB remains untouched
print listC

output:

 [48, 46, 25, 24, 22, 13, 8, -9, -15, -36]
Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
Mavia
  • 71
  • 1
  • 6
1

The most recent solution is right. You are reading solutions as a string, in which case the order is 1, then 100, then 104 followed by 2 then 21, then 2001001010, 3 and so forth.

You have to cast your input as an int instead:

sorted strings:

stringList = (1, 10, 2, 21, 3)

sorted ints:

intList = (1, 2, 3, 10, 21)

To cast, just put the stringList inside int (blahblah).

Again:

stringList = (1, 10, 2, 21, 3)

newList = int (stringList)

print newList

=> returns (1, 2, 3, 10, 21)
Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
clint
  • 35
  • 1
  • 1
    TypeError: int() argument must be a string or a number, not 'tuple' – Cees Timmerman Apr 14 '14 at 14:56
  • Also, the strings in your stringList should have quotes. – Teepeemm Nov 20 '15 at 18:01
  • 3
    That's a helluva prediction to make: "the most recent solution is right" ;) – GreenAsJade Feb 29 '16 at 01:13
  • Re *"The most recent solution is right"*: Such relative references are not reliable (or stable). Which answer does it refer to? – Peter Mortensen Jul 31 '23 at 20:24
  • OK, the OP has left the building: *"Last seen more than 9 years ago"* – Peter Mortensen Jul 31 '23 at 20:24
  • "Most recent" may not even be taken literally, but instead as the lowest-scored answer at the time (by the default sort order), probably [Daniel Roseman's answer](https://stackoverflow.com/questions/3426108/how-to-sort-a-list-of-strings-numerically/3426157#3426157). Though it isn't at all clear what answer is being referred to. – Peter Mortensen Jul 31 '23 at 20:29
1

The real problem is that 'sort' sorts things alphanumerically.

So if you have a list, ['1', '2', '10', '19'], and run 'sort', you get ['1', '10'. '19', '2']. That is, 10 comes before 2, because it looks at the first character and sorts starting from that.

It seems most methods in Python return things in that order. For example, if you have a directory named 'abc' with the files labelled as 1.jpg, 2.jpg, etc., say, up to 15.jpg and you do file_list=os.listdir(abc), the file_list is not ordered as you expect, but rather as file_list=['1.jpg', '11.jpg'---'15.jpg', '2.jpg].

If the order in which files are processed is important (presumably that's why you named them numerically) the order is not what you think it will be. You can avoid this by using "zeros" padding. For example if you have a list, alist=['01', '03', '05', '10', '02','04', '06] and you run 'sort' on it, you get the order you wanted, alist=['01', '02', etc.], because the first character is 0 which comes before 1. The amount of zeros padding you need is determined by the largest value in the list.

For example, if the largest is, say, between 100 and 1000, you need to pad single digits as 001, 002 ---010, 011--100, 101, etc.

Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
Gerry P
  • 7,662
  • 3
  • 10
  • 20
0

A simple way to sort a numerical list:

numlists = ["5","50","7","51","87","97","53"]
results = list(map(int, numlists))
results.sort(reverse=False)
print(results)
Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
sayalok
  • 882
  • 3
  • 15
  • 30
0

If you want to use strings of the numbers, better take another list as shown in my code. It will work fine.

list1 = ["1", "10", "3", "22", "23", "4", "2", "200"]

k = []
for item in list1:
    k.append(int(item))

k.sort()
print(k)
# [1, 2, 3, 4, 10, 22, 23, 200]
Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
0

It may be not the best Python code, but for string lists like ['1', '1.0', '2.0', '2', '1.1', '1.10', '1.11', '1.2', '7', '3', '5'], with the expected target ['1', '1.0', '1.1', '1.2', '1.10', '1.11', '2', '2.0', '3', '5', '7'] helped me...

unsortedList = ['1', '1.0', '2.0', '2', '1.1', '1.10', '1.11', '1.2', '7', '3', '5']
sortedList = []
sortDict = {}
sortVal = []
# Set zero correct (integer): example: 1.000 will be 1 and breaks the order
zero = "000"
for i in sorted(unsortedList):
  x = i.split(".")
  if x[0] in sortDict:
    if len(x) > 1:
        sortVal.append(x[1])
    else:
        sortVal.append(zero)
    sortDict[x[0]] = sorted(sortVal, key = int)
  else:
    sortVal = []
    if len(x) > 1:
        sortVal.append(x[1])
    else:
        sortVal.append(zero)
    sortDict[x[0]] = sortVal
for key in sortDict:
  for val in sortDict[key]:
    if val == zero:
       sortedList.append(str(key))
    else:
       sortedList.append(str(key) + "." + str(val))
print(sortedList)
Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131
  • 2
    Welcome to SO! When you are about to answer an old question (this one is over 10 years old) that already has an accepted answer (that is the case here) please ask yourself: Do I really have a substantial improvement to offer? If not, consider refraining from answering. – Timus Oct 23 '20 at 10:38
-5

Use:

scores = ['91','89','87','86','85']
scores.sort()
print (scores)

This worked for me using Python version 3, though it didn't in version 2.

Peter Mortensen
  • 30,738
  • 21
  • 105
  • 131