How to make lists contain only distinct element in Python?

Question

I have a list in Python, how can I make it's values unique?

Or [this](http://stackoverflow.com/questions/480214/) if you want to preserve the ordering. — Björn Pollex, Dec 16 '10 at 10:31
Please fix the title of your question. You're not talking about make lists distinct. You're talking about making list **items** distinct. — S.Lott, Dec 16 '10 at 11:31
Why do you need list in the first place? Maybe set() or dict() are enough. — Paweł Prażak, Dec 16 '10 at 14:22
Also see [here](http://stackoverflow.com/q/7961363/1129682) for more information — user1129682, Mar 14 '14 at 17:37
Possible duplicate of [Removing duplicates in lists](https://stackoverflow.com/questions/7961363/removing-duplicates-in-lists) or https://stackoverflow.com/questions/480214/how-do-you-remove-duplicates-from-a-list-in-whilst-preserving-order — Ciro Santilli OurBigBook.com, Dec 18 '17 at 14:25

score 389 · Accepted Answer · answered Dec 16 '10 at 10:29

389

The simplest is to convert to a set then back to a list:

my_list = list(set(my_list))

One disadvantage with this is that it won't preserve the order. You may also want to consider if a set would be a better data structure to use in the first place, instead of a list.

answered Dec 16 '10 at 10:29

Mark Byers

811,555
193
1,581
1,452

1

i am wrong or with python3k the values will be preserved, cause set now are sorted? – Ant Dec 16 '10 at 10:32
3

@Ant Dictionary key order is preserved from Python 3.6, but it says "the order-preserving aspect of this new implementation is considered an implementation detail and should not be relied upon". Since they're both based on hashes, I'd think set would be the same, but it's not mentioned, so apparently not: https://docs.python.org/3.6/whatsnew/3.6.html – Mark Dec 03 '16 at 11:52
Preserve order and functional way: In `[23]: from functools import reduce` `In [24]: reduce(lambda acc,elem: acc+[elem] if not elem in acc else acc , [2,1,2,3,3,3,4,5], [])` `Out[24]: [2, 1, 3, 4, 5]` – Sky Oct 15 '18 at 05:52
The order of the list gets lost this way – Inaam Ilahi Aug 04 '22 at 11:12
worth mentioning that this doesn't work if the list contains a list. – Aiden Cullo Jul 17 '23 at 00:40

Paweł Prażak · Answer 2 · 2011-04-10T06:44:20.043

Modified versions of http://www.peterbe.com/plog/uniqifiers-benchmark

To preserve the order:

def f(seq): # Order preserving
  ''' Modified version of Dave Kirby solution '''
  seen = set()
  return [x for x in seq if x not in seen and not seen.add(x)]

OK, now how does it work, because it's a little bit tricky here if x not in seen and not seen.add(x):

In [1]: 0 not in [1,2,3] and not print('add')
add
Out[1]: True

Why does it return True? print (and set.add) returns nothing:

In [3]: type(seen.add(10))
Out[3]: <type 'NoneType'>

and not None == True, but:

In [2]: 1 not in [1,2,3] and not print('add')
Out[2]: False

Why does it print 'add' in [1] but not in [2]? See False and print('add'), and doesn't check the second argument, because it already knows the answer, and returns true only if both arguments are True.

More generic version, more readable, generator based, adds the ability to transform values with a function:

def f(seq, idfun=None): # Order preserving
  return list(_f(seq, idfun))

def _f(seq, idfun=None):  
  ''' Originally proposed by Andrew Dalke '''
  seen = set()
  if idfun is None:
    for x in seq:
      if x not in seen:
        seen.add(x)
        yield x
  else:
    for x in seq:
      x = idfun(x)
      if x not in seen:
        seen.add(x)
        yield x

Without order (it's faster):

def f(seq): # Not order preserving
  return list(set(seq))

sort of inner helper function (there was a bug in the code, should be _f instead of _f10 on line 2, thanks for spotting) — Paweł Prażak, Apr 10 '11 at 06:45

score 30 · Answer 3 · answered Jul 21 '14 at 12:44

30

one-liner and preserve order

list(OrderedDict.fromkeys([2,1,1,3]))

although you'll need

from collections import OrderedDict

answered Jul 21 '14 at 12:44

brillout

7,804
11
72
84

3

An alternative form is: OrderedDict.fromkeys(my_list).keys() – Danny Staple Jan 28 '15 at 14:05
2

@DannyStaple: that works in python 2, but in python 3 it returns a view of the dictionary keys, which might be okay for some purposes, but doesn't support indexing for example. – Mark Dec 03 '16 at 11:57
The initial one liner will work. The aternative form returns an odict_keys type, which is less useful for this - but can still be converted to a list. – Danny Staple Dec 04 '16 at 00:19

score 18 · Answer 4 · answered Aug 11 '14 at 05:28

18

Let me explain to you by an example:

if you have Python list

>>> randomList = ["a","f", "b", "c", "d", "a", "c", "e", "d", "f", "e"]

and you want to remove duplicates from it.

>>> uniqueList = []

>>> for letter in randomList:
    if letter not in uniqueList:
        uniqueList.append(letter)

>>> uniqueList
['a', 'f', 'b', 'c', 'd', 'e']

This is how you can remove duplicates from the list.

answered Aug 11 '14 at 05:28

disp_name

1,448
2
20
46

4

+1 because it's the only one that works for types that are unhashable, but do have an __eq__ function (if your types are hashable, use one of the other solutions). Note that it will be slow for very big lists. – Claude Sep 09 '14 at 19:48
1

Unless in some special case as Claude explained, this one has the worst performance: O(n^2) – Yingbo Miao Jun 18 '20 at 20:26

khachik · Answer 5 · 2010-12-16T10:39:29.873

14

To preserve the order:

l = [1, 1, 2, 2, 3]
result = list()
map(lambda x: not x in result and result.append(x), l)
result
# [1, 2, 3]

edited Dec 16 '10 at 10:39

answered Dec 16 '10 at 10:32

khachik

28,112
9
59
94

4

In python 3.4 returns an empty list!!! – Broken_Window Feb 04 '17 at 16:52
1

map just creates map object (generator), does not execute it. list(map(....)) forces the execution – user2389519 Jul 04 '22 at 06:53
It will produce not optimal performance, due to traversal the result list for each x. – eg04lt3r Jul 10 '23 at 23:02

cod3monk3y · Answer 6 · 2015-01-28T17:08:03.750

10

How about dictionary comprehensions?

>>> mylist = [3, 2, 1, 3, 4, 4, 4, 5, 5, 3]

>>> {x:1 for x in mylist}.keys()
[1, 2, 3, 4, 5]

EDIT To @Danny's comment: my original suggestion does not keep the keys ordered. If you need the keys sorted, try:

>>> from collections import OrderedDict

>>> OrderedDict( (x,1) for x in mylist ).keys()
[3, 2, 1, 4, 5]

which keeps elements in the order by the first occurrence of the element (not extensively tested)

edited Jan 28 '15 at 17:08

answered Jan 14 '14 at 06:54

cod3monk3y

9,508
6
39
54

This would not preserve order - dictionary order (and set order) is determined by the hashing algorithm and not insertion order. I am not sure of the effects of a dictionary comprehension with an OrderedDict type though. – Danny Staple Jan 28 '15 at 14:00
@DannyStaple True. I added an example using `OrderedDict` and a generator, if ordered output is desired. – cod3monk3y Jan 28 '15 at 17:03

score 4 · Answer 7 · answered May 31 '16 at 15:14

The characteristics of sets in Python are that the data items in a set are unordered and duplicates are not allowed. If you try to add a data item to a set that already contains the data item, Python simply ignores it.

>>> l = ['a', 'a', 'bb', 'b', 'c', 'c', '10', '10', '8','8', 10, 10, 6, 10, 11.2, 11.2, 11, 11]
>>> distinct_l = set(l)
>>> print(distinct_l)
set(['a', '10', 'c', 'b', 6, 'bb', 10, 11, 11.2, '8'])

Seitaridis · Answer 8 · 2010-12-16T12:03:48.553

3

If all elements of the list may be used as dictionary keys (i.e. they are all hashable) this is often faster. Python Programming FAQ

d = {}
for x in mylist:
    d[x] = 1
mylist = list(d.keys())

edited Dec 16 '10 at 12:03

answered Dec 16 '10 at 11:57

Seitaridis

4,459
9
53
85

score 2 · Answer 9 · answered Dec 16 '13 at 10:09

2

The simplest way to remove duplicates whilst preserving order is to use collections.OrderedDict (Python 2.7+).

from collections import OrderedDict
d = OrderedDict()
for x in mylist:
    d[x] = True
print d.iterkeys()

answered Dec 16 '13 at 10:09

adam.lofts

1,142
9
10

score 2 · Answer 10 · answered Dec 16 '10 at 10:29

2

From http://www.peterbe.com/plog/uniqifiers-benchmark:

def f5(seq, idfun=None):  
    # order preserving
    if idfun is None:
        def idfun(x): return x
    seen = {}
    result = []
    for item in seq:
        marker = idfun(item)
        # in old Python versions:
        # if seen.has_key(marker)
        # but in new ones:
        if marker in seen: continue
        seen[marker] = 1
        result.append(item)
    return result

answered Dec 16 '10 at 10:29

sje397

41,293
8
87
103

1

Wouldn't it make sense to use a set for seen, rather than a dict? – Thomas K Dec 16 '10 at 11:12
In Python, sets and dicts are built using hashtables so they are interchangeable in this scenario. They both provide the same operations (limiting duplicates) and both have the same running time. – brildum Dec 16 '10 at 14:29
This one is slow, generator version is much faster – Paweł Prażak Dec 16 '10 at 17:17

How to make lists contain only distinct element in Python?

10 Answers10

Linked

Related