9

Is there a way to make a list comprehension in Python that only contains unique items?

My original idea was to use something like this : new_items = [unicode(item) for item in items]

However, I later realized that I needed to omit duplicate items. So I ended up with this ugly monstrosity :

unique_items = []
for item in items :
    unicode_item = unicode(item)
    if unicode_item not in unique_items :
        unique_items.append(unicode_item)

Now this is far less pretty (and readable) than a simple list comprehension. So, is there a way to make a list comprehension equivalent to the above code?

Also order does matter, so I can't just use a set comprehension.

rectangletangle
  • 50,393
  • 94
  • 205
  • 275
  • Interesting, my initial thought was that you can do filtering in list comprehensions, but then I realized you'd need access to the new list that you're creating in the filter condition. – Davy8 Oct 01 '12 at 22:05
  • If order matters, how do I know what instance of a repeated item to use? The first, the last, or one in the middle? – lvella Oct 01 '12 at 22:09
  • 1
    possible duplicate of [How do you remove duplicates from a list in Python whilst preserving order?](http://stackoverflow.com/questions/480214/how-do-you-remove-duplicates-from-a-list-in-python-whilst-preserving-order) – mgilson Oct 01 '12 at 22:12
  • @mgilson After a quick check it seems the accepted answer to this question is more efficient than the accepted answer of the linked question. – Dunes Oct 01 '12 at 22:43

4 Answers4

18

Well, there is no ordered set, but we can misuse OrderedDict:

from collections import OrderedDict
t = "never gonna give you up"
OrderedDict.fromkeys(t).keys()

Gives:

['n', 'e', 'v', 'r', ' ', 'g', 'o', 'a', 'i', 'y', 'u', 'p']
Michael
  • 7,316
  • 1
  • 37
  • 63
  • Oh, very nice. On a side note, try using `repeat(None)` from `itertools` rather than `[None]*len(t)` – Dunes Oct 01 '12 at 22:20
  • 3
    How about `OrderedDict.fromkeys(t).keys()` instead? [Note that this approach, as well as the set approach, limits us to hashable elements.] – DSM Oct 01 '12 at 22:20
10

Your original idea works with a set comprehension:

new_items = {unicode(item) for item in items}
schryer
  • 103
  • 1
  • 3
6

I short one liner might be:

s = "some string"
unique_items = [unicode(ch) for ch in sorted(set(s), key=s.index)]
Dunes
  • 37,291
  • 7
  • 81
  • 97
4

Make it a helper function, like so.

def unique_iter(iterable):
  seen = set()
  for item in iterable:
    if item in seen:
      continue
    seen.add(item)
    yield item

for ch in unique_iter("never gonna give you up"):
  print ch,

outputs

n e v r g o a i y u p

AKX
  • 152,115
  • 15
  • 115
  • 172