4

I have two lists l and l_match. l_match is an empty list.

l = ['gtttaattgagttgtcatatgttaataacg',
     'tttaattgagttgtcatatgttaataacgg',
     'ttaattgagttgtcatatgttaataacggt',
     'taattgagttgtcatatgttaataacggta',
     'aattgagttgtcatatgttaataacggtat']

l_match = []

print list(set(l) - set(l_match))

gives the output

['aattgagttgtcatatgttaataacggtat',
 'tttaattgagttgtcatatgttaataacgg',
 'ttaattgagttgtcatatgttaataacggt',
 'taattgagttgtcatatgttaataacggta',
 'gtttaattgagttgtcatatgttaataacg']

I want the output in the same order as the input, i.e., in the above case the output should be

['gtttaattgagttgtcatatgttaataacg',
 'tttaattgagttgtcatatgttaataacgg',
 'ttaattgagttgtcatatgttaataacggt',
 'taattgagttgtcatatgttaataacggta',
 'aattgagttgtcatatgttaataacggtat']

Can you suggest edits?

Georgy
Ssank

4 Answers

2

Just make l_match a set:

l_match = []

st = set(l_match)

print([ele for ele in l if ele not in st])

If l can have dupes, use an OrderedDict to get the unique values from l:

from collections import OrderedDict
print([ele for ele in OrderedDict.fromkeys(l) if ele not in st])

Obviously l_match would contain values in the real world; otherwise a simple l[:] = OrderedDict.fromkeys(l) would suffice to remove dupes from l and keep the order.
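
As a rough sketch, the two steps above (dropping dupes, then filtering against a set) could be wrapped in one helper. The name ordered_difference and its parameters are hypothetical, and it assumes Python 3.7+, where a plain dict keeps insertion order and can stand in for OrderedDict:

```python
def ordered_difference(items, exclude):
    """Return the unique elements of items that are not in exclude,
    in the order they first appear in items."""
    excluded = set(exclude)  # set membership tests are O(1) on average
    # dict.fromkeys keeps the first occurrence of each key, in order
    # (guaranteed for plain dicts from Python 3.7 onward).
    return [ele for ele in dict.fromkeys(items) if ele not in excluded]


l = ['gtttaattgagttgtcatatgttaataacg',
     'tttaattgagttgtcatatgttaataacgg',
     'ttaattgagttgtcatatgttaataacggt']
l_match = ['tttaattgagttgtcatatgttaataacgg']

print(ordered_difference(l, l_match))
```

With an empty l_match this simply returns the unique elements of l in their original order.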

Padraic Cunningham
1

This is old af but, in case someone is still wondering about it, a little googling gave me this really simple solution.

x = [1, 2, 6, 8, 2, 3]
y = [2, 6]
sorted(set(x) - set(y), key=x.index)

output -> [1, 8, 3]
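
One thing to keep in mind: x.index scans the list from the start on every call, so the sort key above gets slow on long lists. A possible variant, assuming the elements are hashable, precomputes each value's first position once. The name first_pos is made up for this sketch:

```python
x = [1, 2, 6, 8, 2, 3]
y = [2, 6]

# Map each value to the index of its first occurrence in x.
first_pos = {}
for i, value in enumerate(x):
    first_pos.setdefault(value, i)  # keeps the earliest index only

# Sort the set difference by a dict lookup instead of calling x.index.
result = sorted(set(x) - set(y), key=first_pos.__getitem__)
print(result)  # [1, 8, 3]
```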

Gabriel Pena
0

Loop through l and include each element in the result list only if it is not in l_match. This preserves the order. In Python, this is a single line:

print([entry for entry in l if entry not in l_match])
Robin James Kerrison
0

What about this: How do you remove duplicates from a list whilst preserving order?

l = ['gtttaattgagttgtcatatgttaataacg', 'tttaattgagttgtcatatgttaataacgg', 'ttaattgagttgtcatatgttaataacggt', 'taattgagttgtcatatgttaataacggta', 'aattgagttgtcatatgttaataacggtat']
seen = set()
seen_add = seen.add
print([x for x in l if not (x in seen or seen_add(x))])
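
Note that the snippet above only removes duplicates. To also leave out the entries of l_match, as the question asks, one option is to seed the seen set with them. This is a sketch rather than part of the linked answer, and the name unique_excluding is hypothetical:

```python
def unique_excluding(items, exclude):
    """Keep the first occurrence of each element of items, in order,
    skipping anything that appears in exclude."""
    seen = set(exclude)  # excluded values count as already seen
    result = []
    for item in items:
        if item not in seen:
            seen.add(item)
            result.append(item)
    return result


print(unique_excluding(['a', 'b', 'a', 'c', 'b'], ['b']))  # ['a', 'c']
```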
Edwin Torres