Replace consecutive repeated occurrences of a word with a single one in a Python list

Question

I have a list in a particular format as follows:

my_list = ['apple', 'apple', 'boy', 'cat', 'cat', 'apple', 'apple',
           'apple', 'boy', 'cat', 'cat', 'dog', 'dog']

And my expected output is

res = ['apple', 'boy', 'cat', 'apple', 'boy', 'cat', 'dog']

Each run of consecutive occurrences of the same word should be replaced with a single occurrence, regardless of whether that word already appeared in an earlier run.

The code I tried is below, followed by its output.

test_list = ['apple', 'apple', 'boy', 'cat', 'cat', 'apple', 'apple', 
         'apple', 'boy', 'cat', 'cat', 'dog', 'dog'] 
res = []
[res.append(x) for x in test_list if x not in res] 
print ("The list after removing duplicates : " + str(res))

Output: ['apple', 'boy', 'cat', 'dog'], which contains only the distinct words. How can I proceed from here to get what I actually need? Thanks in advance.

Thank you for finding the duplicate. The SO search never seems to work very well for me. — Karl Knechtel, Feb 05 '21 at 10:31

score 2 · Accepted Answer · edited Feb 05 '21 at 10:26

Use itertools.groupby

from itertools import groupby

[key for key, _ in groupby(my_list)]

['apple', 'boy', 'cat', 'apple', 'boy', 'cat', 'dog']
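
For anyone wondering what groupby is doing here: it groups runs of equal adjacent elements, so taking only the keys keeps one item per run. A minimal sketch of the same idea as a plain loop, assuming the elements support `!=` comparison; the helper name `collapse_runs` is made up for illustration and is not part of itertools:

```python
from itertools import groupby


def collapse_runs(items):
    """Keep one element from each run of equal adjacent elements."""
    result = []
    for item in items:
        # Append only when the item differs from the last one kept.
        if not result or result[-1] != item:
            result.append(item)
    return result


my_list = ['apple', 'apple', 'boy', 'cat', 'cat', 'apple', 'apple',
           'apple', 'boy', 'cat', 'cat', 'dog', 'dog']

collapsed = collapse_runs(my_list)
grouped = [key for key, _ in groupby(my_list)]  # the one-liner from this answer
print(collapsed)
```

Both approaches should give the same list for this input; the groupby version is shorter, while the loop makes the comparison with the previous element explicit.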

edited Feb 05 '21 at 10:26 by Karl Knechtel
answered Feb 05 '21 at 10:19 by Epsi95

I simplified your code - the first element of the returned tuples, which you initially ignored, is already exactly what you want (so there's no need to parse the second element). – Karl Knechtel Feb 05 '21 at 10:26

score 0 · Answer 2 · edited Feb 05 '21 at 10:56

Use set(), which ignores duplicate values.

test_list = ['apple', 'apple', 'boy', 'cat', 'cat', 'apple', 'apple', 
         'apple', 'boy', 'cat', 'cat', 'dog', 'dog'] 
         
t = set(test_list)

Output:

{'apple', 'boy', 'cat', 'dog'}

If needed, you can convert the set back into a list by

list(t)

Output:

['dog', 'boy', 'apple', 'cat']
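
Note that the order of list(t) is arbitrary, because sets are unordered. If distinct values in their original order are what you want, a minimal sketch (assuming Python 3.7+, where dicts keep insertion order) is to use dict.fromkeys. Like set(), this removes every repeat, not only consecutive ones, so it does not produce the output the question asks for:

```python
test_list = ['apple', 'apple', 'boy', 'cat', 'cat', 'apple', 'apple',
             'apple', 'boy', 'cat', 'cat', 'dog', 'dog']

# dict keys are unique and keep first-seen order, unlike a set.
distinct_in_order = list(dict.fromkeys(test_list))
print(distinct_in_order)  # ['apple', 'boy', 'cat', 'dog']
```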

edited Feb 05 '21 at 10:56 by pfabri
answered Feb 05 '21 at 10:21 by blaze

Please read the OP's question. This is not what they are asking. – Akshay Sehgal Feb 05 '21 at 10:24

score 0 · Answer 3 · answered Feb 05 '21 at 10:30

Try this:

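# Append a sentinel "" so the last real element always has a differing successor.
my_list = ['apple', 'apple', 'boy', 'cat', 'cat', 'apple', 'apple',
           'apple', 'boy', 'cat', 'cat', 'dog', 'dog'] + [""]
# Keep each element whose successor differs, i.e. the last element of every run.
res = [my_list[i] for i in range(len(my_list) - 1) if my_list[i + 1] != my_list[i]]
print(res)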
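
One caveat with the sentinel: if the list could itself end with an empty string, that last element would compare equal to the "" sentinel and be dropped. A possible variant that avoids the sentinel, sketched here with zip to compare each element with its predecessor (the names `prev` and `cur` are only illustrative):

```python
my_list = ['apple', 'apple', 'boy', 'cat', 'cat', 'apple', 'apple',
           'apple', 'boy', 'cat', 'cat', 'dog', 'dog']

# Always keep the first element (if any), then keep each element
# that differs from the one immediately before it.
res = my_list[:1] + [cur for prev, cur in zip(my_list, my_list[1:]) if cur != prev]
print(res)
```

This leaves my_list unchanged and also works for an empty list, since both the slice and the zip are then empty.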

answered Feb 05 '21 at 10:30 by dimay
