Compare each line and remove the repeated/same line having the same numbers in python

Asked Jan 20 '22 at 23:39

Active Jan 20 '22 at 23:54

Viewed 43 times

For example, if I have a txt file with this content:

12 15 33 40
9 23 44 45
22 26 76 45
9 23 44 45

I want to remove the repeated line, which in this case is 9 23 44 45, so that it appears only once instead of twice. That is, I want the output to be:

12 15 33 40
9 23 44 45
22 26 76 45

I tried this, but it didn't work:

from itertools import groupby
with open('result.txt', 'r') as infile:
    with open('final_result.txt', 'w') as outfile:
        # groupby only merges consecutive equal lines, so non-adjacent repeats remain
        for line, _ in groupby(infile):
            outfile.write(line)

Finally, this works:

lines_seen = set()  # holds lines already seen
with open('result.txt', 'r') as infile, open('final_result.txt', 'w') as outfile:
    for line in infile:
        if line not in lines_seen:  # not a duplicate
            outfile.write(line)
            lines_seen.add(line)
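
One caveat with the set-based approach above: it compares raw text, so a repeated row that differs only in spacing, or a last line with no trailing newline, would not be treated as a duplicate. Since the goal is to compare the numbers, a minimal sketch of a variant that compares the whitespace-separated fields instead might look like this. The names `unique_rows` and `dedupe_file` are made up for illustration, and skipping blank lines is an assumption about the desired behavior, not something the original code does.

```python
def unique_rows(lines):
    """Yield each distinct row once, in first-seen order (hypothetical helper)."""
    seen = set()
    for line in lines:
        # Compare by fields, so extra spaces and a missing newline do not matter.
        key = tuple(line.split())
        if not key or key in seen:  # assumption: skip blank lines and repeats
            continue
        seen.add(key)
        yield " ".join(key) + "\n"


def dedupe_file(src, dst):
    """Copy src to dst, keeping only the first occurrence of each row."""
    with open(src) as infile, open(dst, "w") as outfile:
        outfile.writelines(unique_rows(infile))


# In-memory demo: the last row repeats the second one with different spacing.
sample = ["12 15 33 40\n", "9 23 44 45\n", "22 26 76 45\n", "9  23 44 45"]
print("".join(unique_rows(sample)), end="")
```

With the files from the question, the call would be `dedupe_file('result.txt', 'final_result.txt')`. The fields are compared as strings, so `09` and `9` would still count as different; converting each field with `int` would be needed if that matters.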

edited Jan 20 '22 at 23:54

asked Jan 20 '22 at 23:39

uman laaamak

You already asked the exact same question earlier today, and it was closed as a duplicate. Come on... – Marco Bonelli Jan 20 '22 at 23:43
I just couldn't get the desired output. Sorry, I am a newbie in Python/programming. – uman laaamak Jan 20 '22 at 23:49
Never mind, I just got the required output; it was a silly mistake. Thank you for the help. – uman laaamak Jan 20 '22 at 23:52


0 Answers