Comparing characters in strings

Question

I'm trying to create a function that compares characters in the same position of two strings of same length and returns the count of their differences.

For instance,

a = "HORSE"
b = "TIGER"

And it would return 5 (as all characters in the same position are different)

Here's what I've been working on.

def Differences(one, two):
    difference = []
    for i in list(one):
        if list(one)[i] != list(two)[i]:
            difference = difference+1
    return difference

That gives an error "List indices must be integers not strings"

And so I've tried turning it to int by using int(ord(

def Differences(one, two):
    difference = 0
    for i in list(one):
        if int(ord(list(one)[i])) != int(ord(list(two)[i])):
            difference = difference+1
    return difference

Which also returns the same error.

When I print list(one)[1] != list(two)[1] it eithers returns True or False, as the comparison is correctly made.

Can you tell me how to correct my code for this purpose?

The reason you're getting errors is because you're iterating over the string with the for-loop. In Python, when you iterate over something (and- as an aside- you don't _need_ to convert strings to lists; strings are iterables by nature in python) you get each sub item in that item (as opposed to an index number). So you're getting `["H","O","R","S","E"]` as "i" values in you for-loop, which obviously aren't indexes (i.e.- 0,1,2,3,4). — Reid Ballard, Jun 22 '16 at 13:33

Reid Ballard · Answer 1 · 2016-06-23T04:58:23.340

I would probably just iterate over both of them at the same time with zip and a list comprehension and then take length of the list:

a='HORSE'
b='TIGER'


words=zip(a,b)
incorrect=len([c for c,d in words if c!=d])
print(incorrect)

Zipping pairs lists together index-for-index, stopping when one runs out. List comprehensions are generators that are basically compact for-statements that you can add logic to. So it basically reads: for each zipped pair of letters (c,d) if c!=d then put a into the list (so if the letters are different, we increase the list length by 1). Then we just take the length of the list which is all the letters that are positionally different.

If we consider missing letters to be different, then we can use itertools.zip_longest to fill out the rest of the word:

import itertools

a='HORSES'
b='TIG'

words=itertools.zip_longest(a,b,fillvalue=None)
incorrect=len([c for c,d in words if c!=d]) ## No changes here
print(incorrect)

Obviously, None will never equal a character, so the difference in length will be registered.

EDIT: This hasn't been mentioned, but if we want case-insensitivity, then you just run .lower() or .casefold() on the strings beforehand.

score 2 · Answer 2 · answered Jun 22 '16 at 04:30

2

sum([int(i!=j) for i,j in zip(a,b)]) would do the trick

answered Jun 22 '16 at 04:30

user3404344

1,707
15
13

this is nice and concise; just a heads up to future readers that this would lead to inaccuracies if the inputs are ever of different lengths. `zip` will not raise an Exception, either – Jordan Bonitatis Jun 22 '16 at 04:45

Jordan Bonitatis · Answer 3 · 2016-06-22T04:43:30.580

use zip to iterate over both strings consecutively

>>> def get_difference(str_a, str_b):
...     """
...     Traverse two strings of the same length and determine the number of 
...     indexes in which the characters differ
...     """
...
...     # confirm inputs are strings
...     if not all(isinstance(x, str) for x in (str_a, str_b)):
...         raise Exception("`difference` requires str inputs")
...     # confirm string lengths match
...     if len(str_a) != len(str_b):
...         raise Exception("`difference` requires both input strings to be of equal length")
...
...     # count the differences; this is the important bit
...     ret = 0
...     for i, j in zip(str_a, str_b):
...         if i != j:
...             ret += 1
...     return ret
... 
>>> difference('HORSE', 'TIGER')
5

also, the general style is to lower case function names (which are often verbs) and title case class names (which are often nouns) :)

score 0 · Answer 4 · answered Jun 22 '16 at 04:16

You could do something like this:

def getDifferences(a,b):
  count = 0

  for i in range(0, len(a)):
    if a[i] is not b[i]:
      count += 1

  return count

The only thing that you will have to implement yourself here is checking for the size of the strings. In my example, if a is larger than b, then there will be an IndexError.

score 0 · Answer 5 · answered Jun 22 '16 at 04:19

try this:

def Differences(one, two):
    if len(two) < len(one):
        one, two = two, one
    res = len(two) - len(one) 
    for i, chr in enumerate(one):
        res += two[i] != chr
    return res

it's important to make the first check of their size in case the second string is shorter than the first, so you don't get an IndexError

score 0 · Answer 6 · answered Jun 22 '16 at 04:20

As matters for complexity and runtime, calling list() each iteration is not efficient, since it splits the strings, allocates memory and on... The correct way to do it, is to iterate the index of the lists, than compare them by it, something like:

def str_compare(l1, l2):
   assert len(l1) == len(l2) , 'Lists must have the same size'        
   differ_cnt = 0
   for i in xrange(len(l1)):
       if l1[i] != l2[i]:
           differ_cnt += 1
   return differ_cnt

score 0 · Answer 7 · answered Jun 22 '16 at 04:21

There are a couple problems with:

def Differences(one, two):
    difference = []
    for i in list(one):
        if list(one)[i] != list(two)[i]:
            difference = difference+1
    return difference

Firstly list(one) is ['H', 'O', 'R', 'S', 'E'] when you call Differences(a, b) so you are iterating over strings not ints. Changing your code to:

for i in range(len(one)):

will iterate over the integers 0-4 which will work in your case only because a and b have the same length (you will need to come up with a better solution if you want to handle different length inputs).

Secondly you can't add to an array so you should change it be a int which you add to. The result would be:

def Differences(one, two):
    difference = 0
    for i in range(len(one)):
        if list(one)[i] != list(two)[i]:
            difference = difference+1
    return difference

If you were super keep to use an array you can however append to an array: difference.append(1) and then return the length of the array: return len(difference) but this would be inefficient for what you are trying to achieve.

score 0 · Answer 8 · answered Jun 22 '16 at 05:33

Find out for yourself

>>> a = "HORSE"
>>> list(a)
['H', 'O', 'R', 'S', 'E']
>>> list(a)[2]
'R'
>>> for i in list(a):
...     i
...
'H'
'O'
'R'
'S'
'E'
>>> list(a)['R']
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: list indices must be integers, not str

Good luck!

score 0 · Answer 9 · answered Jun 23 '16 at 13:33

a = "HORSE"
b = "TIGER"
a_list=[]
b_list=[]
for l in a_list:
    a_list.append(l)

for k in b_list:
    b_list.append(k)

difference = len(a)
for i in a_list:
    for x in b_list:
        if a_list[i] == b_list[x]:
            difference = difference - 1

print(difference)

See if this works :)

score 0 · Answer 10 · answered Feb 05 '18 at 05:05

0

That's too simple:

def Differences(one, two):
    difference = 0
    for char1, char2 in zip(one, two):
        if char1 != char2:
            difference += difference
    return difference

answered Feb 05 '18 at 05:05

Happy Ahmad

1,072
2
14
33

score 0 · Answer 11 · answered Feb 18 '20 at 13:16

0

a = ['a' , 'b'  , 'c' , 'd' , 'd' , 'c']
b = ['a' , 'b' , 'c' ]
index = []

if len(a) == len(b):
    for i in range(len(a)):
            if a[i]!=b[i]:
                index.append(a[i])
    if len(index) > 0:
        print("The array is not same")

    else:
        print("The array is same")

else:
    print("The array is not same")

answered Feb 18 '20 at 13:16

bhagyashree bhaya

1

Hi bhagyashree bhaya. You should add some lines of explanation alongside your code snippet to make it better understandable why this is a good answer to the question. Maybe also short noting why you choose this way to go. Best regards! – klaas Feb 19 '20 at 20:11

Comparing characters in strings

11 Answers11