Binary search in a Python list

Question

I am trying to perform a binary search on a list in python. List is created using command line arguments. User inputs the number he wants to look for in the array and he is returned the index of the element. For some reason, the program only outputs 1 and None. Code is below. Any help is extremely appreciated.

import sys

def search(list, target):
  min = 0
  max = len(list)-1
  avg = (min+max)/2
  while (min < max):
    if (list[avg] == target):
      return avg
    elif (list[avg] < target):
      return search(list[avg+1:], target)
    else:
      return search(list[:avg-1], target)

  print "The location of the number in the array is", avg

# The command line argument will create a list of strings                               
# This list cannot be used for numeric comparisions                                     
# This list has to be converted into a list of ints                                     
def main():

  number = input("Please enter a number you want to search in the array !")
  index = int(number)
  list = []
  for x in sys.argv[1:]:
    list.append(int(x))
  print "The list to search from", list

  print(search(list, index))

if __name__ == '__main__':
  main()

CL :
Anuvrats-MacBook-Air:Python anuvrattiku$ python binary_search.py 1 3 4 6 8 9 12 14 16 17 27 33 45 51 53 63 69 70
Please enter a number you want to search in the array !69
The list to search from [1, 3, 4, 6, 8, 9, 12, 14, 16, 17, 27, 33, 45, 51, 53, 63, 69, 70]
0
Anuvrats-MacBook-Air:Python anuvrattiku$

You do know, that binary search requires a sorted array/list to work? It's not the only wrong thing with your algorithm. And BTW, never ever call a variable `list` or any other built-in type or function. And why do you print inside a function that returns `arg`. It won't ever be printed. — Eli Korvigo, Jul 13 '16 at 08:12
Also there's binary search built in: https://docs.python.org/3/library/bisect.html — jonrsharpe, Jul 13 '16 at 08:14
@Eli Korvigo : Yes I am aware that it needs a sorted list. I have edited to display the command line arguments. I dont understand why it is not printing the index of the element. I tried commenting the print statement in the function but it still prints 0. That is what is not clear to me — Anuvrat Tiku, Jul 13 '16 at 08:22
@EliKorvigo : It is not a homework assignment sir. Just trying to do it the conventional way — Anuvrat Tiku, Jul 13 '16 at 08:24
Your algorithm is totally incorrect. You mixed a while loop with recursion.. You should only do one of them — SomethingSomething, Jul 13 '16 at 08:34
Don't use recursion. Your task is to determine the index of the original list that matches the input value, but when you call search recursively you pass it a slice of the original list. Index 0 of the slice is, in general, not index 0 of the original, so this can't possibly work. — Paul Cornelius, Jul 13 '16 at 08:36

score 25 · Answer 1 · edited Jun 18 '22 at 22:40

In Python2 and Python3 you can use bisect as written in the comments. Replace your search with the following

from bisect import bisect_left

def search(alist, item):
    'Locate the leftmost value exactly equal to item'
    i = bisect_left(alist, item)
    if i != len(alist) and alist[i] == item:
        return i
    raise ValueError

alist = [1,2,7,8,234,5,9,45,65,34,23,12]
x = 5
alist.sort() # bisect only works on sorted lists
print(search(alist, x)) # prints 2 as 5 is on position 2 in the sorted list

Also, the AS SortedCollection (Python recipe) could be useful.

The following code (from here) performs the binary search and returns position and if the item was found at all.

def binarySearch(alist, item):
    first = 0
    last = len(alist)-1
    found = False

    while first<=last and not found:
        pos = 0
        midpoint = (first + last)//2
        if alist[midpoint] == item:
            pos = midpoint
            found = True
        else:
            if item < alist[midpoint]:
                last = midpoint-1
            else:
                first = midpoint+1
    return (pos, found)

Will return (2, True) if used in the example above.

In the first code segment, don't you mean `print(search(alist,x))`? — fluffy, Sep 12 '18 at 23:23
what is 'a'? It's still 'a' and not 'alist', so it should be 'a'? — Rony Tesler, Nov 25 '21 at 19:39
I just corrected the code to read `alist` in the first snippet, as in `print(search(alist, x))`, as previous commenters indicated, rather than the previous (incorrect) `print(search(a, x))`. — Dan Nissenbaum, Jun 18 '22 at 22:41

score 7 · Accepted Answer · answered Jul 13 '16 at 09:05

Well, there are some little mistakes in your code. To find them, you should either use a debugger, or at least add traces to understand what happens. Here is your original code with traces that make the problems self evident:

def search(list, target):
  min = 0
  max = len(list)-1
  avg = (min+max)/2
  print list, target, avg
  ...

You can immediately see that:

you search in a sub array that skips avg-1 when you are below avg
as you search in a sub array you will get the index in that subarray

The fixes are now trivial:

elif (list[avg] < target):
      return avg + 1 + search(list[avg+1:], target)  # add the offset
    else:
      return search(list[:avg], target)  # sublist ends below the upper limit

That's not all, when you end the loop with min == max, you do not return anything (meaning you return None). And last but not least never use a name from the standard Python library for your own variables.

So here is the fixed code:

def search(lst, target):
  min = 0
  max = len(lst)-1
  avg = (min+max)/2
  # uncomment next line for traces
  # print lst, target, avg  
  while (min < max):
    if (lst[avg] == target):
      return avg
    elif (lst[avg] < target):
      return avg + 1 + search(lst[avg+1:], target)
    else:
      return search(lst[:avg], target)

  # avg may be a partial offset so no need to print it here
  # print "The location of the number in the array is", avg 
  return avg

could you explain your point of ending the loop with min==max a bit — Anuvrat Tiku, Jul 13 '16 at 09:36
Agreed with @JacobIRR, consider use low/high instead of min/max. — Madmint, Apr 17 '19 at 23:23

Sergey Bushmanov · Answer 3 · 2021-09-05T18:30:55.377

Recursive:

def in_list(l, x):
    if len(l) < 2:
        if l[0] == x:
            return True
        else:
            return False
    mid = len(l) // 2
    if x < l[mid]:
        return in_list(l[:mid], x)
    else:
        return in_list(l[mid:], x)

or iterative:

def in_list2(l, x):
    low = 0
    high = len(l) - 1
    while low <= high:
        mid = (low + high) // 2
        if l[mid] == x:
            return True
        if x < l[mid]:
            high = mid - 1
        else:
            low = mid + 1
    return False

score 1 · Answer 4 · answered Nov 29 '18 at 14:42

@Serge Ballesta 's solution is undoubtly the correct answer to this question.

I am just going to add another way of solving this:

def search(arr, item, start, end):

  if end-start == 1:
    if arr[start] == item:
        return start
    else:
        return -1;

  halfWay = int( (end-start) / 2)

  if arr[start+halfWay] > item:
    return search(arr, item, start, end-halfWay)
  else:
    return search(arr, item, start+halfWay, end)

def binarysearch(arr, item):
  return search(arr, item, 0, len(arr))

arr = [1, 3, 4, 6, 8, 9, 12, 14, 16, 17, 27, 33, 45, 51, 53, 63, 69, 70]

print("Index of 69: " + str(binarysearch(arr, 69))) # Outputs: 16

score 0 · Answer 5 · answered Jul 13 '16 at 08:39

0

The reason you aren't getting correct result is because in every recursive call your code is sending sliced array. So the array length keeps reducing. Ideally you should work out a way to send original array and work with only start, end indices.

answered Jul 13 '16 at 08:39

Kevin

901
1
7
15

Binary search in a Python list

5 Answers5

Linked