Finding the mode of a list

Question

Given a list of items, recall that the mode of the list is the item that occurs most often.

I would like to know how to create a function that can find the mode of a list but that displays a message if the list does not have a mode (e.g., all the items in the list only appear once). I want to make this function without importing any functions. I'm trying to make my own function from scratch.

Sorry, but can you explain what exactly you mean by 'mode of the list'? — Vikas, May 29 '12 at 11:02
@Vikas: the mode is the most frequently-occurring element (if any). Some definitions extend it to take the arithmetic mean of all such elements if there are more than one. — Jeremy Roman, May 29 '12 at 11:05
So many wrong answers here! For e.g `assert(mode[1, 1, 1]) == None` and `assert(mode[1, 2, 3, 4]) == None`. For a number to be a `mode`, it must occur more number of times than at least one other number in the list, and it must _not_ be the only number in the list. — lifebalance, Dec 08 '17 at 17:51

score 198 · Answer 1 · edited Jan 20 '20 at 16:10

198

You can use the max function and a key. Have a look at python max function using 'key' and lambda expression.

max(set(lst), key=lst.count)

edited Jan 20 '20 at 16:10

vvvvv

25,404
19
49
81

answered Jan 24 '15 at 20:08

David Dao

2,421
1
12
10

7

This is the correct answer to OP, considering it does not require any extra imports. Good job, David – Jason Parham Apr 15 '15 at 15:48
18

It seems to me that this would run in `O(n**2)`. Does it? – lirtosiast Sep 24 '15 at 04:57
11

This has quadratic runtime – Padraic Cunningham Nov 07 '15 at 13:56
Can someone explain the choice of key to me in this answer? The example keys in the link are all transformations whereas list.count should be a static numeric value right? – bhnn Apr 18 '17 at 15:18
@BenDot list.count is a function that returns the number of times an element occurs in the list in O(n) time. – Erik Dec 05 '17 at 22:34
24

Could also just use `max(lst, key=lst.count)`. (And I'd really not call a list `list`.) – Stefan Pochmann Dec 24 '17 at 13:48
error if lst is empty, but this can be easily solved. – Kardi Teknomo Dec 31 '18 at 09:43
2

Can anyone explain how this works for bi-modal distributions? e.g. `a = [22, 33, 11, 22, 11]; print(max(set(a), key=a.count))` returns `11`. Will it always return the minimum mode? And if so, why? – battey Jan 13 '19 at 22:54
1

@battey No, `max(set(a), key=a.count)` of `a = [33, 33, 22, 22, 11, 11]` returns 33, so your assumption is false.. – Mark Seliaev Dec 06 '19 at 20:36
1

@StefanPochmann you really want `max(set(lst), key=lst.count)`, the reason is that you only want to run list.count once for each unique element, without the `set` even though the output is correct, you end up re-running count for duplicate elements. – khuang834 Jul 01 '21 at 19:32
@khuang834. That's the least of your complexity problems – Mad Physicist Sep 11 '22 at 03:27

score 114 · Answer 2 · edited Sep 25 '12 at 11:08

114

You can use the Counter supplied in the collections package which has a mode-esque function

from collections import Counter
data = Counter(your_list_in_here)
data.most_common()   # Returns all unique items and their counts
data.most_common(1)  # Returns the highest occurring item

Note: Counter is new in python 2.7 and is not available in earlier versions.

edited Sep 25 '12 at 11:08

sharafjaffri

2,134
3
30
47

answered May 29 '12 at 11:07

Christian Witts

11,375
1
33
46

22

The question states that the user wants to make a function from scratch -- i.e., no imports. – abcd Mar 14 '15 at 02:37
5

Your last line returns a list containing a tuple containing a mode and its frequency. To get just a mode use `Counter(your_list_in_here).most_common(1)[0][0]`. If there is more than one mode this returns an arbitrary one. – Rory Daulton Apr 16 '17 at 12:20
1

Suppose there are `n` most common `modes`. If Counter(your_list_in_here).most_common(1)[0][0] gets you the first mode, how would you get another most common `mode` ? Just replace the last `0` with `1`? One can make a function to customize the `mode` to their liking.. – Apr 17 '17 at 12:07
2

if there is more than one mode, how can I return the largest of these numbers? – Akin Hwan Dec 31 '18 at 19:56
If you just want the highest occuring item, it should be `data.most_common(1)[0][0]`. – Stef Mar 29 '22 at 09:26
1

@AkinHwan `max(data.items(), key=lambda x: (x[1], x[0]))` or with `from operator import itemgetter`, you can rewrite this `max(data.items(), key=itemgetter(1,0))` – Stef Mar 29 '22 at 09:27

score 76 · Answer 3 · answered Mar 17 '14 at 15:19

76

Python 3.4 includes the method statistics.mode, so it is straightforward:

>>> from statistics import mode
>>> mode([1, 1, 2, 3, 3, 3, 3, 4])
 3

You can have any type of elements in the list, not just numeric:

>>> mode(["red", "blue", "blue", "red", "green", "red", "red"])
 'red'

answered Mar 17 '14 at 15:19

jabaldonedo

25,822
8
77
77

21

Throws error on using mode([1, 1,1,1, 2, 3, 3, 3, 3, 4]) where 1 and 3 repeat equal number of time. Ideally, should return smallest of the number which largest but equal number of times. StatisticsError: no unique mode; found 2 equally common values – aman_novice Apr 01 '16 at 19:36
4

Haven't used this 3.4 statistics package, but scipy.stats.mode will return the smallest, in this case 1. I would, however, prefer the throw of the error in certain cases... – wellplayed Jan 20 '17 at 15:51
3

@aman_novice, the issue was solved in Python 3.8. https://docs.python.org/3/library/statistics.html#statistics.mode – Michael D Dec 26 '19 at 12:40
3

python 3.8 also added [`multimode`](https://docs.python.org/3/library/statistics.html#statistics.multimode), which returns multiple modes when there is more than one. – stason Mar 12 '20 at 23:08

score 34 · Answer 4 · answered May 29 '12 at 11:12

34

Taking a leaf from some statistics software, namely SciPy and MATLAB, these just return the smallest most common value, so if two values occur equally often, the smallest of these are returned. Hopefully an example will help:

>>> from scipy.stats import mode

>>> mode([1, 2, 3, 4, 5])
(array([ 1.]), array([ 1.]))

>>> mode([1, 2, 2, 3, 3, 4, 5])
(array([ 2.]), array([ 2.]))

>>> mode([1, 2, 2, -3, -3, 4, 5])
(array([-3.]), array([ 2.]))

Is there any reason why you can 't follow this convention?

answered May 29 '12 at 11:12

Chris

44,602
16
137
156

4

Why only the smallest mode is returned when there are multiple? – zyxue Mar 16 '17 at 14:49
@zyxue simple statistical convention – chrisfs Apr 23 '18 at 05:03
2

@chrisfs and to make it return the largest mode if there are multiple? – Akin Hwan Dec 31 '18 at 19:57

score 30 · Answer 5 · answered Aug 10 '15 at 03:26

30

There are many simple ways to find the mode of a list in Python such as:

import statistics
statistics.mode([1,2,3,3])
>>> 3

Or, you could find the max by its count

max(array, key = array.count)

The problem with those two methods are that they don't work with multiple modes. The first returns an error, while the second returns the first mode.

In order to find the modes of a set, you could use this function:

def mode(array):
    most = max(list(map(array.count, array)))
    return list(set(filter(lambda x: array.count(x) == most, array)))

answered Aug 10 '15 at 03:26

mathwizurd

1,387
1
13
15

3

Using the mode, gives error when there are two elements occur same amount of time. – Abhishek Mishra Dec 14 '17 at 20:47
Sorry, saw this comment really late. Statistics.mode(array) would return an error with multiple modes, but none of the other methods do. – mathwizurd May 09 '19 at 19:30
(1) It is described as a data points with the highest frequency (2) There can be multiple modes in a dataset (3) In case continuous values, it might not be a possible to find the mode (since all the values will be unique) (4) It can be used over non-numeric data as well – thrinadhn Jan 29 '21 at 05:39

score 7 · Answer 6 · answered Dec 31 '18 at 09:42

7

Extending the Community answer that will not work when the list is empty, here is working code for mode:

def mode(arr):
        if arr==[]:
            return None
        else:
            return max(set(arr), key=arr.count)

answered Dec 31 '18 at 09:42

Kardi Teknomo

1,375
16
24

score 5 · Answer 7 · edited Feb 22 '18 at 19:52

In case you are interested in either the smallest, largest or all modes:

def get_small_mode(numbers, out_mode):
    counts = {k:numbers.count(k) for k in set(numbers)}
    modes = sorted(dict(filter(lambda x: x[1] == max(counts.values()), counts.items())).keys())
    if out_mode=='smallest':
        return modes[0]
    elif out_mode=='largest':
        return modes[-1]
    else:
        return modes

score 3 · Answer 8 · answered Feb 09 '16 at 21:56

A little longer, but can have multiple modes and can get string with most counts or mix of datatypes.

def getmode(inplist):
    '''with list of items as input, returns mode
    '''
    dictofcounts = {}
    listofcounts = []
    for i in inplist:
        countofi = inplist.count(i) # count items for each item in list
        listofcounts.append(countofi) # add counts to list
        dictofcounts[i]=countofi # add counts and item in dict to get later
    maxcount = max(listofcounts) # get max count of items
    if maxcount ==1:
        print "There is no mode for this dataset, values occur only once"
    else:
        modelist = [] # if more than one mode, add to list to print out
        for key, item in dictofcounts.iteritems():
            if item ==maxcount: # get item from original list with most counts
                modelist.append(str(key))
        print "The mode(s) are:",' and '.join(modelist)
        return modelist

shubh · Answer 9 · 2020-12-08T08:07:55.250

Mode of a data set is/are the member(s) that occur(s) most frequently in the set. If there are two members that appear most often with same number of times, then the data has two modes. This is called bimodal.

If there are more than 2 modes, then the data would be called multimodal. If all the members in the data set appear the same number of times, then the data set has no mode at all.

Following function modes() can work to find mode(s) in a given list of data:

import numpy as np; import pandas as pd

def modes(arr):
    df = pd.DataFrame(arr, columns=['Values'])
    dat = pd.crosstab(df['Values'], columns=['Freq'])
    if len(np.unique((dat['Freq']))) > 1:
        mode = list(dat.index[np.array(dat['Freq'] == max(dat['Freq']))])
        return mode
    else:
        print("There is NO mode in the data set")

Output:

# For a list of numbers in x as
In [1]: x = [2, 3, 4, 5, 7, 9, 8, 12, 2, 1, 1, 1, 3, 3, 2, 6, 12, 3, 7, 8, 9, 7, 12, 10, 10, 11, 12, 2]
In [2]: modes(x)
Out[2]: [2, 3, 12]
# For a list of repeated numbers in y as
In [3]: y = [2, 2, 3, 3, 4, 4, 10, 10]
In [4]: modes(y)
Out[4]: There is NO mode in the data set
# For a list of strings/characters in z as
In [5]: z = ['a', 'b', 'b', 'b', 'e', 'e', 'e', 'd', 'g', 'g', 'c', 'g', 'g', 'a', 'a', 'c', 'a']
In [6]: modes(z)
Out[6]: ['a', 'g']

If we do not want to import numpy or pandas to call any function from these packages, then to get this same output, modes() function can be written as:

def modes(arr):
    cnt = []
    for i in arr:
        cnt.append(arr.count(i))
    uniq_cnt = []
    for i in cnt:
        if i not in uniq_cnt:
            uniq_cnt.append(i)
    if len(uniq_cnt) > 1:
        m = []
        for i in list(range(len(cnt))):
            if cnt[i] == max(uniq_cnt):
                m.append(arr[i])
        mode = []
        for i in m:
            if i not in mode:
                mode.append(i)
        return mode
    else:
        print("There is NO mode in the data set")

score 2 · Answer 10 · answered Dec 26 '13 at 21:22

I wrote up this handy function to find the mode.

def mode(nums):
    corresponding={}
    occurances=[]
    for i in nums:
            count = nums.count(i)
            corresponding.update({i:count})

    for i in corresponding:
            freq=corresponding[i]
            occurances.append(freq)

    maxFreq=max(occurances)

    keys=corresponding.keys()
    values=corresponding.values()

    index_v = values.index(maxFreq)
    global mode
    mode = keys[index_v]
    return mode

This method will fail if 2 items have same no. of occurences. — akshaynagpal, Dec 13 '14 at 13:38

score 2 · Answer 11 · answered Aug 18 '14 at 14:03

Short, but somehow ugly:

def mode(arr) :
    m = max([arr.count(a) for a in arr])
    return [x for x in arr if arr.count(x) == m][0] if m>1 else None

Using a dictionary, slightly less ugly:

def mode(arr) :
    f = {}
    for a in arr : f[a] = f.get(a,0)+1
    m = max(f.values())
    t = [(x,f[x]) for x in f if f[x]==m]
    return m > 1 t[0][0] else None

score 2 · Answer 12 · answered Dec 30 '14 at 23:39

This function returns the mode or modes of a function no matter how many, as well as the frequency of the mode or modes in the dataset. If there is no mode (ie. all items occur only once), the function returns an error string. This is similar to A_nagpal's function above but is, in my humble opinion, more complete, and I think it's easier to understand for any Python novices (such as yours truly) reading this question to understand.

 def l_mode(list_in):
    count_dict = {}
    for e in (list_in):   
        count = list_in.count(e)
        if e not in count_dict.keys():
            count_dict[e] = count
    max_count = 0 
    for key in count_dict: 
        if count_dict[key] >= max_count:
            max_count = count_dict[key]
    corr_keys = [] 
    for corr_key, count_value in count_dict.items():
        if count_dict[corr_key] == max_count:
            corr_keys.append(corr_key)
    if max_count == 1 and len(count_dict) != 1: 
        return 'There is no mode for this data set. All values occur only once.'
    else: 
        corr_keys = sorted(corr_keys)
        return corr_keys, max_count

I say this only because you said "the function returns an error string." The line that reads `return 'There is no mode for this data set. All values occur only once.'` can be turned into an error message with `traceback` as `if condition: *next line with indent* raise ValueError('There is no mode for this data set. All values occur only once.') [Here is a list](https://docs.python.org/3/tutorial/errors.html) of different types of errors you can raise. — , Apr 17 '17 at 12:14

lifebalance · Answer 13 · 2017-12-08T19:21:48.807

For a number to be a mode, it must occur more number of times than at least one other number in the list, and it must not be the only number in the list. So, I refactored @mathwizurd's answer (to use the difference method) as follows:

def mode(array):
    '''
    returns a set containing valid modes
    returns a message if no valid mode exists
      - when all numbers occur the same number of times
      - when only one number occurs in the list 
      - when no number occurs in the list 
    '''
    most = max(map(array.count, array)) if array else None
    mset = set(filter(lambda x: array.count(x) == most, array))
    return mset if set(array) - mset else "list does not have a mode!"

These tests pass successfully:

mode([]) == None 
mode([1]) == None
mode([1, 1]) == None 
mode([1, 1, 2, 2]) == None

score 2 · Answer 14 · answered Apr 13 '19 at 19:34

Here is how you can find mean,median and mode of a list:

import numpy as np
from scipy import stats

#to take input
size = int(input())
numbers = list(map(int, input().split()))

print(np.mean(numbers))
print(np.median(numbers))
print(int(stats.mode(numbers)[0]))

score 2 · Answer 15 · answered Apr 20 '20 at 14:20

Simple code that finds the mode of the list without any imports:

nums = #your_list_goes_here
nums.sort()
counts = dict()
for i in nums:
    counts[i] = counts.get(i, 0) + 1
mode = max(counts, key=counts.get)

In case of multiple modes, it should return the minimum node.

score 1 · Answer 16 · answered May 29 '12 at 23:32

Why not just

def print_mode (thelist):
  counts = {}
  for item in thelist:
    counts [item] = counts.get (item, 0) + 1
  maxcount = 0
  maxitem = None
  for k, v in counts.items ():
    if v > maxcount:
      maxitem = k
      maxcount = v
  if maxcount == 1:
    print "All values only appear once"
  elif counts.values().count (maxcount) > 1:
    print "List has multiple modes"
  else:
    print "Mode of list:", maxitem

This doesn't have a few error checks that it should have, but it will find the mode without importing any functions and will print a message if all values appear only once. It will also detect multiple items sharing the same maximum count, although it wasn't clear if you wanted that.

So what im trying to do is to detect multiple items displaying the same count and then displaying all the items with that same count — bluelantern, May 30 '12 at 00:43
Have you actually tried this yourself? The extension from my code here to have it print all items with the same count is fairly straightforward. — lxop, May 31 '12 at 00:32

score 1 · Answer 17 · answered Apr 03 '17 at 14:03

This will return all modes:

def mode(numbers)
    largestCount = 0
    modes = []
    for x in numbers:
        if x in modes:
            continue
        count = numbers.count(x)
        if count > largestCount:
            del modes[:]
            modes.append(x)
            largestCount = count
        elif count == largestCount:
            modes.append(x)
    return modes

score 1 · Answer 18 · answered Dec 09 '19 at 16:10

1

For those looking for the minimum mode, e.g:case of bi-modal distribution, using numpy.

import numpy as np
mode = np.argmax(np.bincount(your_list))

answered Dec 09 '19 at 16:10

V3K3R

164
1
5

score 1 · Answer 19 · answered Jul 24 '21 at 10:00

Okey! So community has already a lot of answers and some of them used another function and you don't want.
let we create our very simple and easily understandable function.

import numpy as np

#Declare Function Name
def calculate_mode(lst):

Next step is to find Unique elements in list and thier respective frequency.

unique_elements,freq = np.unique(lst, return_counts=True)

Get mode

max_freq = np.max(freq)   #maximum frequency
mode_index = np.where(freq==max_freq)  #max freq index
mode = unique_elements[mode_index]   #get mode by index
return mode

Example

lst =np.array([1,1,2,3,4,4,4,5,6])
print(calculate_mode(lst))
>>> Output [4]

score 1 · Answer 20 · answered Apr 22 '22 at 21:16

How my brain decided to do it completely from scratch. Efficient and concise :) (jk lol)

import random

def removeDuplicates(arr):
    dupFlag = False

    for i in range(len(arr)):
        #check if we found a dup, if so, stop
        if dupFlag:
            break

        for j in range(len(arr)):
            if ((arr[i] == arr[j]) and (i != j)):
                arr.remove(arr[j])
                dupFlag = True
                break;

    #if there was a duplicate repeat the process, this is so we can account for the changing length of the arr
    if (dupFlag):
        removeDuplicates(arr)
    else:
        #if no duplicates return the arr
        return arr

#currently returns modes and all there occurences... Need to handle dupes
def mode(arr):
    numCounts = []

    #init numCounts
    for i in range(len(arr)):
        numCounts += [0]

    for i in range(len(arr)):
        count = 1
        for j in range(len(arr)):
            if (arr[i] == arr[j] and i != j):
                count += 1
        #add the count for that number to the corresponding index
        numCounts[i] = count

    #find which has the greatest number of occurences
    greatestNum = 0
    for i in range(len(numCounts)):
        if (numCounts[i] > greatestNum):
            greatestNum = numCounts[i]

    #finally return the mode(s)
    modes = []
    for i in range(len(numCounts)):
        if numCounts[i] == greatestNum:
            modes += [arr[i]]
    
    #remove duplicates (using aliasing)
    print("modes: ", modes)
    removeDuplicates(modes)
    print("modes after removing duplicates: ", modes)
    
    return modes


def initArr(n):
    arr = []
    for i in range(n):
        arr += [random.randrange(0, n)]
    return arr

#initialize an array of random ints
arr = initArr(1000)
print(arr)
print("_______________________________________________")

modes = mode(arr)

#print result
print("Mode is: ", modes) if (len(modes) == 1) else print("Modes are: ", modes)

akshaynagpal · Answer 21 · 2014-12-13T12:52:54.620

def mode(inp_list):
    sort_list = sorted(inp_list)
    dict1 = {}
    for i in sort_list:        
            count = sort_list.count(i)
            if i not in dict1.keys():
                dict1[i] = count

    maximum = 0 #no. of occurences
    max_key = -1 #element having the most occurences

    for key in dict1:
        if(dict1[key]>maximum):
            maximum = dict1[key]
            max_key = key 
        elif(dict1[key]==maximum):
            if(key<max_key):
                maximum = dict1[key]
                max_key = key

    return max_key

score 0 · Answer 22 · answered Apr 09 '15 at 07:16

def mode(data):
    lst =[]
    hgh=0
    for i in range(len(data)):
        lst.append(data.count(data[i]))
    m= max(lst)
    ml = [x for x in data if data.count(x)==m ] #to find most frequent values
    mode = []
    for x in ml: #to remove duplicates of mode
        if x not in mode:
        mode.append(x)
    return mode
print mode([1,2,2,2,2,7,7,5,5,5,5])

score 0 · Answer 23 · answered Feb 14 '16 at 01:42

Here is a simple function that gets the first mode that occurs in a list. It makes a dictionary with the list elements as keys and number of occurrences and then reads the dict values to get the mode.

def findMode(readList):
    numCount={}
    highestNum=0
    for i in readList:
        if i in numCount.keys(): numCount[i] += 1
        else: numCount[i] = 1
    for i in numCount.keys():
        if numCount[i] > highestNum:
            highestNum=numCount[i]
            mode=i
    if highestNum != 1: print(mode)
    elif highestNum == 1: print("All elements of list appear once.")

score 0 · Answer 24 · edited Jan 18 '18 at 16:22

If you want a clear approach, useful for classroom and only using lists and dictionaries by comprehension, you can do:

def mode(my_list):
    # Form a new list with the unique elements
    unique_list = sorted(list(set(my_list)))
    # Create a comprehensive dictionary with the uniques and their count
    appearance = {a:my_list.count(a) for a in unique_list} 
    # Calculate max number of appearances
    max_app = max(appearance.values())
    # Return the elements of the dictionary that appear that # of times
    return {k: v for k, v in appearance.items() if v == max_app}

score 0 · Answer 25 · 2018-03-29T17:44:13.000

0

#function to find mode
def mode(data):  
    modecnt=0
#for count of number appearing
    for i in range(len(data)):
        icount=data.count(data[i])
#for storing count of each number in list will be stored
        if icount>modecnt:
#the loop activates if current count if greater than the previous count 
            mode=data[i]
#here the mode of number is stored 
            modecnt=icount
#count of the appearance of number is stored
    return mode
print mode(data1)

edited Mar 29 '18 at 17:44

answered Mar 29 '18 at 15:06

You should explain your answer with comments or more details – Michael Mar 29 '18 at 15:30

score 0 · Answer 26 · answered Aug 26 '19 at 07:43

import numpy as np
def get_mode(xs):
    values, counts = np.unique(xs, return_counts=True)
    max_count_index = np.argmax(counts) #return the index with max value counts
    return values[max_count_index]
print(get_mode([1,7,2,5,3,3,8,3,2]))

score 0 · Answer 27 · answered Feb 20 '21 at 17:22

Perhaps try the following. It is O(n) and returns a list of floats (or ints). It is thoroughly, automatically tested. It uses collections.defaultdict, but I'd like to think you're not opposed to using that. It can also be found at https://stromberg.dnsalias.org/~strombrg/stddev.html

def compute_mode(list_: typing.List[float]) -> typing.List[float]:
    """                       
    Compute the mode of list_.

    Note that the return value is a list, because sometimes there is a tie for "most common value".
                                                                        
    See https://stackoverflow.com/questions/10797819/finding-the-mode-of-a-list
    """                                                                                                        
    if not list_:
        raise ValueError('Empty list')
    if len(list_) == 1:           
        raise ValueError('Single-element list')
    value_to_count_dict: typing.DefaultDict[float, int] = collections.defaultdict(int)
    for element in list_:
        value_to_count_dict[element] += 1
    count_to_values_dict = collections.defaultdict(list)
    for value, count in value_to_count_dict.items():   
        count_to_values_dict[count].append(value)                           
    counts = list(count_to_values_dict)
    if len(counts) == 1:                                                                            
        raise ValueError('All elements in list are the same')          
    maximum_occurrence_count = max(counts)
    if maximum_occurrence_count == 1:
        raise ValueError('No element occurs more than once')
    minimum_occurrence_count = min(counts)
    if maximum_occurrence_count <= minimum_occurrence_count:
        raise ValueError('Maximum count not greater than minimum count')
    return count_to_values_dict[maximum_occurrence_count]

Finding the mode of a list

27 Answers27

Linked

Related