Pandas: How do I get a column of group counts that fills in each row of the group?

Question

I can successfully fill my new column with group counts, but I suspect there is a simpler way:

# How do I simplify this?

def f(gr):
    return pd.Series([gr['class_name'].count()] * gr.shape[0], index=gr.index)

df['class_size'] = df.groupby("class_name").apply(f).reset_index(level=0, drop=True)
column_list = ['class_name', 'class_size']
df[column_list].head(5)

Gets:

score 1 · Accepted Answer · edited Sep 27 '17 at 16:45


I think you need transform:

df['class_size'] = df.groupby('class_name')['class_name'].transform('size')

Or:

df['class_size'] = df.groupby('class_name')['class_name'].transform('count')

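For the difference between the two, see "What is the difference between size and count in pandas?": size counts every row in the group, while count counts only the non-NaN values.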
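To make the size/count distinction concrete, here is a minimal sketch. The DataFrame and the score and scores_present columns are invented for illustration and are not part of the original question; only class_name and class_size come from it.

```python
import numpy as np
import pandas as pd

# Hypothetical data: two classes, with some missing scores.
df = pd.DataFrame({
    "class_name": ["math", "math", "art", "art", "art"],
    "score": [90.0, np.nan, 75.0, 80.0, np.nan],
})

by_class = df.groupby("class_name")["score"]

# 'size' counts every row in the group, NaN included.
df["class_size"] = by_class.transform("size")

# 'count' counts only the non-NaN values in the group.
df["scores_present"] = by_class.transform("count")

print(df)
```

When the counted column has no missing values, as with class_name itself in the answer above, both calls should give the same result.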

edited Sep 27 '17 at 16:45 by Graham

answered May 06 '17 at 15:59 by jezrael

Works great - thanx! – Dave Babbitt May 06 '17 at 16:01
Glad I could help. I also added the differences between size and count to the answer. Nice day! – jezrael May 06 '17 at 16:03

score 0 · Answer 2 · answered May 06 '17 at 16:07


Depending on your DataFrame's shape, you can also just do a count on the groupby. Note that this returns one row per group rather than one value per original row:

import pandas as pd
df = pd.DataFrame({'class names':list('abracadabra'),'class count':1})
df.groupby('class names').count().reset_index()
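If a per-row column is still wanted, one possible way to get there from per-group counts is to map them back onto the original rows. This is a sketch of that idea, not part of the original answer; the group_counts variable and the 'class size' column name are made up for illustration.

```python
import pandas as pd

df = pd.DataFrame({"class names": list("abracadabra")})

# One count per group, indexed by the group key.
group_counts = df.groupby("class names").size()

# Look up each row's group key in the per-group counts.
df["class size"] = df["class names"].map(group_counts)

print(df)
```

This should give the same per-row values as transform('size'), at the cost of an extra intermediate Series.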

answered May 06 '17 at 16:07 by Sebastiaan
