How do I fill one column with how many times the variable in the other column has appeared?

Question

I have a dataset with two variables, A and B. Variable A is a list of numbers, some of which repeat. I want to fill variable B in each row with the number N, where N is the number of times that row's value of A has appeared so far.

This is the dataframe I have:

Here is how I want the output to be:

see cumcount: `df['B'] = df.groupby('A').cumcount() + 1` – anky, Jul 29 '19 at 10:34

score -1 · Answer 1 · answered Jul 29 '19 at 10:38

You can do this with groupby and cumcount:

df['B'] = df.groupby('A').cumcount() + 1  # +1 because cumcount() numbers each group's rows starting at 0

reference : pandas.core.groupby.GroupBy.cumcount
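
As a minimal sketch of the one-liner above: the question's original table is not shown here, so the values of A below are made up purely for illustration. Only the column names A and B come from the question.

```python
import pandas as pd

# Hypothetical sample data; the question's actual table is not available.
df = pd.DataFrame({"A": [1, 1, 2, 1, 3, 2]})

# cumcount() numbers the rows within each group of A starting at 0,
# so adding 1 gives how many times that value has appeared so far.
df["B"] = df.groupby("A").cumcount() + 1

print(df)
```

With this sample data, B should come out as 1, 2, 1, 3, 1, 2: each value of A gets its own running count, in the original row order.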

Sundeep Pidugu

