group by multiple columns and get a sum and count

Question

I'm trying to create a single data frame where can be visualized the 5 frequent ban characters by year, season and league. My initial df looks like this:

    League  Year    Season  ban_1   ban_2   ban_3   ban_4   ban_5
0   NALCS   2015    Spring  Rumble  Kassadin Lissandra NaN NaN
1   NALCS   2015    Spring  Tristana Leblanc Nidalee NaN NaN
2   NALCS   2015    Spring  Kassadin Sivir  Lissandra NaN NaN
3   NALCS   2015    Spring  RekSai  Janna   Leblanc NaN NaN
4   NALCS   2015    Spring  JarvanIV Lissandra Kassadin NaN NaN

and I want it to look something like this at the end:

Year    Season  League  Top 5 bans
2015    Spring  EULCS   [(Zed, 49), (Kassadin, 39), (Cassiopeia, 34), (RekSai, 33), (Nidalee, 30)]

At this point I've been trying to make it any sense so I tried this:

bans_info.groupby(['Year','Season', 'League', 'ban_1', 'ban_2', 'ban_3', 'ban_4', 'ban_5',]).sum()

and this:

bans_info.groupby(['Year', 'Season', 'League']).ban_1.value_counts() but still don't get it at the end I tried to make it separately but it becomes too messy

b1 = bans_info.groupby(['Year', 'Season', 'League']).ban_1.value_counts()
b2 = bans_info.groupby(['Year', 'Season', 'League']).ban_2.value_counts()
b12 = pd.merge(b1, b2, how='outer', on ='Year')

There are already many threads on SO about group by. What have you tried so far? What exactly are you struggling with? — pnovotnyq, Oct 13 '19 at 21:30

score 1 · Answer 1 · answered Oct 13 '19 at 22:02

1

You need to use .agg and then pass in a dictionary of column names & functions.

You can find more detail here.

answered Oct 13 '19 at 22:02

Peritract

761
5
13

group by multiple columns and get a sum and count

1 Answers1