pandas group by ALL functionality?

Question

I'm using the pandas groupby + agg functionality to generate nice reports:

aggs_dict = {'a':['mean', 'std'], 'b': 'size'}
df.groupby('year').agg(aggs_dict)

I would like to apply the same aggs_dict to the entire dataframe as a single group, without splitting by year, something like:

df.groupall().agg(aggs_dict)

or:

df.agg(aggs_dict)

But I couldn't find an elegant way to do it. Note that in my real code aggs_dict is quite complex, so it's rather cumbersome to write:

df.a.mean()
df.a.std()
df.b.size
....

Am I missing something simple and nice?

@ayhan IIUC, it's the opposite - if the entire index would be one big duplicate, that would work here. The question is about an aggregation for the entire df as a group, not for each of the rows. — Ami Tavory, Sep 07 '16 at 08:41

score 27 · Answer 1 · answered Oct 05 '17 at 10:30

You could also pass a function to group on directly:

 df.groupby(lambda x: True).agg(aggs_dict)
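A minimal sketch of how this behaves, assuming made-up sample data (only the column names come from the question). pandas calls the function on each index label, so a function that always returns True places every row in one group:

```python
import pandas as pd

# Made-up sample data; only the column names follow the question.
df = pd.DataFrame({
    'year': [2015, 2015, 2016, 2016],
    'a': [1.0, 2.0, 3.0, 4.0],
    'b': [10, 20, 30, 40],
})
aggs_dict = {'a': ['mean', 'std'], 'b': 'size'}

# Per-year report, as in the question.
per_year = df.groupby('year').agg(aggs_dict)

# The function is called with each index label and always returns True,
# so all rows end up in the same single group.
overall = df.groupby(lambda x: True).agg(aggs_dict)

print(per_year)
print(overall)
```

The overall result has a single row labelled True and the same columns as the per-year report.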

answered Oct 05 '17 at 10:30

Hervé Mignot

I think this answer is much better than the one provided earlier by @bunji (below) – Anugraha Sinha Jul 06 '22 at 06:44

score 9 · Accepted Answer · answered Sep 07 '16 at 12:45

Ami Tavory's answer is a great way to do it, but in case you want a solution that doesn't require creating new columns and deleting them afterwards, you could do something like:

df.groupby([True]*len(df)).agg(aggs_dict)
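A small sketch of this approach on made-up data (only the column names come from the question). It also shows a variation that is not part of this answer: passing a NumPy boolean array as the key, which may be useful if you want to avoid any ambiguity between a list of values and a list of column labels.

```python
import numpy as np
import pandas as pd

# Made-up sample data; only the column names follow the question.
df = pd.DataFrame({
    'year': [2015, 2016, 2016],
    'a': [2.0, 4.0, 6.0],
    'b': [10, 20, 30],
})
aggs_dict = {'a': ['mean', 'std'], 'b': 'size'}

# One key per row, all equal, so the whole frame forms a single group.
result_list = df.groupby([True] * len(df)).agg(aggs_dict)

# Variation (an assumption, not from the answer): an explicit boolean array.
result_array = df.groupby(np.ones(len(df), dtype=bool)).agg(aggs_dict)

print(result_list)
print(result_array)
```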

answered Sep 07 '16 at 12:45

bunji

wow! exactly what I wanted (cumbersome notation, but I'm used to that from pandas :) – ihadanny Sep 07 '16 at 19:32
Nice answer! – Ami Tavory Sep 07 '16 at 20:42

score 6 · Answer 3 · edited May 23 '17 at 10:29

You could add a dummy column:

df['dummy'] = 1

Then groupby + agg on it:

df.groupby('dummy').agg(aggs_dict)

and then delete it when you're done.
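The steps above can be sketched on made-up data as follows (only the column names come from the question). As a variation that is not part of this answer, DataFrame.assign returns a copy with the extra column, so the original frame is left untouched and there is nothing to delete afterwards.

```python
import pandas as pd

# Made-up sample data; only the column names follow the question.
df = pd.DataFrame({
    'year': [2015, 2015, 2016, 2016],
    'a': [1.0, 3.0, 5.0, 7.0],
    'b': [10, 20, 30, 40],
})
aggs_dict = {'a': ['mean', 'std'], 'b': 'size'}

# Steps from the answer: add the column, aggregate on it, then remove it.
df['dummy'] = 1
report = df.groupby('dummy').agg(aggs_dict)
del df['dummy']

# Variation (an assumption, not from the answer): assign() works on a copy,
# so df itself never gains the dummy column.
report_copy = df.assign(dummy=1).groupby('dummy').agg(aggs_dict)

print(report)
print(report_copy)
```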

edited May 23 '17 at 10:29

Community

answered Sep 07 '16 at 08:39

Ami Tavory

