Pandas - Calculate aggregate value for last 1 year

Question

I have a dataset in this form:

Customer_key    Issue_dt      Amount
45435           2021-03-19    566
64352           2021-06-22    843
43766           2020-04-29    754
45435           2021-06-21    547

There are many repeated Customer_key values with different Issue_dt values. I want to group by Customer_key and get the total Amount for the year 2021 only. Can someone please suggest how to do that?

If you mean filtering by `2021`, then closing as a duplicate is correct (100% match). If you need the previous year relative to the current date, it is not a dupe, only similar (70% in my opinion). Can you specify which? – jezrael, Feb 01 '22 at 09:18

jezrael · Answer 1 · 2022-02-01T09:14:25.957

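To filter by year, use boolean indexing and then aggregate with sum. This assumes `Issue_dt` has a datetime dtype; if it is stored as strings, convert it first with `pd.to_datetime`:

df['Issue_dt'] = pd.to_datetime(df['Issue_dt'])
df[df['Issue_dt'].dt.year == 2021].groupby('Customer_key', as_index=False)['Amount'].sum()

For a dynamic solution, take the current year and subtract 1:

y = pd.to_datetime('now').year - 1
df[df['Issue_dt'].dt.year == y].groupby('Customer_key', as_index=False)['Amount'].sum()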
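
As a self-contained check, the two variants above can be sketched against the sample rows from the question. The helper name `total_for_year` is hypothetical and only there to avoid repeating the filter; the column names are taken from the question's table.

```python
import pandas as pd

# Sample rows copied from the question.
df = pd.DataFrame({
    "Customer_key": [45435, 64352, 43766, 45435],
    "Issue_dt": ["2021-03-19", "2021-06-22", "2020-04-29", "2021-06-21"],
    "Amount": [566, 843, 754, 547],
})

# The .dt accessor needs a datetime column, so convert the strings first.
df["Issue_dt"] = pd.to_datetime(df["Issue_dt"])


def total_for_year(frame, year):
    """Sum Amount per Customer_key for rows whose Issue_dt falls in `year`.

    Hypothetical helper, not part of pandas.
    """
    mask = frame["Issue_dt"].dt.year == year
    return frame[mask].groupby("Customer_key", as_index=False)["Amount"].sum()


# Fixed year, as asked in the question.
totals_2021 = total_for_year(df, 2021)

# Dynamic variant: the calendar year before the current one.
previous_year = pd.Timestamp.now().year - 1
totals_previous = total_for_year(df, previous_year)

print(totals_2021)
```

With the sample data, customer 45435 has two rows in 2021 (566 and 547), so its total is 1113, and customer 64352 has 843. The 2020 row for 43766 is filtered out.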

edited Feb 01 '22 at 09:14

answered Feb 01 '22 at 09:07

jezrael

keramat · Answer 2 · 2022-02-01T12:01:17.713

Use:

import pandas as pd

df = pd.DataFrame({'Customer_key':[45435,64352,43766,45435], 'Issue_dt': ['2021-03-19','2021-06-22','2020-04-29','2021-06-21'], 'Amount': [566, 843, 754, 547]})

# Select Amount explicitly so the Issue_dt strings are not aggregated as well
df[pd.to_datetime(df['Issue_dt']).dt.year==2021].groupby('Customer_key')['Amount'].sum()

First filter df by year (after converting Issue_dt to datetime), then sum Amount within each Customer_key group.
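
The title asks for the "last 1 year", which could also be read as a rolling 12-month window rather than a calendar year. A minimal sketch of that reading follows. It assumes a reference date of 2022-02-01, picked here only so the result is reproducible (`pd.Timestamp.now()` would be the live equivalent), and the names `reference` and `cutoff` are hypothetical.

```python
import pandas as pd

df = pd.DataFrame({
    "Customer_key": [45435, 64352, 43766, 45435],
    "Issue_dt": ["2021-03-19", "2021-06-22", "2020-04-29", "2021-06-21"],
    "Amount": [566, 843, 754, 547],
})
df["Issue_dt"] = pd.to_datetime(df["Issue_dt"])

# Assumed reference date; swap in pd.Timestamp.now() for a live window.
reference = pd.Timestamp("2022-02-01")
cutoff = reference - pd.DateOffset(years=1)  # 2021-02-01

# Keep rows issued after the cutoff and up to the reference date.
mask = (df["Issue_dt"] > cutoff) & (df["Issue_dt"] <= reference)
rolling_totals = df[mask].groupby("Customer_key")["Amount"].sum()

print(rolling_totals)
```

For the sample rows this window happens to select the same three 2021 rows as the calendar-year filter, but the two approaches diverge as soon as the reference date moves away from the start of a year.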

edited Feb 01 '22 at 12:01

answered Feb 01 '22 at 09:12

keramat

So your solution is the same as mine, only with the conversion to datetimes? – jezrael Feb 01 '22 at 09:13
Actually, I did not see yours. When I started coding, there were no answers yet! Yes, they are the same. – keramat Feb 01 '22 at 09:14
Yeah, when you posted, a solution had already been up for 5 minutes. Whether that is long or not is up for discussion ;) – jezrael Feb 01 '22 at 09:16
