Pandas: Summarize Event Data into a Table

Question

I have a dataframe with two series:

Date of Call
Sales Rep

The dataframe is a list of all calls to customers made by a sales force over the course of one year - one row for each call. The date field is the date of the call, and the "Sales Rep" is the person who made the call. There are approximately 250k rows in the dataframe.

I'd like to summarize this data into a new dataframe with the sales reps as the index and the number of calls per month as the columns, i.e. one row for each sales rep and one series for each month. I thought `pd.pivot` was the way to go, but that didn't work.

What's the easiest and most pythonic way to achieve this result?

is [this](http://pbpython.com/pandas-crosstab.html) what you need? — Maarten Fabré, Oct 23 '18 at 12:44
please also provide some sample data and expected outcome, and what you already tried — Maarten Fabré, Oct 23 '18 at 12:44
This isn't a duplicate question. The question it supposedly duplicates is a general discussion of aggregating data, and there is no discussion of time based data. If this question is classed as a duplicate then I suggest virtually all `pd.groupby` questions are also duplicates of the same question. — Steve Maughan, Oct 24 '18 at 00:15

jezrael · Accepted Answer · 2018-10-23T12:53:58.700

2

I believe you need crosstab:

    df = pd.crosstab(df['Sales Rep'], df['Date of Call'].dt.month)
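For anyone who wants to try this out, here is a minimal, self-contained sketch of the same `crosstab` call. The rep names and dates are made-up sample data (the question didn't include any), and the `pd.to_datetime` step is only an assumption in case the dates were read in as strings, since `.dt` needs a datetime column.

```python
import pandas as pd

# Hypothetical sample data standing in for the real 250k-row call log.
calls = pd.DataFrame({
    "Date of Call": ["2018-01-05", "2018-01-20", "2018-02-11",
                     "2018-02-14", "2018-03-02"],
    "Sales Rep": ["Alice", "Bob", "Alice", "Alice", "Bob"],
})

# The .dt accessor requires a datetime dtype, so convert string dates first.
calls["Date of Call"] = pd.to_datetime(calls["Date of Call"])

# Rows: sales reps; columns: month number (1-12); values: count of calls.
summary = pd.crosstab(calls["Sales Rep"], calls["Date of Call"].dt.month)
print(summary)
```

Rep/month combinations with no calls come out as 0. If the data ever covered more than one calendar year, passing `calls["Date of Call"].dt.to_period("M")` instead of `.dt.month` should keep, say, January 2018 and January 2019 in separate columns.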

edited Oct 23 '18 at 12:53

answered Oct 23 '18 at 12:46


1

perfect! Thanks so much – Steve Maughan Oct 24 '18 at 00:07
