Pandas, for each unique value of column1 i want to count how many times it appear for each unique value of column2

Question

I have a dataframe which columns look like actor | object | group | phone | day, i want to obtain a vector (or other name, i'm new to python) for each actor that counts how many times that actor appeared by day. The vector should be just the count by day.

The initial data in in a pandas dataframe and looks like this:

    actor  object  group  phone    Day
    john    don     saf   9234...  27-06-2015

Something like this: John (2,0,8,2,2,3,0,0,2,1,5,3,3,0,0...) I want every day to appear even if that actor doesn´t appear in that day, the count should be 0 in that case. I've tried many ways but never manage to get this.

The output intended is:

    Actor (data day 1) (data day 2) (data day 3) (data day 4)
    john      2             2           0            3        ...

or

    john (2,2,0,3,...)

the important ideia is to get for each actor the ammount of times ir appears in the inicial data for each day.

Hi. Please take the time to read this post on [how to provide a great pandas example](http://stackoverflow.com/questions/20109391/how-to-make-good-reproducible-pandas-examples) as well as how to provide a [minimal, complete, and verifiable example](http://stackoverflow.com/help/mcve) and revise your question accordingly. These tips on [how to ask a good question](http://stackoverflow.com/help/how-to-ask) may also be useful. — jezrael, Dec 07 '18 at 11:44
That works for the count part, my problem is getting the each day to appear, a "column" or "space" for each day. That way every actor is counted how many times it appears in a day, even if its zero. — Mariana, Dec 07 '18 at 12:21
I don't understand a lot about stackoverflow, do you mean like that? It loses the format whn i try do copy — Mariana, Dec 07 '18 at 12:41

score 0 · Accepted Answer · answered Dec 07 '18 at 11:53

0

This should do what you want:

df.groupby(['actor', 'day']).agg({'day':'count'})

answered Dec 07 '18 at 11:53

yatu

86,083
12
84
139

better is `df.groupby(['actor', 'day']).size()` - check dupe – jezrael Dec 07 '18 at 11:56
True @jezrael, simpler. – yatu Dec 07 '18 at 11:57
your solution should be `df.groupby(['actor', 'day'])[day'].count()` - If OP need count non NaNs values. – jezrael Dec 07 '18 at 11:58
Thank you so much for all your answers, but i wanted to get a column for each day, and this way i have just two columns, actor and Day, do you know a way? – Mariana Dec 07 '18 at 12:11

Pandas, for each unique value of column1 i want to count how many times it appear for each unique value of column2

1 Answers1