Beginner question. This seems like it should be a straightforward operation, but I can't figure it out from reading the docs.
I have a df with this structure:
|integer_id|int_field_1|int_field_2|
The integer_id column is non-unique, so I'd like to group the df by integer_id and sum the two fields.
The equivalent SQL is:
SELECT integer_id, SUM(int_field_1), SUM(int_field_2) FROM tbl
GROUP BY integer_id
Any suggestions on the simplest way to do this?
EDIT: Including input/output
Input:
integer_id int_field_1 int_field_2
2656 36 36
2656 36 36
9702 2 2
9702 1 1
Ouput using df.groupby('integer_id').sum():
integer_id int_field_1 int_field_2
2656 72 72
9702 3 3