How to combine rows with same id number using pandas in python?

Question

I have a big csv file and I would like to combine rows with the same id#. For instance, this is what my csv shows right now.

and I would like it to be like this:

how can I do this using pandas?

Does this answer your question? [pandas group by and find first non null value for all columns](https://stackoverflow.com/questions/59048308/pandas-group-by-and-find-first-non-null-value-for-all-columns) — Chris, Jul 06 '22 at 17:07

score 0 · Answer 1 · answered Jul 06 '22 at 17:16

Try this:

df = df.groupby('id').agg({'name':'last', 
                           'type':'last', 
                           'date':'last' }).reset_index()

this way you can have customized function in handling each columns. (By changing the function from 'last' to your function)

Matteo Buffagni · Answer 2 · 2022-07-06T17:22:57.937

0

You can read the csv with pd.read_csv() function and then use the GroupBy.last() function to aggregate rows with the same id.

something like:

df = pd.read_csv('file_name.csv')
df1 = df.groupby('id').last()

you should also decide an aggregation function instead of using "the last" row value.

edited Jul 06 '22 at 17:22

answered Jul 06 '22 at 17:17

Matteo Buffagni

2 Answers2