Get all data in the same row in a pandas dataframe

Question

I have a dataframe with 3 columns: id, single, age.

    'id': [1, 1, 1, 2, 2, 3, 3, 4],
    'single': ['y', '', '', '', 'n', 'n', '', 'y'],
    'age': ['', 22, '', 34, '', 22, '', 43]
}

Some rows of the same id have NaN values and others have info.

I want something like:

data = {
    'id': [1,2,3, 4],
    'single': ['y' 'n', 'n', 'y'],
    'age': [22,  34, 22,  43]
}

Is it possible?

`df.replace('', np.nan).groupby('id', as_index=False).first()` — mozway, Jun 06 '23 at 17:25
Also duplicate of: https://stackoverflow.com/questions/57300682/pandas-how-to-merge-rows-with-blank-columns — Celius Stingher, Jun 06 '23 at 17:26

score 1 · Answer 1 · answered Jun 06 '23 at 17:23

1

Just use groupby and first. Replace the '' with np.nan before that.

df.replace('', np.nan, inplace=True)
df_new = df.groupby('id', as_index=False).first()

answered Jun 06 '23 at 17:23

NYC Coder

1 Answers1