Combining similar dataframe rows

Question

I currently have a dataframe which looks like this

User    Date     FeatureA FeatureB
John    DateA      1        2
John    DateB      3        5

Is there anyway that I can combine the 2 rows such that it becomes

  User    Date1    Date2    FeatureA1 FeatureB1 FeatureA2 FeatureB2
  John    DateA    DateB        1        2          3        5

Do you want to combine all rows that have the same `User`? Can you have more than two rows that have the same `User`? — DYZ, Mar 21 '18 at 06:06
@DyZ I would like to combine all rows that have the same User such that the dataframe would only have unique Users — Derrick Peh, Mar 21 '18 at 06:08
@DyZ For the 2nd part of your question, I do not wish to have more than one row that have the same User — Derrick Peh, Mar 21 '18 at 06:14
You misunderstood my question. Can the original data have _more than two_ rows that have the same `User`? — DYZ, Mar 21 '18 at 06:15
Yes, the original data can have more than 2 rows of the same user — Derrick Peh, Mar 21 '18 at 06:17
Possible duplicate of [Pandas long to wide reshape](https://stackoverflow.com/questions/22798934/pandas-long-to-wide-reshape) — DYZ, Mar 21 '18 at 06:21

jezrael · Accepted Answer · 2018-03-21T06:23:02.173

I think need:

g = df.groupby(['User']).cumcount()
df = df.set_index(['User', g]).unstack()
df.columns = ['{}{}'.format(i, j+1) for i, j in df.columns]
df = df.reset_index()
print (df)
   User  Date1  Date2  FeatureA1  FeatureA2  FeatureB1  FeatureB2
0  John  DateA  DateB          1          3          2          5

Explanation:

Get count per groups by Users with cumcount
Create MultiIndex by set_index
Reshape by unstack
Flatenning MultiIndex in columns
Convert index to columns by reset_index

Combining similar dataframe rows

1 Answers1