Converting repeating rows to columns in pandas dataframe

Question

I am trying to convert a dataframe with repeating rows into columns as follows

INPUT
Key  |  Value
A    |  1
B    |  2
C    |  3
A    |  4
B    |  5
C    |  6

EXPECTED OUTPUT
A  |  B  |  C
1  |  2  |  3
4  |  5  |  6

There are a lot of options like pivot(), unstack(), groupby(), etc. But, I was unsure of using it with just 2 columns as shown in the input.

@Ch3steR In my opinion its not a straight forward pivot. Please check my answer and let me know if I have missed anything obvious. — Mayank Porwal, Nov 06 '20 at 11:57
@Ch3steR after looking at q.10 agree that its a dupe - same as my answer pretty much. — Umar.H, Nov 06 '20 at 12:00
@MayankPorwal Question 10 is the exact same as what OP wanted. The one with `GroupBy.cumcount` and `df.pivot` — Ch3steR, Nov 06 '20 at 12:18
@Ch3steR Oh ok. I answered it in another way. Should I remove my solution or can I keep it? — Mayank Porwal, Nov 06 '20 at 12:19
@MayankPorwal One more way to solve the question. Why delete it? — Ch3steR, Nov 06 '20 at 12:20

Mayank Porwal · Answer 1 · 2020-11-06T12:02:01.030

Its not a straight-forward pivot. Do this using df.pivot with df.apply and Series.dropna:

In [747]: x = df.pivot(index=None, columns='Key', values='Value').apply(lambda x: pd.Series(x.dropna().to_numpy()))

In [748]: x
Out[748]: 
Key    A    B    C
0    1.0  2.0  3.0
1    4.0  5.0  6.0

Explanation:

Let's break it down:

First you pivot your df like this:

In [751]: y = df.pivot(index=None, columns='Key', values='Value')

In [752]: y
Out[752]: 
Key    A    B    C
0    1.0  NaN  NaN
1    NaN  2.0  NaN
2    NaN  NaN  3.0
3    4.0  NaN  NaN
4    NaN  5.0  NaN
5    NaN  NaN  6.0

Now we are close to your expected output, but we need to remove Nan and collapse the 6 rows into 2 rows.

For that, we convert each column to a pd.Series and dropna():

In [753]: y.apply(lambda x: pd.Series(x.dropna().to_numpy()))
Out[753]: 
Key    A    B    C
0    1.0  2.0  3.0
1    4.0  5.0  6.0

This is your final output.

Could you please explain what the lambda function is doing? – Gagan G Nov 06 '20 at 11:50 — Gagan G, Nov 06 '20 at 11:50
@GaganG I've added explanation. Please check. – Mayank Porwal Nov 06 '20 at 11:55 — Mayank Porwal, Nov 06 '20 at 11:55

Converting repeating rows to columns in pandas dataframe

1 Answers1

Explanation: