How to convert column to row in pyspark

Question

I got the following result in pyspark I used the .discribe() function

+------+-------+
|    _c0|   _c1|
+------+-------+
| count|2674686|
|  mean|    0.0|
|stddev|    0.0|
|   min|      0|
|   max|      0|
+------+-------+

i'm trying my result to converting like this

+------+-------+-------+-------+-------+
|  count|  mean| stddev|   min|   max|
+------+-------+-------+-------+-------+
|2674686|   0.0|    0.0|      0|      0|
+------+-------+-------+-------+-------+

how can i solve it? If this is hard, is it possible to change columns and rows when using the discribe() function?

score 2 · Accepted Answer · answered Aug 18 '21 at 11:00

2

groupBy() and pivot functions can be used here:

your_df.groupBy().pivot("_c0").agg(first('_c1')).show()

answered Aug 18 '21 at 11:00

Enayat

3,904
1
33
47

score 0 · Answer 2 · answered Aug 18 '21 at 09:21

0

You can use the .pivot() function for that.

Example:

from pyspark.sql.functions import first

data_frame.pivot("_c0").agg(first('_c1')).show()

answered Aug 18 '21 at 09:21

Robert Kossendey

6,733
2
12
42

How to convert column to row in pyspark

2 Answers2