
Say I have a dataframe that contains cars, their brand, and their price. I would like to replace the avg below with the median (or another percentile):

df.groupby('carBrand').agg(F.avg('carPrice').alias('avgPrice'))

However, it seems that there is no aggregation function that allows one to compute this in Spark.

Greg

1 Answer


You can try the approxQuantile function (see the pyspark.sql documentation: http://spark.apache.org/docs/latest/api/python/pyspark.sql.html#module-pyspark.sql.functions).

Assaf Mendelson