1

I am currently working on Spark 1.6 using Scala. I want to get the quantiles of an integer column. Unfortunately, Spark doesn't have any quantile fun in 1.6. However, I found that we have percentile_approx() in hive. Is there any significant difference between those two? or Can I just use Percentile_approx instead of quantiles?

Raj
  • 11
  • 3
  • 1
    you cant use percentile instead of quantile there is [difference](http://stats.stackexchange.com/a/156787) Spark2 has api support for this For <2 please [see](http://stackoverflow.com/a/31437177/647053) – Ram Ghadiyaram Jan 02 '17 at 07:51
  • Thanks for pointing me to the appropriate article @Ram – Raj Jan 10 '17 at 22:31

0 Answers0