spark performace in native UDF for sql vs scala vs python

Asked Mar 29 '23 at 06:23

Active Mar 29 '23 at 06:23

Viewed 37 times

When using native UDFs does performance vary based on language used (scala vs python vs sql)

I have a normal sql statement with 2 CST running on spark through databricks. Then I did that same process in same order using pyspark in databricks on same data. No custom UDF involved.

I found that pyspark took 6 seconds and sql took 5 seconds.

asked Mar 29 '23 at 06:23

Aseem

5,848
7
45
69

https://stackoverflow.com/questions/38296609/spark-functions-vs-udf-performance – afjcjsbx Mar 29 '23 at 07:10

spark performace in native UDF for sql vs scala vs python

0 Answers0