I am using Spark 2.2. I have a join query on a partitioning column, plus some filter conditions on other columns. When I checked the execution plan, I see the following:

  1. It checks for non-null partition columns.

  2. It applies the predicates to the entire table even before joining with the second table. This causes Spark to read all partitions and apply the filters first, then join to get the data, even though my join clause actually hits only one partition. (A minimal sketch of the setup follows below.)
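
For illustration, here is roughly the shape of the query I mean (`facts`, `dims`, `dt`, and `status` are placeholder names, not my real schema):

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("partition-join").getOrCreate()

// `facts` is assumed to be partitioned by `dt`; `dims` is a small lookup table.
val facts = spark.table("facts")
val dims  = spark.table("dims")

val joined = facts
  .join(dims, facts("dt") === dims("dt"))   // join on the partition column
  .filter(facts("status") === "ACTIVE")     // filter on a non-partition column

// On Spark 2.2 the physical plan shows the filter pushed into a scan of
// every partition of `facts`, not just the ones the join actually hits.
joined.explain(true)
```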

Why does my query need to scan all partitions? Is there any way to control predicate pushdown in Spark when doing joins?
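One workaround I am considering: as far as I know, Spark 2.2 has no dynamic partition pruning (that arrived in 3.0), so partitions are only pruned when there is a literal predicate on the partition column at planning time. Collecting the join keys and filtering with them explicitly might force that (same placeholder names as above):

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

val spark = SparkSession.builder().appName("pruned-join").getOrCreate()
val facts = spark.table("facts")   // hypothetical: partitioned by `dt`
val dims  = spark.table("dims")    // hypothetical: small lookup table

// Collect the (assumed small) set of partition keys from the dimension side.
val keys = dims.select("dt").distinct().collect().map(_.getString(0))

// A literal IN-list on the partition column is visible at planning time,
// so the scan prunes down to the matching partitions before the join runs.
val pruned = facts
  .filter(col("dt").isin(keys: _*))
  .join(dims, Seq("dt"))
  .filter(col("status") === "ACTIVE")

pruned.explain(true)
```

This only works if the set of keys on the dimension side is small enough to collect to the driver, so I am not sure it is a general answer.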

  • Does this answer your question? [How to prevent predicate pushdown?](https://stackoverflow.com/questions/50336355/how-to-prevent-predicate-pushdown) – Rayan Ral Jun 05 '20 at 05:55
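
For reference, the linked question goes in the opposite direction (preventing pushdown rather than exploiting it); its approach, roughly sketched with the same placeholder names as above:

```scala
// Reusing `spark` from the sketches above.
import org.apache.spark.sql.functions.col

// Caching materializes the data as an InMemoryRelation, so a filter applied
// afterwards runs against the cached rows instead of being pushed down into
// the underlying file scan.
val cachedFacts = spark.table("facts").cache()
cachedFacts.filter(col("status") === "ACTIVE").explain(true)
```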

0 Answers