I have a big job that joins several tables and eventually produces an aggregated table for a final report. But every time I fetch the final summary table with some filters, the job takes a really long time to finish, and I believe this is because of Spark's lazy evaluation. Is there a way to evaluate the final summary table first, so that filtering the summary afterwards is faster each time?
I know that writing the summary table to storage and reading it back would solve the problem, but if I don't want to write it out and read it back, is there any other way?