Let's assume the configuration 14 executors, each with 10 GB memory and my input data size is 1000GB.
Q1 - How executors will process the huge data because data size is more and how partion logic works here ?
Q2 - How caching can help here for better performance and what caching strategy can be used ? (Mem only , Mem and disk etc )