I am having a problem running similar code in yarn-client mode and yarn-cluster mode. My code executes perfectly when I run it in yarn-client mode, but fails when I run it in yarn-cluster mode.
It throws a file-not-found exception stating that the pyspark.zip file could not be found. Any insight into this would be helpful.