I am trying to run spark-submit command from drive/folder where my python script and dataset is H:\spark_material
. It just won't work !
But if I copy my python script into this folder C:\spark\bin
then it works.
I believe it has something to do with environment variables.
Here is my Path = %JAVA_HOME%\bin; %SPARK_HOME%\bin
Here are my variables:
HADOOP_HOME = C:\winutils
JAVA_HOME = C:\jdk
SPARK_HOME = C:\spark
Java is properly installed as I have tried typing "java -version" anywhere in CMD and it works!!