I am trying to connect Astra Cassandra in AWS EMR. but Executor are not able to get the bundle files as I am passing the file through S3.
this the spark submit command i passing.
--master yarn
--class com.proj.prog
--packages com.datastax.spark:spark-cassandra-connector_2.12:3.1.0,org.apache.hadoop:hadoop-aws:3.1.2
--conf spark.files=s3://.../connect/secure-connect-proj.zip
--conf spark.cassandra.connection.config.cloud.path=secure-connect-proj.zip
mode is cluster its working in client mode but not in cluster.
I also tried with but none worked.
--conf spark.cassandra.connection.config.cloud.path=s3://.../connect/secure-connect-proj.zip
This was the error in both cases.
diagnostics: User class threw exception: java.io.IOException: \
Failed to open native connection to Cassandra \
at Cloud File Based Config at secure-connect-proj.zip :: \
The provided path secure-connect-proj.zip is not a valid URL \
nor an existing locally path. Provide an URL accessible to all executors \
or a path existing on all executors (you may use `spark.files` \
to distribute a file to each executor).
Job aborted due to stage failure: Task 0 in stage 1.0 failed 4 times, \
most recent failure: Lost task 0.3 in stage 1.0 (TID 7) \
(ip-172-31-17-85.ap-south-1.compute.internal executor 1): \
java.io.IOException: Failed to open native connection to Cassandra \
at Cloud File Based Config at s3://.../connect/secure-connect-proj.zip :: \
The provided path s3://.../connect/secure-connect-proj.zip is not a valid URL \
nor an existing locally path. Provide an URL accessible to all executors \
or a path existing on all executors (you may use `spark.files` \
to distribute a file to each executor).
Please help. I know I am missing something but I could not found a working solution.