I get this error message intermittently, which makes the job very unreliable.
On closer inspection, with continuous logging, I see the following error:
2021-09-02 10:38:19,810 ERROR [main] glue.ProcessLauncher (Logging.scala:logError(70)): Unknown error from Python: Error Traceback is not available.
The error above does not say where the problem lies. I am running Glue 2.0 with Python 3 and worker type G.1X. The data being pulled is only 100,000 rows, so this should not be a memory issue.
The AWS documentation page for this error is not very helpful. It says only:
The AWS Glue job fails with the error "Command failed with exit code 10"
Check the CloudWatch logs for the job to find errors related to executors. This error usually occurs during the shuffle stage of Spark.
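For reference, the log check suggested above could be scripted roughly as follows. This is only a sketch: the helper name find_error_events is made up for illustration, the default log group "/aws-glue/jobs/error" and the use of the job run ID as the log stream prefix are assumptions about a default Glue setup, and logs_client is assumed to be a boto3 CloudWatch Logs client.

```python
def find_error_events(logs_client, job_run_id,
                      log_group="/aws-glue/jobs/error", pattern="ERROR"):
    """Collect log messages matching `pattern` for one Glue job run.

    Assumptions (not confirmed for every setup): the job writes to
    `log_group`, and its log stream names start with the job run ID.
    """
    kwargs = {
        "logGroupName": log_group,
        "logStreamNamePrefix": job_run_id,
        "filterPattern": pattern,
    }
    messages = []
    while True:
        page = logs_client.filter_log_events(**kwargs)
        messages.extend(event["message"] for event in page.get("events", []))
        token = page.get("nextToken")
        if not token:
            # No more pages of results.
            return messages
        kwargs["nextToken"] = token
```

It would be called with something like find_error_events(boto3.client("logs"), job_run_id), where job_run_id is the ID of the failed run. With continuous logging enabled, the log group may differ, so the log_group argument would need adjusting.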
Where is the script failing?
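One way to narrow this down might be to wrap each stage of the script so that its name and full traceback are logged before the process exits. The sketch below uses only the standard library; run_step and the logger name are hypothetical, not part of Glue. It may capture nothing if the Python process is killed outright rather than raising an exception.

```python
import logging
import traceback

logger = logging.getLogger("glue_job_steps")  # hypothetical logger name


def run_step(name, func, *args, **kwargs):
    """Run one stage of the job, logging its name and any traceback."""
    logger.info("Starting step: %s", name)
    try:
        result = func(*args, **kwargs)
    except Exception:
        # Log the full traceback ourselves, then re-raise so the job still fails.
        logger.error("Step %s failed:\n%s", name, traceback.format_exc())
        raise
    logger.info("Finished step: %s", name)
    return result
```

Each stage (read, transform, write) would be passed in as a function, for example run_step("read", read_source), where read_source stands in for whatever the script actually does. The last "Starting step" line without a matching "Finished step" line would then indicate the stage that failed.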