We have two cloudera 5.7.1 clusters, one secured using Kerberos and one unsecured.
Is it possible to run Spark using the unsecured YARN cluster while accessing hive tables stored in the secured cluster? (Spark version is 1.6)
If so, can you please provide some explanation on how can I get it configured?
Update:
I want to explain a little the end goal behind my question. Our main secured cluster is heavily utilized and our jobs can't get enough resources to complete in a reasonable time. In order to overcome this, we wanted to use resources from another unsecured cluster we have without needing to copy the data between the clusters.
We know it's not the best solution as the data locality level might not be optimal, however that's the best solution we can come up for now.
Please let me know if you have any other solution as it seems like we can't achieve the above.