It seems that when the NebulaGraph cluster is deployed in containers (Docker Compose or K8s), I cannot get the Spark Connector to read the graph data properly, no matter what I try.
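For context, here is roughly the kind of read I am attempting, following the Spark Connector's documented Scala API. The meta address, space, tag, and column names below are placeholders for my setup:

```scala
import com.vesoft.nebula.connector.connector.NebulaDataFrameReader
import com.vesoft.nebula.connector.{NebulaConnectionConfig, ReadNebulaConfig}
import org.apache.spark.sql.SparkSession

object NebulaReadSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder().appName("nebula-read-sketch").getOrCreate()

    // Point the connector at metad; "192.168.1.10:9559" is a placeholder for
    // the host-mapped meta port of the containerized cluster.
    val connConfig = NebulaConnectionConfig
      .builder()
      .withMetaAddress("192.168.1.10:9559")
      .withConenctionRetry(2) // method name as spelled in the connector's builder API
      .build()

    // Placeholder space/tag/columns.
    val readConfig = ReadNebulaConfig
      .builder()
      .withSpace("basketballplayer")
      .withLabel("player")
      .withNoColumn(false)
      .withReturnCols(List("name", "age"))
      .withPartitionNum(10)
      .build()

    // This is the read that only works for me when Spark itself runs inside
    // the container network.
    val df = spark.read.nebula(connConfig, readConfig).loadVerticesToDF()
    df.show()
  }
}
```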
The workaround I used was to run Spark inside the container network:
- for the Docker Compose case, I put the Spark env in the same Docker network (see the compose sketch after this list)
- for K8s, I created a Spark env inside the K8s cluster
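For the Docker Compose case, the attachment looked roughly like this. This is only a sketch: the external network name and the Spark image are assumptions based on a default nebula-docker-compose deployment, so adjust them to whatever `docker network ls` actually shows:

```yaml
# Sketch: attach a Spark container to the same network as the NebulaGraph
# services, so the connector can reach metad/storaged by container address.
version: '3.5'
services:
  spark:
    image: bitnami/spark:2.4.5   # assumed Spark image/version
    networks:
      - nebula-net
networks:
  nebula-net:
    external: true
    name: nebula-docker-compose_nebula-net   # assumed default compose network name
```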
However, this is not always feasible, especially since in production we will typically have existing Spark infrastructure outside the cluster.
Could anyone explain what exactly is needed to make Spark, running outside the container network, work with NebulaGraph running in containers?