I am using Spark Structured Streaming (Version 2.3.2). I need to read from Kafka Cluster and write into Kerberized Kafka. Here I want to use Kafka as offset checkpointing after the record is written into Kerberized Kafka.
Questions:
- Can we use Kafka for checkpointing to manage offset or do we need to use only HDFS/S3 only?
Please help.