Recently our (Kafka 1.1.1) brokers went down and our Kafka-stream application stopped working. So we stopped the application manually to stop the alerts.
After the Kafka came live again, We started our stream application but it didn't read any messages from the topic. We found out after reading the logs that the Group Coordinator keeps discovering and then becoming unavailable again in an endless loop.
Below are the logs.
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-10-consumer, groupId=dummy-consumer-id] Group coordinator test-kafka01.com:9092 (id: 2147483644 rack: null) is unavailable or invalid, will attempt rediscovery
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-1-consumer, groupId=dummy-consumer-id] Discovered group coordinator test-kafka01.com:9092 (id: 2147483644 rack: null)
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-1-consumer, groupId=dummy-consumer-id] (Re-)joining group
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-3-consumer, groupId=dummy-consumer-id] Discovered group coordinator test-kafka01.com:9092 (id: 2147483644 rack: null)
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-3-consumer, groupId=dummy-consumer-id] (Re-)joining group
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-2-consumer, groupId=dummy-consumer-id] Discovered group coordinator test-kafka01.com:9092 (id: 2147483644 rack: null)
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-2-consumer, groupId=dummy-consumer-id] (Re-)joining group
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
[Consumer clientId=dummy-consumer-id-6b3ad573-5b6a-4e89-82c1-1705e3662d55-StreamThread-1-consumer, groupId=dummy-consumer-id] Group coordinator test-kafka01.com:9092 (id: 2147483644 rack: null) is unavailable or invalid, will attempt rediscovery
loggerName":"org.apache.kafka.clients.consumer.internals.AbstractCoordinator
After doing much analysis we decided to change the stream app id and start the application, Everything worked fine but after some time the same issue happened again.
Please help me to debug this issue. We can't afford to change the stream app id each time and do let me know if any detail is required.