Kafka producer reconnect

Question

Hi,

We have a pentaho integration in place to stream data to a data warehouse system through Kafka. We have the connection set up by providing bootstrap servers and kafka topic. As bootstrap servers we specify all kafka brokers as broker1:9091,broker2:9092,broker3:9092
Pentaho connects to the kafka cluster as expected and can send data, however if for any reason the lead broker for that topic dies and restarts, the sending does not resume, even though kafka has elected a new leader for the specific topic.
Pentaho retries to send data for a minute but since the original lead broker is no longer valid for the topic, it fails and never tries to reconnect.
Questions:
How to make Pentaho try a reconnect after a failure in the send operation?
Is there a way to lenghten the retry period after a failed send?
Why do we experience massive message loss during such an incident? (Last time when we experimented with this, status lines indicated ~27000 messages sent, while we only had ~23000 messages in the kafka topic.)

By the way: we are using 9.3.0.0-428 build.

Thanks in advance,
Imre

Answer

Imre,

Reviewing the logs, the connection to the server is being disconnected. This can also occur when the lead broker dies or being killed for some reason which is what you have described initially as well.
As I mentioned earlier, a new leader should be taken up which isn't happening in your case.
This could be possible because of the way your HA setup is done, however you can try to set the parameter " retried" instead of default value 0 to a value using which it should pick up the new leader.
Can you try with "retries" and "retry.backoff.ms" parameter in the Kafka Producer step - Options tab? See attached screen shot

Pentaho

Kafka producer reconnect

Related Content

RE: Kafka producer reconnect

RE: Kafka producer reconnect

Kafka consumer step will not read messages

AMQP Consumer and AMQP Producer steps on PDI 8.3 community

Kafka consumer java.lang.OutOfMemoryError: GC overhead limit exceeded