Understanding how Kafka load balances requests among multiple pods in a deployment

scottR · January 28, 2024, 2:23am

Hello everyone,
I have a question.
Does the client consumer read from partition followers? For example, if I have 6 pods (instances) of a deployment (Kubernetes env) that consume the same topics, how does Kafka load balance their requests?
Thanks

oWoods · January 28, 2024, 3:14am

Prior to Kafka 2.4, consumption was always done from the leader. With the introduction of KIP 392, replica follower is now possible.

https://cwiki.apache.org/confluence/display/KAFKA/KIP-392%3A+Allow+consumers+to+fetch+from+closest+replica

The default configuration is still based on leader, but can be changed with this configuration

https://kafka.apache.org/documentation/#brokerconfigs_replica.selector.class

This selected can be used in order to have rack awareness used to try to find a replica closer to the client.

org.apache.kafka.common.replica.RackAwareReplicaSelector

If you do decide to go down this path, I would recommend closely monitoring latency as it can easily go up not down.

** my observation/assumption **
This approach is more about reducing network costs by having clients read from the same AZ more than trying to reduce latency.

scottR · January 28, 2024, 3:21am

Thank you for your ongoing support