* Kafka Infrastructure Management: Handle provisioning, patching, and upgrades for Kafka clusters using Confluent Platform version 7 (Kafka 3.7), ensuring optimal performance and stability.
* Incident Management: Proactively manage and resolve Kafka-related incidents, minimizing downtime and ensuring system reliability.
* AWS & Kubernetes Expertise: Manage Kafka clusters on AWS, leveraging Kubernetes for deployment and scaling to ensure high availability and performance.
* Automation with Puppet: Automate Kafka infrastructure management (deployments, patches) using Puppet for consistency and efficiency across environments.
* Cluster Oversight: Maintain and monitor two Kafka clusters (one in pre-prod, one on AWS), ensuring seamless operations, data replication, and reliability.