Job Description
We are seeking an experienced Cloud Infrastructure Engineer to join our team and work with a global leader in digital services. The successful candidate will be responsible for ensuring the reliability, scalability, and efficiency of clients' platforms.
Key Responsibilities:
* Designing and implementing scalable and resilient cloud infrastructure to ensure seamless deployment and optimisation of containerised applications.
* Collaborating with cross-functional teams to implement automation strategies that reduce operational complexity and drive continuous improvement.
* Building strong observability practices, aligning with the SRE mindset & principles, and driving continuous improvement.
* Defining and implementing Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to measure and maintain system & application performance, ensuring services meet agreed reliability targets.
* Instrumenting applications to collect key metrics, logs, and traces that enable proactive monitoring and troubleshooting.
* Creating dashboards and configuring alerts to provide real-time visibility into system health, enabling teams to quickly detect and resolve issues.
* Assessing and enhancing Kubernetes capabilities, improving DevOps efficiency through innovation, agility and cost optimisation.
* Taking a holistic approach to modernising the developer experience, focusing on organisational culture, DevOps practices, processes, automation and tooling.
Requirements:
* Experience in implementing site reliability engineering (SRE) principles, with a focus on observability and optimising applications & cloud environments.
* Strong understanding of the SRE mindset and principles, including the creation and management of Service Level Indicators (SLIs) and Service Level Objectives (SLOs), ensuring reliability and performance.
* Experience in implementing observability, instrumenting applications to provide insights into system performance. Hands-on experience with tools such as Dynatrace, Prometheus and OpenTelemetry for monitoring, tracing, and real-time alerting is highly sought after.
* An understanding of Microservices & container orchestration with the ability to optimise containerised applications for reliability and scalability.
* Experience enabling continuous delivery pipelines, with a focus on ensuring system reliability, quality, and performance through automated deployment, scaling, and observability tools.
* Understanding of build and deployment pipelines and experience collaborating with developers to improve observability and monitoring practices.
* Strong collaboration skills with the ability to work effectively both independently and as part of a team.
* A comfort level interacting and engaging with clients, although a consulting background is not a prerequisite.
About Us
83zero Limited is a boutique Tech & Data Recruitment Consultancy based within the UK. We provide high-quality interim and permanent Tech & Data professionals. Our client is a global leader in digital services, driving innovation in customer experience through CRM, marketing, business intelligence, and cloud solutions.
Location and Salary
This role offers a competitive salary of £52,500 - £57,000 per annum, plus pension and private healthcare benefits. You will also have the opportunity to work from home or office locations across the UK, with a mix of client sites and hybrid working arrangements.