Job Description
This is an amazing opportunity for an experienced Site Reliability Engineering (SRE) to take the next step in their career working for an exciting, growing health-tech company driving digital transformation in UK healthcare.
As a senior SRE team member of our Azure-based infrastructure team, you will be responsible for ensuring the reliability, availability, and performance of our clients' applications and services. You will work closely with engineering, operations, and product teams to implement best practices for cloud infrastructure management and operational excellence.
Responsibilities:
Assist with the design, implementation, and manage scalable, resilient, and secure Azure infrastructure.
Oversee the deployment, configuration, and maintenance of cloud resources, ensuring they meet performance and reliability standards.
Lead incident response efforts, ensuring quick resolution and minimizing impact on service availability.
Conduct post-incident reviews and implement corrective actions to prevent future occurrences.
Drive automation initiatives to reduce manual intervention, improve efficiency, and enhance system reliability.
Implement Infrastructure as Code (IaC) using tools like Terraform, or Azure Bicep.
Continuously assess and improve system performance, capacity, and reliability.
Building new customer deployments and performing maintenance on e...