We have an exciting opportunity for an SRE/DevOps Engineer to join a leading software house. Reporting to the Head of Engineering, this role offers the chance to stay hands-on and participate in strategic decisions.
We want someone with fresh ideas and supporting experience who enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences.
Responsibilities:
* Ownership and implementation across multiple projects from our SRE strategy that focuses on maturity, scalability, resilience, security and automation.
* Manage and maintain environments to ensure high availability and security.
* Build and maintain infrastructure as code (IaC) solutions using tools like Terraform.
* Manage AWS services – monitoring, investigating and fixing any issues if they arise.
* Manage and optimise Kubernetes clusters for efficiency, stability and high availability.
* Design and implement CI/CD pipelines to automate software delivery, maturing our current approaches.
* Design and implement robust monitoring systems that proactively identify and address performance bottlenecks, security vulnerabilities, and system failures.
* Automate manual tasks to improve operational efficiency and reduce technical debt.
* Collaborate with other SRE engineers to ensure resilience and scalability across the platform.
* Work with our teams to directly influence and drive the adoption of SRE best practices and ways of working within our microservice architecture.
* Provide primary operational support and engineering for multiple large-scale distributed software applications supporting out-of-hours changes where necessary.
Experience & Skills Required:
* Bachelor’s degree (or equivalent) in computer science or related discipline.
* 7+ years experience working in DevOps, SRE or infrastructure management roles.
* Familiar with AWS and its services, particularly IAM, S3, EKS, Networking.
* Experience with implementing/enhancing IaC specifically Terraform.
* Experience using CloudFormation/CDK.
* Working knowledge of containerisation and orchestration tools (Docker, Kubernetes).
* Exposure to working with serverless computing.
* Experience in delivering monitoring and alerting solutions for diverse systems.
* Familiarity with CI/CD pipelines and tools.
* Experience working with database technology, e.g. MySQL & Postgres.
Place of work: Remote with two days a month in the office.
#J-18808-Ljbffr