Site Reliability Engineer
Location: Bristol
Time Type: Full time
Posted On: Posted 2 Days Ago
End Date: November 17, 2024 (15 days left to apply)
Salary Range: £86,964 - £102,310
Flexible Working Options: Hybrid Working
Job Description Summary:
Our Cloud Site Reliability Engineering team is looking for an experienced and passionate Engineer to join our Consumer Servicing and Engagement Platform.
As an application level SRE, you’ll be an active and leading member of a cloud-focused team of engineers – working on one of the Group’s flagship projects to run and maintain a set of products and services on the Google Cloud Platform (GCP).
Accountabilities will include:
* Delivering against GCP and SRE Public Cloud technology roadmaps.
* Collaboratively working with other engineering teams to release and evolve enterprise-class solutions.
* Being responsible for the operations of a large set of critical banking services (including 24x7 coverage by participating in on-call rota).
What you’d get involved with:
* Working with our service teams and Cloud Platform team to enhance & improve the resiliency and reliability of critical customer facing services.
* Troubleshooting, investigating & diagnosing service issues.
* Building and implementing tooling to assist the service teams.
* Working across multiple labs and signature projects in the Digital space.
* Helping to integrate and then implement Chaos Engineering into the bank.
What You’ll Need:
We welcome applications from candidates with a range of experiences and backgrounds. To thrive in this role, you’ll need to demonstrate the below skills in your CV and application.
Site Reliability Engineering & DevOps experience:
* Strong understanding of Site Reliability Engineering with commercial experience.
* Knowledge of SLAs, SLOs and SLIs is essential.
* Strong understanding of the principles of DevOps.
* Strong experience of CI/CD pipelines.
Troubleshooting & Problem-solving:
* Experience in problem-solving and troubleshooting skills.
* Thorough understanding of observability, monitoring and alerting.
* Demonstrate a passion to continue to learn and develop your engineering skills.
Cloud experience - GCP:
* Experience working with GCP products or extensive experience with another Public Cloud platform.
* A strong understanding of Cloud, Cloud security and Cloud networking.
* Proficient in one or more of the following tooling – Dynatrace, Stackdriver/Cloud Operations Suite.
Effective collaboration & Leadership:
* Excellent collaboration skills with a desire to lead and mentor others.
* Leadership experience including line management and team leadership.
Automation:
* Experience of developing for, or administrating Kubernetes clusters in a production environment.
* Experience in automation to remove toil.
Any evidence of the below would also be beneficial:
* Ability to work with architectural, business and other engineers.
* Ability to work with a team of engineers to deliver domain-owned deliverables.
Continuous learning:
* Demonstrating a commitment to learning and improvement.
Adaptability and Flexibility:
* Technology agnostic and willing to adapt their approach.
* Recent practical experience with Chaos Engineering using tools like Gremlin.
Certifications:
* Certifications in GCP or another cloud platform.
Other:
* Candidates with less direct experience in cloud engineering may also be considered.
About Working for Us:
Our focus is to ensure we're inclusive every day, building an organisation that reflects modern society and celebrates diversity in all its forms.
We also offer a wide-ranging benefits package, which includes:
* A generous pension contribution of up to 15%.
* An annual bonus award, subject to Group performance.
* Share schemes including free shares.
* 30 days’ holiday, with bank holidays on top.
If you’re excited by the thought of becoming part of our team, get in touch. We’d love to hear from you!
#J-18808-Ljbffr