Site Reliability Engineer
Job ID: R0347324
Full/Part-Time: Full-time
Regular/Temporary: Regular
Listed: 2024-09-20
Location: Birmingham
Position Overview
You will work closely with application teams to ensure stable, well monitored applications that are resilient to faults. You will agree and review Service Level Agreements (SLAs) and Service Level Objectives (SLOs) to achieve high availability for applications based on their criticality.
You will maintain Error Budgets for the application teams and prevent releases in the event of production instability and reduced availability.
You will have knowledge of and experience in relevant tools used in the SRE environment and be a specialist in one or more technical domains to ensure that all associated stakeholders are provided with an optimum level of service in line with Service Level Agreements (SLAs) / Operating Level Agreements (OLAs).
Your key responsibilities
* Monitoring/alerting across Corporate & Investment Banking applications, providing optimum service level to the business.
* Work with application teams to build monitoring solutions to alert in the event of failures/performance issues and to optimise uptime.
* Provide feedback loops to continually improve application resilience across multiple application teams.
* Agree, and maintain SLAs, SLOs and Error Budgets where necessary to ensure availability for end users and to achieve appropriate levels of application stability.
* Identify and eliminate toil for both the application teams and the SRE team to optimise effectiveness.
* Manage outage resolution with technical and business teams and agree actions to reduce the likelihood of failure happening in future.
Your skills and experience
* Bachelor Degree or equivalent in Computer Science or IT-related discipline or equivalent experience in IT in large corporate environments, specifically in controlled production environments or Financial Services Technology in a client-facing function.
* Demonstrable Site Reliability Engineering experience.
* Excellent analytical and problem solving skills.
* Experience in Prometheus/Grafana monitoring stack.
* Scripting skills (Groovy, shell etc).
* Experience in mid-range technologies, platforms, i.e. UNIX and ORACLE database experience required. Working experience of Openshift and Kubernetes also desirable.
How we’ll support you
* Training and development to help you excel in your career.
* A culture of continuous learning to aid progression.
* We value diversity and as an equal opportunities’ employer, we make reasonable adjustments for those with a disability such as the provision of assistive equipment if required (e.g. screen readers, assistive hearing devices, adapted keyboards).
About us
Deutsche Bank is the leading German bank with strong European roots and a global network. Click here to see what we do.
Deutsche Bank in the UK is proud to have been named in The Times Top 50 Employers for Gender Equality 2024 for five consecutive years. Additionally, we have been awarded a Gold Award from Stonewall and named in their Top 100 Employers 2024 for our work supporting LGBTQ+ inclusion.
We strive for a culture in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively. Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group. We welcome applications from all people and promote a positive, fair and inclusive work environment.
#J-18808-Ljbffr