The successful candidate will ensure operational stability & performance of OR systems across CRM, workflow, IIP & Field operations to deliver expected business benefits. You will focus on driving the adoption of operational best practices across OR platforms and optimising service levels across OR, representing OR Technology at senior stakeholder level on our operational performance, trading and service reliability. As a Site Reliability Engineer Manager, you will be required to build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimising existing systems and reducing work through automation.
What you'll be doing
* SRE
* Operational Readiness
* Operational Stability
* Operational Assurance - ensure all delivered solutions are fit to run
* Trading Reliability - SLAs met for P1/P2/P3 incident management
* Incident Turnaround and resolution - lead on E2E ownership of incidents though the resolution may be owned by other teams
* Automation - reduction in levels of ASG Service Requests/manual workarounds and automation of incident handling
* Security coverage of the estate along with application of latest security patches
* Customer and stakeholder management
* Building key critical skills across the team
* Support cultural change by demonstrating how systems need to be supported
* Process change - continuous improvement of ASM processes
* Innovation to find new ways to provide operational system support whilst maximising efficiencies and cost savings (using AI/ML)
* Innovation to find new ways to contribute to business benefits via functional changes
* Maintain oversight of key technology transformation programmes in your area of expertise, monitoring performance against business objectives and scorecard, responding to trends, recommending actions and taking executive action
May have an engineering or science degree from a Tier 1 institution, or have served a technical apprenticeship and/or obtained an NVQ and/or further education technical qualifications (e.g. HND)
* Eligible for, and possibly already a member of, a professional engineering/science institution and working towards chartered engineer accreditation
* Relevant professional experience in a Software Development, DevOps, Site Reliability Engineering, Support or Infrastructure position or team
* Hands-on experience in ideating, implementing and delivering SRE practices across a mid-sized or large organisation
* Demonstrable knowledge of continuous integration and/or continuous deployment tools and scripting
* Ability to conduct thorough investigations, including a deep dive, into reliability and scaling issues from both a code and infrastructure perspective
* Experience working with source management tools like GitHub
* Experience with Microservices architecture
* Strong knowledge of Change and Incident Management processes
* Docker, Kubernetes
* Python
* Monitoring tools like AppDynamics (AppD), Datadog, Dynatrace
Environment:
* Linux
* GCP production environment
* Shell scripting (Bash or similar)
* ELK, AppDynamics
* Helm and Kubernetes manifests
* CI/CD pipelines on GitLab
* Scalable Docker containers, ideally on Kubernetes
* Dockerfiles for Node.js and PHP applications
I execute brilliantly on clear priorities that add value to our customers and the wider business.
Commercially savvy: I demonstrate strong commercial focus, bringing an external perspective to decision-making.
Looking to the future:
Growth mindset: I experiment and identify opportunities for growth for both myself and the organisation.
BT is part of BT Group, along with EE, Openreach, and Plusnet. Millions of people rely on us every day to help them live their lives, power their businesses, and keep their public services running. We connect friends to family, clients to colleagues, people to possibilities. We keep the wheels of business spinning, and the emergency services responding. We value diversity and celebrate difference. 'We embed diversity and inclusion into everything that we do. It's fundamental to our purpose: we connect for good.' We all stick to the same values: Personal, Simple, and Brilliant. From day one, you'll get stuck in to tough challenges, pitch in with ideas, make things happen. But you won't be alone: we'll be there with help and support, learning and development. This is your chance to make a real difference to the world: to be part of the digital transformation of countless lives and businesses. Grab it.
* Annual on-target bonus of 10% (personal and company multipliers)
* BT Pension scheme; minimum 5% employee contribution, BT contribution 10%
* Life Assurance
* Direct share scheme
* Exclusive colleague discounts on our latest and greatest BT broadband packages
* 50% off EE mobile pay monthly or SIM only plans and 50% discount for friends and family on EE SIM only plans
* My Discounts gives colleagues access to unbeatable savings on everyday purchases at hundreds of retailers
* Discounted EE TV including TNT Sports and the NOW Entertainment membership
* Great support for working parents including pay whilst on maternity, adoptive, and paternity leave
* Option to join the Healthcare Cash Plan or other benefits such as dental insurance, gym memberships etc.
* 25 days annual leave (not including bank holidays), increasing with service, with an option to buy additional holiday
* Volunteering days so you can give back to your local community
* Brand new electric vehicle salary sacrifice arrangement, known as 'My EV'
Our leadership standards
Looking in:
Leading inclusively and safely: I inspire and build trust through self-awareness, honesty and integrity.