GTIS Public Cloud Engineering is a global team of circa 100 colleagues based in the UK, India, and the US. We are accountable for strategic engineering and delivery of Public Cloud services within Enterprise Technology. Our team is at the forefront of the migration of Barclays applications to public cloud, with a current focus on delivering the underlying AWS and Azure platforms. We are expanding our existing Engineering team to meet the objective of “You Build It, You Own It”.
This role is for a Site Reliability Engineer (SRE) in a team that will be part of the whole lifecycle of feature development, from solution design all the way through to production support and back again, helping to shape and drive our SRE capability. You must demonstrate strong problem-solving techniques and display an ability to reach decisions under conditions of uncertainty or high risk.
Microsoft Azure Accreditation – Experience with the Azure platform, including its services, architecture, and best practices.
Proficiency in Python – Intermediate proficiency in Python is essential for automating tasks, building tools, and managing infrastructure.
Experience with CI/CD Pipelines – The ability to build and maintain CI/CD pipelines for zero-touch deployments is crucial for ensuring smooth and efficient software delivery.
Familiarity with ITIL Framework – Knowledge of ITIL practices can help in managing incidents, changes, and service requests effectively.
Agile/Kanban Experience – Experience working within Agile or Kanban teams will be beneficial for adapting to the dynamic environment and collaborating effectively.
Cross-functional Collaboration – The ability to work with cross-functional teams to ensure seamless integration and deployment processes is highly desirable.
You may be assessed on the key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen, strategic thinking, and digital and technology, as well as job-specific technical skills.
To apply software engineering techniques, automation, and best practices in incident response to ensure the reliability, availability, and scalability of our systems, platforms, and technology.
Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.
Resolution and analysis of, and response to, system outages and disruptions, and implementation of measures to prevent similar incidents from recurring.
Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.
Monitoring and optimisation of system performance and resource usage, identification and resolution of bottlenecks, and implementation of best practices for performance tuning.
Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and close work with other teams to ensure smooth and efficient operations.
Stay informed of industry technology trends and innovations, and actively contribute to the organisation's technology communities to foster a culture of technical excellence and growth.
To advise and influence decision making, contribute to policy development and take responsibility for operational effectiveness. Set objectives and coach employees in pursuit of those objectives, appraise performance relative to objectives, and determine reward outcomes.
OR for an individual contributor, they will lead collaborative assignments, guide team members through structured assignments, and identify the need for the inclusion of other areas of specialisation to complete assignments. They will identify new directions for assignments and/or projects, identifying a combination of cross-functional methodologies or practices to meet required outcomes.
Identify ways to mitigate risk and develop new policies/procedures in support of the control and governance agenda.
Take ownership for managing risk and strengthening controls in relation to the work done.
Engage in complex analysis of data from multiple sources of information, both internal and external, such as procedures and practices (in other areas, teams, companies, etc.). 'Complex' information could include sensitive information or information that is difficult to communicate because of its content or its audience.
All colleagues will be expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence and Stewardship – our moral compass, helping us do what we believe is right. They will also be expected to demonstrate the Barclays Mindset – to Empower, Challenge and Drive – the operating manual for how we behave.