Site Reliability Engineer
Full-time Permanent
London - hybrid (3 days in office mandatory)
Up to £62,500 + 5% annual performance-related bonus
We have an exciting new opportunity for a Site Reliability Engineer to join Robert Walters as a Consultant. As an Employed Consultant, you will benefit from permanent employment with Robert Walters and will be deployed on an interim or project basis into our clients' organisations. In return, we will provide you with the opportunity to develop your skills through ongoing training and professional accreditations!
The Opportunity:
Our client is a leader in investment banking, commercial banking, financial transaction processing and asset management. It serves millions of customers, predominantly in the U.S., and many of the world's most prominent corporate, institutional and government clients globally. Through continued investments, business initiatives and philanthropic commitments, it aims to help its employees, customers, clients and communities grow and thrive!
As a Site Reliability Engineer, you will partner with other teams, using your expertise to guide the design, development, and delivery of products and solutions built on multiple platform technologies and application stacks.
Key Responsibilities:
* Cloud Operations: Integrate applications with cloud platforms (e.g., AWS, Azure), ensuring they meet the operational standards for high availability and resilience.
* Automation & Tooling: Develop automation scripts and tools to streamline cloud operations, reduce manual effort, and improve the efficiency of deployments and incident management.
* Performance Optimisation: Work on application performance tuning, collaborating with other SREs and Product Engineers to identify and mitigate performance bottlenecks.
* Observability and Monitoring: Implement code-based instrumentation and telemetry to enable accurate visualisation of, and insight into, the behaviour of our systems at scale.
* CI/CD Pipeline Enhancement: Contribute to the creation and improvement of CI/CD pipelines to support rapid and reliable software releases.
* Monitoring & Incident Management: Assist in implementing robust monitoring solutions, ensuring quick detection and resolution of any issues in the production environment.
* Documentation: Create and maintain comprehensive documentation of code, processes, and infrastructure configurations to ensure transparency and knowledge sharing.
Key Skills/Experience:
* 5-10 years' experience as an SRE
* Strong knowledge of SRE principles
* Technologies: AWS, Kubernetes, Python, Terraform (essential)
* Strong automation experience
* Adaptable and flexible approach to work
We are committed to offering an inclusive recruitment experience. If you require accommodations because of a disability or health condition, please email: gscemeaedi @ robertwalters.com. This position is being sourced through our Outsourcing service line.