Bell Integration has been helping companies establish, maintain and grow their IT services since 1996. Our team of hardworking professionals delivers Bell Integration’s services all over the world, and does so with unmatched efficiency and enthusiasm. We continue to grow and have over 900 permanent staff employed at our offices in London, Portsmouth, Wokingham, Glasgow, Hyderabad, the US and Slovakia, and within many of our customers’ sites. Our heritage is in helping businesses to operate their critical technology more cost-effectively, while improving effectiveness in areas such as customer engagement and operational responsiveness.

As a Site Reliability Engineer (SRE) within the Principal Engineering team, you will ensure the reliability, availability, and performance of our services, primarily utilising AWS with a focus on container, serverless, AI, analytics, and database services. You will work closely with development teams to build scalable and resilient systems and provide advisory support to our support teams.

Collaborate with development teams to design scalable and resilient architectures.
Develop and implement monitoring and alerting solutions.
Automate operational processes and tasks.
Manage and optimise AWS resources, focusing on container (e.g., ECS, EKS), serverless (e.g., Lambda), AI, analytics, and database services.
Perform root cause analysis for incidents and implement preventative measures.
Provide advisory support to support teams.

Proven experience as a Site Reliability Engineer or in a similar role.
Strong knowledge of AWS services, particularly container (e.g., ECS, EKS), serverless (e.g., Lambda), AI, analytics, and database services.
Proficiency in scripting and automation (e.g., Python, Bash).
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, CloudWatch).
Strong problem-solving skills and attention to detail.
Excellent communication and collaboration abilities.
Familiarity with infrastructure as code tools (e.g., Terraform, CloudFormation).
Familiarity with data pipeline and ETL processes.

What we care about:
At Bell, we believe that we are stronger together, and promote an open, collaborative culture where everyone is encouraged to be involved in shaping our business. We value diversity. We seek to employ a workforce representative of the markets that we serve and work hard to ensure that all of our staff have the opportunity to thrive within a friendly and inclusive environment.

Why join Bell:
We prioritise internal development opportunities and offer access to our Udemy training platform with over 5,000 training courses
Competitive salary
Flexible remote working
A generous company pension
25 days annual leave entitlement plus bank holidays and the option to purchase 5 extra days
Healthcare and dental insurance
Life assurance
Cycle to work scheme
A diverse and inclusive work culture
Modern vibrant workplaces
Exclusive discounts with major retailers, discount gym memberships and access to our wellness centre