About us
Every day we deliver safe and secure energy to homes, communities, and businesses. We are there when people need us the most. We connect people to the energy they need for the lives they live. The pace of change in society and our industry is accelerating and our expertise and track record puts us in an unparalleled position to shape the sustainable future of our industry.
To be successful we must anticipate the needs of our customers, reducing the cost of energy delivery today and pioneering the flexible energy systems of tomorrow. This requires us to deliver on our promises and always look for new opportunities to grow, both ourselves and our business.
IT and Digital works in a harmonised partnership with the National Grid group of diverse energy businesses to deliver technology which revolutionises the way we operate. As we lead the charge towards a carbon-free future, our teams are embracing disruptive changes in our industry by working with Agile methodologies and adopting Digital mindsets to drive efficiency and bring new capabilities for our internal and external customers.
National Grid is hiring a Platform Owner, AI OPS.
Job Purpose
As a Platform Owner of AI Ops and SRE, your primary objective is to design and oversee the implementation of complex systems that meet functional and non-functional requirements. You will play a key role in developing system design policies, standards, and innovation processes specific to AI Ops and SRE. Additionally, you will actively monitor emerging technologies and assess their potential impact on the organization. Your responsibilities will include driving the strategic vision for AI Ops and SRE within the platform, ensuring alignment among stakeholders, and promoting a cohesive approach to AI Ops and SRE implementation.
What you'll do
As a Platform Owner of AI Ops and SRE, your primary responsibility is to develop comprehensive strategies for implementing AI Ops and SRE practices within the organization. This involves understanding business requirements, assessing technical capabilities, and identifying areas where AI and automation can be leveraged to enhance reliability, performance, and operational efficiency.
1. Strategic Leadership: Define and execute comprehensive strategies for implementing AIOps and SRE practices aligned with business objectives.
2. Cloud Architecture solutions: Design scalable and resilient cloud architectures to support energy-sector-specific applications, leveraging AIOps for predictive monitoring and automated incident response.
3. SRE Implementation: Establish and promote SRE principles, including reliability engineering, service-level objectives, and monitoring strategies tailored to energy systems.
4. AIOps Integration: Oversee the implementation of AIOps platforms, ensuring the seamless integration of AI-driven insights into IT operations.
5. Collaboration: Partner closely with engineering and operations teams to provide technical guidance and ensure the successful implementation of AI Ops and SRE practices.
6. Continuous Improvement: Monitor and enhance system performance through iterative AIOps and strategies that incorporate AI Ops and SRE practices.
7. Implementing AI-Driven Monitoring and Analytics: Implement AI-driven monitoring and analytics solutions within the cloud domain.
8. Managing Infrastructure: Manage the infrastructure platform within budget guardrails to ensure alignment with company priorities and goals.
About you
1. Bachelor's degree in a relevant discipline, or an equivalent combination of education, training, and experience.
2. 5 - 7 years of related experience with cloud platforms such as Azure preferred, Amazon Web Services (AWS), or Google Cloud Platform (GCP).
3. Proficient in Docker and Kubernetes for deploying and managing containerized applications at scale.
4. Knowledgeable in Terraform and AWS CloudFormation for automating infrastructure provisioning and management.
5. Familiar with tools like Prometheus, Grafana, ServiceNow, ELK Stack, and Splunk for system performance monitoring and troubleshooting.
6. Experienced with CI/CD pipelines and tools such as GitHub and GitLab CI/CD.
7. Knowledge of configuration management tools like Ansible, Puppet, or Chef.
8. Proficiency in incident management tools and collaboration platforms.
9. Understanding of networking concepts, protocols, and security best practices.
10. Knowledge of database technologies such as MySQL, PostgreSQL, MongoDB, or Redis.
What you'll get
A competitive salary between £70,000 - £95,000 - dependent on capability. As well as your base salary, you will receive a company car or allowance, a bonus of up to 20% of your salary for stretch performance, and a competitive contributory pension scheme.
More Information
The closing date for this vacancy is 28th January. However, we encourage candidates to submit their applications as early as possible and not to wait until the published closing date.
DE & I statement
At National Grid, we work towards the highest standards in everything we do, including how we support, value and develop our people. Our aim is to encourage and support employees to thrive and be the best they can be. #J-18808-Ljbffr