AI Ops and ML Engineer - NetDevOps & Observability
GSK is seeking a highly skilled and experienced Senior Engineer - NetDevOps to join our dynamic IT infrastructure team. This role is pivotal in the ongoing development, deployment, and maintenance of our network infrastructure, with a specialized focus on integrating Artificial Intelligence (AI) and Machine Learning (ML) to enhance network automation, observability, and incident management.
The ideal candidate will bring a strong background in network engineering, AI/ML technologies, and automation, contributing to the modernization of GSK's network and observability platforms.
In this role you will:
AI Ops and ML Development
1. Design and implement AI-driven solutions for anomaly detection, predictive analytics, and automated remediation.
2. Build and fine-tune machine learning models using frameworks like TensorFlow, PyTorch, or Scikit-learn to optimize network operations.
3. Integrate AI Ops frameworks with tools such as Moogsoft, Dynatrace, or Splunk ITSI to deliver actionable insights.
Network Automation and Optimization
1. Automate network tasks, configurations, and maintenance using tools like Ansible, Terraform, and scripting languages such as Python or PowerShell.
2. Develop and maintain CI/CD pipelines for deploying AI Ops and ML solutions in network environments.
3. Enhance monitoring capabilities using AI to reduce alert noise and prioritize critical issues.
Observability and Incident Management
1. Collaborate with teams managing tools like SolarWinds, Zscaler ZDX, Elastic, and Juniper Mist to improve data collection, correlation, and visualization.
2. Use AI/ML to proactively identify and resolve network vulnerabilities and performance issues.
3. Conduct root cause analysis (RCA) for network incidents and implement long-term improvements based on AI insights.
Mentorship and Continuous Improvement
1. Mentor junior engineers, providing guidance on adopting AI/ML technologies, automation, and NetDevOps best practices.
2. Stay updated on industry trends in AI Ops, network automation, and observability, applying innovations to improve operational efficiency.
3. Proactively identify opportunities for optimization and implement solutions to enhance network performance, reliability, and security.
Why you?
Qualifications & Skills:
We are looking for professionals with these required skills to achieve our goals:
1. Minimum of 8 years of experience in network engineering, with at least 2 years focusing on AI Ops or ML applications.
2. Proficiency in machine learning frameworks (e.g., TensorFlow, PyTorch, Scikit-learn) and AI Ops platforms (e.g., Moogsoft, Dynatrace, Splunk ITSI).
3. Expertise in network automation tools and frameworks like Ansible, Terraform, Puppet, or Chef.
4. Strong understanding of network engineering principles, including routing, switching, firewalls, and load balancers.
5. Advanced scripting skills in Python, PowerShell, or Bash.
6. Relevant certifications such as AWS Certified Machine Learning, CCNP, or equivalent.
Preferred Qualifications & Skills:
1. Bachelor's degree in Computer Science, Data Science, Network Engineering, or a related field. Master's degree preferred.
2. Experience in the pharmaceutical industry, with knowledge of its technological trends.
3. Proven ability to deliver large-scale operational programs.
4. Familiarity with agile delivery methodologies and the DevOps operational model.
5. Passion for enhancing the customer experience through AI/ML innovations.
Closing Date for Applications: Thursday 23rd January 2025 (COB)
Please take a copy of the Job Description, as this will not be available post closure of the advert.