It's no secret that traditional site reliability teams struggle to keep pace with manual monitoring, reactive troubleshooting, and labor-intensive deployments. The rise of AI presents a solution, but many companies fail to fully leverage its potential, resulting in systems that underperform and bottlenecks that stifle innovation. Data shows that 73% of companies struggle with deployment delays and operational downtime, primarily due to outdated processes and lack of AI-driven automation.
At IgniteTech, we are tackling these issues head-on by building AI-first cloud solutions that are designed to anticipate and prevent problems before they arise. We focus on integrating AI and machine learning into every facet of cloud infrastructure management, from automated monitoring systems to intelligent CI/CD pipelines. This approach creates environments that not only self-heal but also continuously evolve, reducing downtime, improving performance, and pushing the boundaries of what cloud services can do.
This isn’t your typical site reliability role, where you'd be reacting to problems and manually intervening when things go wrong. Here, you’ll lead the charge in building AI-enhanced monitoring systems that detect and resolve 95% of issues before they ever reach end users. You’ll also architect and manage AI-automated CI/CD pipelines that reduce deployment times by 30% while slashing manual interventions. The ideal candidate thrives in an AI-driven environment, is excited by the prospect of automation-first solutions, and enjoys pushing the envelope of cloud infrastructure design.
In this role, you’ll join a global team of innovators who are redefining cloud infrastructure. Your work will play a key role in our mission to deliver next-gen, AI-driven operational excellence. We’re seeking someone who is passionate about AI and ready to make a lasting impact on the future of cloud services. If that’s you, we encourage you to apply and be part of something revolutionary.
What you will be doing
Implementing AI-based monitoring services to automatically detect, predict, and resolve issues before they impact operations
Managing CI/CD pipelines with AI-driven automation to enhance deployment efficiency and reduce manual intervention
What you will NOT be doing
Focusing solely on manual monitoring, troubleshooting, and maintenance of systems; your goal will be to get AI to do these things for you
Key Responsibilities
Achieve seamless scalability and optimize performance for AI-powered cloud services, ensuring 99.99% uptime while delivering AI-enhanced software upgrades and customizations that meet clients' evolving needs
Candidate Requirements
3+ years of DevOps experience, including automation of CI/CD pipelines and infrastructure management
2+ years of experience with Amazon Web Services (AWS) or Google Cloud Platform (GCP)
Proficiency in AI and machine learning tools used for monitoring, automation, and predictive analytics (or strong willingness to learn and adapt to AI-driven technologies)
Strong programming and scripting skills, with experience in automating tasks and building AI-driven processes