About the Role

Roku pioneered TV streaming and continues to innovate and lead the industry. The Roku Channel has us well-positioned to help shape the future of streaming. Continued success relies on investing in Advertising Engineering.

The ideal candidate has endless curiosity and can pair a global mindset with locally relevant execution. The successful candidate will display a balance of hard and soft skills, including the ability to respond quickly to changing business needs.

You’ll be working in a highly skilled engineering team to drive automation, scalability, performance, reliability, and high availability for our complex, large-scale, revenue-critical systems. These systems involve:

- Deployment to GCP and AWS across multiple regions
- Highly scalable systems capable of handling millions of requests per second, with p99 processing times in the low double digits (ms)
- An architecture built from many different (micro)services
- Terabytes of real-time data driving our decisions (billions of records), with real-time sync between regions
- A scalable infrastructure of a few thousand servers for diverse traffic scenarios, including huge live events

What you'll be doing

- Collaborate with cross-functional engineering teams to design and implement highly scalable, reliable, and secure infrastructure solutions
- Develop and maintain CI/CD pipelines, Infrastructure as Code, and automation frameworks tailored to our systems
- Drive disaster recovery planning, high availability architecture, and 24/7 SLO adherence for critical ad-serving solutions
- Build and maintain custom, complex deployment pipelines using Jenkins and other modern tools
- Improve system reliability and developer productivity by creating isolated and containerized environments
- Lead capacity planning, autoscaling strategies, and real-time monitoring solutions for infrastructure supporting dynamic traffic loads
- Advocate for and implement best practices in DevOps, SRE, and infrastructure design
- Provide mentorship to other team members and evangelize technical innovation

We’re excited if you have

- 6 years of experience designing and building DevOps/SRE solutions for high-scale, distributed systems
- Proven expertise with GCP and AWS, including multi-region deployments
- Proficiency with Infrastructure-as-Code (IaC) tools such as Terraform (preferred) or CloudFormation
- Hands-on experience with containers (Docker, Kubernetes, or similar orchestration tools)
- Advanced scripting capabilities in Python, Perl, or other languages
- Strong knowledge of networking concepts, Linux administration, and system-level troubleshooting
- Hands-on experience with CI/CD tools such as Jenkins (preferred), GitHub Actions, or GitLab pipelines
- Experience building and managing high-availability systems and disaster recovery solutions (a plus)
- Familiarity with database administration, particularly for large-scale systems (a plus)
- Understanding of configuration management tools such as Ansible, Chef, Salt, or Puppet (a plus)
- B.S. or M.S. in Computer Science, Engineering, or related field (or equivalent experience)