We value individuals that are self-starters who have a passion to learn, build, automate and rollout infrastructure services globally.
Help build a team that responses to incidents, troubleshoots production issues, interfaces with internal customers, collects data and provides feedback on how to improve operations and platform's stability.
* Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas.
* Understand the end-to-end configuration, technical dependencies, and overall behavioural characteristics of production services.
* Partner with development teams in defining and implementing improvements in Operations.
The team will act as the first escalation point for complex or critical issues and will be responsible for initial triaging and coordination of production incidents.
Career Level - IC3
Basic Qualifications
* 5+ years of SRE/Devops/Automation experience in a Linux based environment
* Experience with Linux shell scripting, and Python
* Familiarity with CICD environments
* Familiarity with Agile Development
* Proficient with Git and Terraform
* Proficient with commonly used networking protocols such as TCP/IP, HTTP, DNS, SSH
* Familiarity with docker containers, Kubernetes
* Troubleshooting and performance tuning skills.
* Bachelors in computer science and Engineering or related engineering fields or relevant industry experience
#LI-DNI