Job Description
We are looking for an expert Automation Tools Engineer to join our diverse team of cloud and infrastructure automation engineers. In this role, you will design, implement, and maintain tools and services for automating the deployment and configuration of engineering infrastructure and platforms. You’ll work with software engineering, hardware engineering, and IT teams to build and maintain robust systems that support our technology initiatives and solutions.
Responsibilities:
1. Design, implement, and run automation platforms such as Gerrit, CloudBees, HashiCorp Vault, GitLab, Jenkins, Ansible, and Terraform Enterprise, which are used to automate the provisioning and configuration of engineering services.
2. Build and manage monitoring tools and platforms such as Prometheus, Grafana, Azure Monitor, AWS CloudWatch, Dynatrace/Datadog, and similar tools that form our AIOps stack.
3. Develop and maintain automation scripts (Python, Bash/shell, etc.) and tools (GitLab, HashiCorp Terraform, HashiCorp Vault, etc.) to streamline and improve infrastructure deployment, monitoring, and management processes, using Infrastructure as Code (IaC).
4. Analyse system performance and implement changes that improve cost efficiency and user experience.
5. Participate in on-call rotations to ensure 24/7 system availability.
6. Maintain detailed documentation (HLDs and LLDs) of infrastructure, processes, and procedures to facilitate learning and operational continuity.
7. Adopt a continuous-learning mindset, staying up to date with industry trends and new technologies in order to improve operational performance.
Required Skills and Experience:
1. Experience in deploying, maintaining, and integrating automation tools such as Gerrit, GitLab, Jenkins/CloudBees, HashiCorp Vault, Ansible, and Terraform Enterprise.
2. Experience working with public cloud platforms (AWS, Azure, or GCP), containerisation technologies (Docker, Kubernetes, Rancher, Fleet, CloudBees, etc.), and monitoring solutions (Prometheus, Grafana, OpenTelemetry, etc.).
3. Proficiency in monitoring tools and platforms such as Prometheus, Grafana, AWS CloudWatch, Azure Monitor, Datadog, Dynatrace, etc.
4. Skilled in Linux/Windows OS administration and scripting/programming (Bash, Python, and Go).
5. Excellent analytical and problem-solving abilities with a proactive approach to identifying and resolving issues.
Nice to Have Skills:
1. Familiarity with, or experience working in, a hardware or software engineering organization.
2. Experience running large distributed systems environments in the cloud and in on-premises data centres.
3. Familiarity with ITIL practices and incident management frameworks.
In Return:
At Arm, we are guided by our core beliefs that reflect our creative culture and guide our decisions, defining how we work together to surpass ordinary and shape extraordinary.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don’t discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.