Job Description
Key responsibilities:
* Design and implement infrastructure solutions across multi-cloud and on premplatforms.
* It is required that they have mastery experience with Ansible Automation Platform and Linux.
* Collaborate with cross-functional teams to identify, design, and implement end-to-end cloud-based solutions.
* Develop and implement best practices for cloud platform deployments, including building for observability, future maintenance, and pleasant customer experiences
* Expectations to "own" all code and features into production, including troubleshooting and resolving complex technical issues related to the platform.
* Expected to dive into the unknown and/or bleeding edge technologies on behalf of research & development; this means embracing technical challenges even when documentation is sparse, unknown or out of date.
* Stay updated with the latest industry trends and best practices in site reliability engineering.
* Develop and maintain documentation related to system configurations, processes, and procedures.
Your Profile
Key skills/knowledge/experience:
* Sr. level knowledge of Linux OS management
* RedHat Satellite and/or Foreman repository configuration and management
* Ansible Automation Platform experience including, but not limited to:
o Controller management
o Private Hub management
o Execution nodes and Instance Groups
o Execution environments
o Containerization; Docker, Podman, etc.
* CI/CD environment experience (i.e. Github Action, Azure DevOps, etc.)
* Multi-tenant Cloud management experience
* Multiple cloud hosting provider experience (e.g. GCP, AWS, Azure)
* Other configuration management platform experience not mentioned (i.e. Puppet)
* Experience with writing / debugging Terraform / TFE scripts and pipelines
* Expertise in performance optimization as applied to Configuration Management platforms and the OSes they reside
* Experience in reading and interpreting log files in a variety of formats
* Developer competencies in three or more:
o Python
o YAML
o Bash
o Jinja2
* Automation scripting skills
* Able to understand team members code and adhere to their style, or influence style to establish team-wide consistency
* Excellent debugging skills achieved through hands-on experience
* Experience with troubleshooting errors from a system administration viewpoint
* Thorough understanding and experience of Git/GitHub version control operations
* Practical experience in using and managing each of the following OS:
o RedHat Enterprise Linux (CentOS, Fedora okay too)
o Ubuntu/Debian Linux
* Able to work effectively with other teams
* Excellent verbal and written communications skills
* Proven documentation skills as they relate to the job role