Site Reliability Engineer, CloudWatch Infrastructure
Job ID: 2844437 | Amazon Development Centre Ireland Limited
If you love infrastructure and automation, we are the team for you! AWS CloudWatch is blazing new trails as a pioneer in the cloud Infrastructure Monitoring, Application Monitoring, and Log Analytics space. We are seeking a Systems Development Engineer to join the team to help us with delivery of large scale automation projects across a broad range of technical arenas.
What is AWS CloudWatch?
We run one of the largest time-series data stores on the planet monitoring more than 13 quadrillion metric observations and triggering 3.9 trillion events per month. Our teams solve problems of large telemetric data scale, distributed systems/cloud computing, data visualization, anomaly detection, predictive analytics, and root cause analysis. Many of the largest web services in the world rely on our technology to drive their operational excellence. We build, operate and continually improve the CloudWatch service for both technical and business users giving them insight into what is important for them.
CloudWatch is part of AWS Utility Computing (UC) that provides product innovations — from foundational services such as Amazon’s Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS’s services and features apart in the industry. As a member of the UC organization, you’ll support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS.
Key Job Responsibilities
1. You coordinate with internal teams to uncover infrastructure improvement areas and remove them through automation.
2. You contribute toward the forward-looking vision for the team.
3. You help improve operational excellence by reducing technical debt for the team.
A Day in the Life
Why join the CloudWatch Infrastructure team?
Huge scale and endless automation opportunities! The CloudWatch Infrastructure team operates one of the largest fleets inside of AWS, with tens of thousands of servers to ensure continued customer delight. Managing that is difficult and requires coordination with multiple internal CloudWatch teams to uncover infrastructure issues and automate them away. Working at our scale requires long term thinking, continuous innovation, large scope for new ideas, interaction with awesome teammates and never-ending automation possibilities.
Work/Life Balance
Our team puts a high value on work-life balance. It isn’t about how many hours you spend at home or at work; it’s about the flow you establish that brings energy to both parts of your life. We believe striking the right balance between your personal and professional life is critical to life-long happiness and fulfilment.
BASIC QUALIFICATIONS
* Experience in automating, deploying, and supporting large-scale infrastructure.
* Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby.
* Experience with Linux/Unix.
* Experience with CI/CD pipelines build processes.
PREFERRED QUALIFICATIONS
* Experience with distributed systems at scale.
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Posted: December 2, 2024 (Updated about 2 hours ago)
#J-18808-Ljbffr