DESCRIPTION
The region service Messaging and Streaming Team (MAST) is a customer experience-oriented team looking for a self-motivated, talented engineer who can solve complex problems and has a vision for improving service support. MAST builds and supports messaging and streaming services such as Kinesis Data Streams, Simple Queue Service (SQS), Simple Notification Service (SNS), Amazon MQ, and Amazon Managed Service for Apache Flink (MSF).
We are looking for a technical expert who brings a mix of operations and networking expertise and shares our passion to change the way our customers operate. As a systems engineer, you will create and drive opportunities to automate and simplify daily operations and to scale organizational operations.
Key job responsibilities
1. Work proactively to solve potential problems and inefficiencies. Communicate clearly and collaborate with others to deliver results with minimal supervision.
2. Participate in a 24/7 on-call rotation to troubleshoot high-severity issues.
3. Analyze dashboards and investigate metrics with a vision for improvements.
4. Create and maintain documentation such as Standard Operating Procedures (SOPs) and runbooks.
5. Discuss radical new approaches to automate operational issues, assess risks, and develop creative solutions.
6. Develop strategies to resolve identified problems and prevent their recurrence.
7. Assist others in the team.
About the team
The team has the unique perspective of operating all messaging and streaming services, instead of just the software components. This enables our team to drive cross-organization initiatives to remove operational hurdles, optimize software delivery, and eliminate bottlenecks felt by all of AWS. Upon joining the MAST Engineering team, you will be paired with a peer buddy who will help you quickly come up to speed on the technology we're building, the tools we use, and the business problems we're trying to solve.
BASIC QUALIFICATIONS
1. Experience writing scripts from scratch to automate manual tasks (Bash, Python, Perl, Ruby, or similar).
2. Solid background in Linux. Familiarity with in-depth troubleshooting and ability to solve complex technical problems.
3. Knowledge of network fundamentals (DNS, UDP, TCP/IP, HTTP(S), routing, switching).
4. Experience owning services that are secure, scalable, reliable, and efficient. Can identify multiple operational and security risks and then resolve, mitigate, and/or escalate them.
PREFERRED QUALIFICATIONS
1. Bachelor's Degree in Systems Engineering, Computer Science, or related field, or relevant work experience.
2. Exposure to cloud computing concepts and design considerations.
3. Experience in a 24/7 production environment.
4. Experience with monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic, or similar).
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify, and build.
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.