System Engineer, Messaging and Streaming Team
Job ID: 2320023 | AWS EMEA SARL (UK Branch)
The region service Messaging and Streaming Team (MAST) is a customer experience-oriented team who is looking for a self-motivated talented engineer who can solve complex problems and have a vision to improvise the service support. MAST builds and supports Messaging and Streaming services such as Kinesis Data Streams, Simple Queue Service (SQS), Simple Notification Service (SNS), Amazon MQ, and Amazon Managed service for Apache Flink (MSF).
We are looking for a technical expert who brings a mix of operations and networking expertise and shares our passion to change the way our customers operate. A systems engineer will be creating and driving opportunities to automate and simplify the daily operations and scale the organizational operations.
Key job responsibilities
1. Work proactively to solve potential problems and inefficiencies. Communicate clearly and collaborate with others to deliver results with minimal supervision.
2. Participate in 24/7 on-call rotation to troubleshoot high severity issues.
3. Analyze dashboards and investigate metrics with the vision for improvements.
4. Create and maintain Standard Operating Procedures (SOPs) and runbooks for documentation.
5. Discuss radical new approaches to automate operational issues, assess risks and develop creative solutions.
6. Develop strategies for resolving identified problems to prevent future occurrences.
7. Assist others in the team.
About the team
The team has the unique perspective of operating all of the messaging and streaming services, instead of the software components. This enables our team to drive cross-organization initiatives to remove operational hurdles, optimize software delivery, and remove bottlenecks felt by all of AWS. Upon joining the MAST Engineering team, every employee is paired with a peer buddy who will help you to quickly come up to speed in understanding the technology we’re building, the tools we use and the business problems we’re trying to solve.
BASIC QUALIFICATIONS
1. Experience writing scripts from scratch for automating manual tasks (BASH, Python, Perl, Ruby or similar).
2. Solid background in Linux. Familiarity with in-depth troubleshooting and ability to solve complex technical problems.
3. Knowledge of network fundamentals (DNS, UDP, TCP/IP, HTTP(s), routing, switching).
4. Experience owning services that are secure, scalable, reliable and efficient. Can identify multiple operational and security risks and then resolve, mitigate and/or escalate them.
PREFERRED QUALIFICATIONS
1. Bachelor’s Degree in Systems Engineering, Computer Science or related field, or relevant work experience.
2. Exposure to cloud computing concepts and design considerations.
3. Experience in a 24x7 production environment.
4. Experience of monitoring frameworks (such as CloudWatch, Datadog, Grafana, Elastic or similar).
J-18808-Ljbffr