The Apple Services Engineering team (ASE) is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it at an extensive scale, meeting our high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. These engineers build secure, end-to-end solutions. They develop the custom software used to process all the creative work, the tools that providers use to deliver that media, all the server-side systems, and the APIs for many Apple services. Thanks to Apple’s unique integration of hardware, software, and services, engineers here partner to get behind a single unified vision. That vision always includes a deep commitment to strengthening Apple’s privacy policy, one of our core values. Although services are a bigger part of Apple’s business than ever before, these teams remain small, and multi-functional, offering greater exposure to the array of opportunities here. Description The Site Reliability Engineer (SRE) role in Apple Services Engineering requires a mix of strategic engineering and design along with hands-on, technical work. This SRE will configure, tune, and fix multi-tiered systems to achieve optimal application performance, stability and availability. We manage jobs as well as applications on bare-metal and cloud computing platforms to deliver data processing for many of Apple’s global products. Our teams work with exabytes of data, petabytes of memory, and tens of thousands of jobs to enable predicable and performant data analytics enabling features in Apple Music, TV, Appstore and other world class products. If you love designing, running systems that will impact millions of users then this is the place for you THE MAIN RESPONSIBILITIES FOR THIS POSITION INCLUDE: Support Java based applications & Spark/Flink jobs on Baremetal, AWS & Kubernetes. Understand the application requirements (Performance, Security, Scalability etc.) and assess the right services/topology on AWS, Baremetal & Kubernetes. Build automation to enable self-healing systems. Build tools to monitor high performance & alert the low latency applications. Troubleshoot application specific, core network, system & performance issues. Involvement in challenging and fast paced projects supporting Apple's business by delivering innovative solutions. Monitor production, staging, test and development environments for a myriad of applications in an agile and dynamic organization. Minimum Qualifications BS or MS degree in Computer Science with years of experience in a Site Reliability Engineering (SRE) and/or DevOps role. Years of experience of running services in a large scale nix environment and understanding of SRE principles & goals along with prior on-call experience. Deep understanding and experience in one or more of the following - Hadoop, Spark, Flink, Kubernetes, AWS. The ability to design, author, and release code in any language (Go, Python, Ruby or Java). Key Qualifications Preferred Qualifications Fast learner with excellent analytical problem solving and interpersonal skills. Experience supporting Java applications. Experience on Big Data Technologies. Experience working with geographically distributed teams and implement high level projects and migrations. Strong communication skills and ability deliver results on time with high quality. Education & Experience Additional Requirements