American Express Senior Service Assurance Engineer II in London, United Kingdom
Description
You Lead the Way. We’ve Got Your Back.
With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a career journey that’s unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally.
At American Express, you’ll be recognized for your contributions, leadership, and impact—every colleague can share in the company’s success. Together, we’ll win as a team, striving to uphold our company values and powerful backing promise to provide the world’s best customer experience every day. And we’ll do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong.
Join Team Amex and let's lead the way together.
We’re looking for a Site Reliability/Application Support Engineer responsible for web application performance, availability, and reliability. The candidate will provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. Site Reliability Engineering/Application Support (SRE/AS) is a continuous engineering discipline that combines software development and systems engineering to build and run scalable, distributed, fault-tolerant systems. This role will ensure that American Express internal and external services have reliability and uptime appropriate to users' needs. We also drive continuous improvement while keeping an ever-watchful, automated eye on capacity and performance.
How will you make an impact in this role?
This role will drive the SRE/AS mindset, which strives to use software engineering to build and run better production systems. You will write software to optimize day-to-day work through better automation, monitoring, alerting, testing and deployment. You’ll be expected to work with several Technology partners to identify areas of opportunity within the availability platform and build solutions that automate monitoring for the modernization platform and its technology, using constant innovation to drive efficiencies. You will be responsible for implementing tracing, monitoring, and tooling solutions to maximize the performance and availability of our Web applications. This is an opportunity to work in one of the best Technology units to help improve customer experience for American Express digital assets and influence how millions of people interact with their cards, their merchants, and their money.
The Senior Service Assurance Engineer II (SRE/AS Engineer) role is a hands-on Senior Architect Level position supporting the American Express Site Reliability Engineering/Application Support team.
What you will be doing
1. Research the latest technologies and concepts, conceptualize solutions, and develop proofs of concept that will improve the resiliency and performance of the production infrastructure
2. Design and implement innovative solutions and frameworks that will improve software engineering velocity, infrastructure resiliency and security, and data availability
3. Develop common framework components (to be leveraged by enterprise applications), define standards for configuration, monitoring, reliability, and performance engineering
4. Identify and mitigate major incidents, acting as lead on root cause identification and permanent resolution for all incidents
5. Continuously improve automated remediation tasks to ensure the highest levels of availability
Qualifications
1. Open to working in a 24/7 or on-call environment
2. BS or MS degree in computer science, computer engineering, or other technical discipline, or an equivalent 8 years of work experience in Site Reliability Engineering/Application Support (SRE/AS) supporting full-stack applications
3. Development or support of Java/J2EE, React JS, and Node.js applications
4. Hands-on experience with frameworks: Spring Boot, Vert.x, Node.js
5. Experience in designing mission critical highly available enterprise applications
6. Hands-on experience with performance testing framework design and tuning Java applications
7. Experience managing relational and NoSQL databases such as DB2, Postgres, Mongo, Couchbase, Cassandra etc.
8. Strong knowledge of Linux internals and experience managing Linux systems in high traffic environments
9. Strong interpersonal communication skills and the ability to work well in a diverse team-focused environment
10. Experience with Splunk and/or ELK, including hands-on experience configuring Splunk, Grafana dashboards, Elastalert, OpenSearch, etc.
11. Good understanding of cloud technologies - Kubernetes, OpenShift, Docker etc.
12. Good understanding of GraphQL queries and resolvers
13. Knowledge of public cloud technologies such as GCP, AWS, and Azure would be an advantage
14. Monitoring and analyzing PMI data
15. Hands-on experience with enterprise toolsets such as Grafana, Dynatrace, AppDynamics, BMC, Prometheus etc.
16. Understanding of using Agile Practices in Operations teams
17. Experience in handling DDoS/bot attacks and related security remediations
18. Working experience with Network load balancers, Global Traffic Managers (GTMs), Local Traffic Managers (LTMs)
19. Working experience with network rule creation, load balancer configuration, and network packet analysis
20. Analytical skills and exposure to root cause identification using analyzer tools like IBM Support Assistant, Splunk etc.
Benefits
We back our colleagues and their loved ones with benefits and programs that support their holistic well-being. That means we prioritize their physical, financial, and mental health through each stage of life. Benefits include:
* Support for financial well-being and retirement
* Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location)
* Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
* Generous paid parental leave policies (depending on your location)
* Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
* Free and confidential counseling support through our Healthy Minds program
* Career development and training opportunities
Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.
Job: Technologies
Primary Location: United Kingdom-London-London
Other Locations: United Kingdom-London-London