Description Role Description J.P. Morgan is seeking an Senior application support lead for the Payments Testing Program, with a goal of providing an improved client experience across our payments products. The VP will implement SRE principles to implement, manage and improve Non-functional requirements such as availability, reliability, capacity performance and scalability of the payments applications in the Client Testing Environment. The VP will provide front line support, proactive monitoring, problem and incident management in the Client Testing environment. S/he will be hands-on and manage work of other application support resources. The role provides opportunities to work across the business lines with product, operations, and technology partners. There will be significant exposure & visibility to senior management. This role is ideal for highly motivated individuals with advanced strategic thinking, problem solving and communications skills. Responsibilities Work closely with other incident management teams and technical staff to identify and resolve Priority/High Severity issues, and perform Root Cause Analysis to provide long term solutions. Lead initiatives to improve the reliability, stability and performance of applications using data-driven analytics and participating in design discussion Implement site reliability engineering best practices within team Through code and cloud artifacts, configure, maintain, monitor, and optimize applications and their associated artifacts to independently decompose and iteratively improve reliability in the applications. Collaborate with teams in multiple regions and time zones. Identify and implement continuous improvement opportunities, to improve delivery flow across product and technology. Required qualifications, capabilities, and skills Experience supporting Linux, Wintel and private/public cloud-based applications Proficient in Site Reliability culture and principles and familiarity with how to implement site reliability within an application or platform Proficient in applied architecture and distributed system design patterns. Proficient in writing code to solve operational problems and working knowledge of SDLC and CI/CD pipeline. Deep expertise in instrumentation, customization, and usage of modern observability toolset such as Dynatrace, AppDynamics, Grafana, Prometheus, ThousandEyes, Splunk, Geneos. and strong understanding of key tenets of observability such as metrics, logs, events, and traces. Proficiency in managing modern cloud and container platforms like AWS and Kubernetes. Expertise in operating both Linux and Windows platforms Working knowledge of Networking protocols, packet captures, load-balancing, DNS and firewall. Expert in at least one of the relational databases (SQL Server, Oracle) and at least one of the No SQL databases (Cassandra, Mongo) Working knowledge of Batch scripting, Ansible, Control-M, Autosys, Shell Scripting Preferred Qualifications, capabilities and skills BS/BA degree or 5 years applied experience in Application Support/Incident management and Site Reliability Engineering positions Cloud certified and/or working experience on public/private cloud-based applications Proficient in at least one programming language such as Python, Java, C#, .NET