Description:
This is a Non-Production Management Technical Lead (L2 SRE) position in North America DevOps, supporting Global Consumer Group (GCG) applications. GCG Production Management is in the midst of a transformation, expanding the support model to incorporate Service Reliability Engineering principles. In support of this transformation, this role blends traditional ITIL-based Production Management with Service Reliability Engineering. The ideal candidate will have experience with and broad knowledge of North America Consumer applications, along with an interest in learning new technologies, including the use of automation and artificial intelligence to avoid system problems, automate manual activities, and drive improved system & application service levels. The work is supported by offshore and onshore contractors who provide 7x24 service for North America. This is a technical leadership position, requiring strong organizational and communication skills in addition to analytical and troubleshooting talent. Partnership with Development teams, Technology teams in CTI, and other Production Management teams is a critical, daily component of this position.
This position will provide technical leadership for GCG applications. The lead will work with peers in the DevOps team to drive stability, and will collaborate with the app Dev community, CTI partners, TPM, and other stakeholders to identify and create the value chain and to identify and conduct POCs that close gap areas.
Primary Responsibilities:
1. Provides expertise related to various Distributed Consumer Applications across multiple Lines of Business in North America.
2. Primary point of contact for the assigned LOB domain.
3. Enable Production Management processes in the non-production environment to provide environment stability.
4. Execute robust service readiness.
5. Facilitate standard toolset adoption for all services in the domain.
6. Works as an L2 expert to support problem management, risk management, change management, and CI/CD pipeline enablement for the identified SRE function.
7. Has overall accountability for non-production stability in the assigned area/domain.
8. Partners with Level 1 and Level 3 support teams to improve resolution rates, efficiency targets, and organizational Service Level Agreements.
9. Partners with SRE enablement, and eventually works as an SRE, to identify key areas and provide SRE recommendations from UAT to PERF and PROD for the key business transactions supported.
10. Knowledge of technologies such as OSE, Kubernetes, Apigee, platform services, DataPower, Google Cloud, AWS, CI/CD pipelines, ITIL, and Service Management.
11. Identifies and leads the implementation of Service Automation to reduce cost, reduce risk, improve efficiency, and enable Service Management to keep up with the ever-increasing volume and fast pace of newer technologies.
12. Continually evolve the working practices within and services provided by Production Management to improve efficiency and productivity.
13. Ability to conduct the blameless problem management/post-mortem phase of major incidents, develop executive briefings, assess major incident impacts, and drive service improvements to prevent repeat incidents.
14. Create PMRs for P1/P2 incidents and drive the resulting actions to closure.
15. Identify and classify risks in the non-production estate, then work with peers and team members to create Service Improvement plans and drive them to closure.
16. Create operational readiness documents for major initiatives and provide a seamless handover to the production team.
17. Work with the SRE team to produce a proactive analysis of UAT and PERF before handing over to Production Management.
18. Accountable for the end-to-end service health of the NAM Core space.
19. Has overall accountability for patching, changes, infrastructure changes, certificates, and other KTLO activities in the assigned domain.
20. Has overall accountability for monitoring and its usage by stakeholders, working with the monitoring team on setup.
21. Represent the DevOps team in various digital forums and facilitate the generation of reports and presentations.
22. Be proficient in OSE, Apigee, AWS, and other emerging technologies.
23. Adopt the automation practices laid down by Production Management Automation and AIOps.
24. Support and achieve successful internal audits.
Qualifications:
25. 6-10 years of development or production support experience with North America Consumer applications. Experience or familiarity with cloud technology is a plus.
26. Solid ITIL Foundation understanding.
27. Engineering background in system administration, development, DevOps, or an equivalent field, preferably with experience in Distributed Consumer applications.
28. Experience or familiarity with automation technologies, advanced analytics, and predictive modelling.
29. Ability to develop and manage relationships at all levels.
30. Experience with Oracle and DB2 databases.
31. Programming experience in at least one language, such as Unix shell scripting or Java.
32. Competent with cloud concepts, including APIs, web services, and microservices.
33. Strong analytical, algorithmic, and problem-solving skills
Core Competencies/Skills:
34. Fluent English
35. Strong analytical and problem-solving skills, with the ability to logically break down tasks into smaller, manageable parts
36. Solid understanding of systems and application design
37. Systematic problem-solving approach
38. Strong communication skills and sense of ownership and drive
39. Adaptable and able to work with large, complex services owned by multiple teams.
40. Extremely organized, detail-oriented, and thorough in every aspect.
41. Able to balance multiple tasks and projects effectively while adapting to new variables
42. Uses creative and innovative thinking while maintaining a strong sense of ownership, customer service, and integrity, demonstrated through clear communication
43. Driven, self-motivated, and eager to learn
Education:
44. Bachelor’s/University degree or equivalent experience
Certification in Site Reliability Engineering or Salesforce, or a cloud certification such as AWS or Google Cloud, is a plus.
------------------------------------------------------
Job Family Group:
Technology
------------------------------------------------------
Job Family:
Systems & Engineering
------------------------------------------------------
Time Type:
Full time
------------------------------------------------------
Primary Location:
Jacksonville Florida United States
------------------------------------------------------
Primary Location Full Time Salary Range:
$113, - $170,
In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit Available offerings may vary by jurisdiction, job level, and date of hire.
------------------------------------------------------
Anticipated Posting Close Date:
Apr 30, 2025
------------------------------------------------------