About the Company:
At RemoteStar, we’re hiring for one of our clients, a leading multinational IT services and consulting company specializing in digital transformation, cloud solutions, and AI-driven innovation. With a strong global presence, the company partners with enterprises across various industries to deliver cutting-edge technology solutions.
Job Title: Site Reliability Engineer (SRE)
Experience: 5 to 9 years
Location: Pan India (Remote)
Work Mode: Initially remote for this project. Later, the client will transition to a hybrid model (3 days in the office per week).
Working Hours: 1 PM to 10 PM or 2 PM to 11 PM, 5 days a week.
Industry Preference: A healthcare background is a must-have.
Responsibilities:
1. Deal with operational issues such as production failures, infrastructure problems, security, and monitoring.
2. Ensure the availability, performance, and scalability of a website or application.
3. Work closely with developers to identify and fix potential issues before they cause problems for users.
4. Monitor systems and create plans for responding to incidents.
5. Take part in capacity planning and performance tuning to ensure that the site can handle increased traffic without issue.
6. Have a deep understanding of how distributed systems work in order to troubleshoot and optimize them.
7. Be familiar with monitoring tools such as AppDynamics, Splunk, and GCP Operations Suite.
8. Understand how different types of databases work to effectively troubleshoot any issues that may arise.
9. Have experience working with cloud-native applications in order to manage them effectively.
10. Communicate clearly and concisely about system alerts or outages to other team members.
11. Deal with unexpected outages or performance issues.
12. Have experience with monitoring tools, configuration management tools, and automation tools.
13. Have solid experience with Azure and GCP.
Note: The candidate should be able to mature the SRE practice across the division and be comfortable acting as a champion and leader in the SRE space.