Service Management Lead
Project Role Description : Lead the delivery of programs, projects or managed services. Coordinate projects through contract management and shared service coordination. Develop and maintain relationships with key stakeholders and sponsors to ensure high levels of commitment and enable strategic agenda
Must have skills : Site Reliability Engineering
Good to have skills : NA
Minimum 5 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:As a Service Management Lead, you will lead the delivery of programs, projects, or managed services. You will coordinate projects through contract management and shared service coordination. Your role will involve developing and maintaining relationships with key stakeholders and sponsors to ensure high levels of commitment and enable the strategic agenda.
You will be based in Noida and should have a minimum of 5 years of experience in Site Reliability Engineering.
Expectation: Candidate should have technical exposure in public cloud – AWS & Azure, DevOps, Microservices and Coding. An Architect with extensive technical expertise to work with customers' complex business problems. Candidate should have extensive exposure on Infrastructure in cloud with IAC, Python, PowerShell, Ansible, GitHub, Jenkins, Terraform, JSON, Puppet etc.
A candidate having more than 12+ years of experience in Infrastructure which includes design, build and deployment in DataCenter services and Cloud. Well familiar with Coding, design, build and deployment with CI/CD Pipelines. Should have at least two end-to-end project design, implementation and support as an SRE.Should be well familiar on identifying opportunities which includes technical debt, reducing waste and coding techniques specially on IAC. Knowledge on SPLUNK is an added advantage.
Should have working knowledge on any Observability tool and other enterprise monitoring tools is an added plus.
Certifications: One Data Center technologies and Cloud.Desired exposure:
- Exposure and hands-on exposure on Publc Cloud – Specially on Infrastructure.
- Should be well versed with Monitoring, Observability and other enterprise management tools.
- Extensive exposure on coding using python. Powershell, Ansible, Jenkins, Terraform etc.
- Exposure as an hands-on SRE atleast for 5+ years.
- Exposure to automation – Specially IAC, building Pipelines in public cloud, Deployments, ARM & other templates.
- Strong knowledge on Coding – Python, Powershell, Ansible, Jenkins, Terraform, Git, JSON , Puppet etc.
- Strong in SRE knowledge and exposure specially identifying toils, techdebt, reducing waste etc.
- Must have exposure on SPLUNK and other enterprise management tools
- Exposure to Observability tools and framework
- IaaS/PaaS products - Support for Containers and Cloud Native Stack
- Lateral and Logical Troubleshooting as Cloud admin.
- Complete understand of Cloud Network topology
- Docker- Design/Built/Deployment – At least 2 years of technical exposure
- CI – CD exposure – with Full end-to-end DevOps life cycle experience.
- Exposure as an SRE with strong coding background to automating Toils.
- Troubleshooting, health check, administration, management, vendor coordination, interaction with external partner, elevation to stakeholders for support or application teams for application development related issues (bug, code maintenance, code evolution)
- Capacity monitoring; monitoring; application availability managements & monitoring, reporting and maintenance activities (if documented)
- Work on reduction of repeated failures; generate reports, dashboards
- Performance review: performance management, tuning, fix issues, work on reduction of repeated failures, scripts, automation
- Generate reports, dashboards, deploy agents
- Monitor Docker Envelops, maintain Dockers images
- Work on reduction of repeated failures; generate reports, dashboards
- Supporting Compliance requirements
- Must To Have Skills: Proficiency in Site Reliability Engineering.
- Excellent communication and relationship-building skills.
- Ability to lead and motivate teams to achieve project goals.