Posted on: 19/11/2025
Job Description :
- Responsible for generating scripts and templates required for the automatic provisioning of resources in Public Clouds infrastructure.
- Monitor all production servers
- Responsible for debugging technical issues inside a complex stack involving virtualization, containers, microservices, etc.
- Collaborate with the engineering teams to enable their applications to run on Cloud infrastructure
- Work closely with development team to ensure platforms are designed for scale, availability and performance
- Review and recommend improvements to operational processes and procedures utilizing automation.
- Create a comprehensive list of automated performance alerts so that timely actions can be taken.
- Maintain good design and verbatim of the infrastructure for the other members of the Cloud Operations team.
- Implement best practices to ensure security and availability.
- Own up all infrastructure related troubleshooting during unplanned outages
- Adhere to the incident escalation procedure based on the severity
- Maintain compliance with Security and Governance standards.
- Possess technical understanding of DR, BCP strategies, meeting and aligning with defined SLAs, preferably with practical implementation experience.
- Maintain technical documentation, architecture references, change management and reporting
Technical Skills and Experience :
- Experience identifying performance issues on Linux machines CPU, memory, disk
- Linux networking expertise, such as investigating network traffic, port forwarding, tuning OS parameters
- Experience with networking both on an individual server as well as virtual configuration in AWS VPCs
- Experience with configuration management tools such as Ansible, Puppet etc.
- Experience with setting up, securing and administering CI / CD tools like Jenkins
- Hands on experience with AWS services and implementation of spring boot microservices in the cloud
- Experience working with Docker and Kubernetes for container orchestration
- Experience with centralized logging systems, metrics, and tooling frameworks such as ELK, Prometheus, and Grafana
- Self-starter and Self-learner
- Great communication skills
- Familiar with Best Practices in Test Automation
Experience Range : 5 - 10 years
Educational Qualifications : B.Tech/B.E , BCA ,and MSC , M.Tech , MCA
Skills Required :
- AWS
- Docker
- Kubernetes
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1577314
Interview Questions for you
View All