Posted on: 30/11/2025
Job Summary :
Responsibilities :
- Automate infrastructure provisioning and configuration using tools like Terraform and Ansible.
- Develop and implement monitoring and alerting systems to proactively identify and troubleshoot incidents.
- Optimize infrastructure costs on AWS through resource management and utilization analysis
- Collaborate with development teams to implement DevOps practices and ensure smooth deployments.
- Participate in on-call rotations and diligently respond to incidents to minimize downtime
- Continuously improve infrastructure reliability and performance through automation and best practices.
- Stay up-to-date with the latest trends and technologies in cloud computing and SRE principles.
Qualifications :
- Proven expertise in deploying and managing infrastructure on AWS (EC2, S3, VPC, etc.)
- Experience in Linux OS is a must. Prior experience as a Linux administrator a plus.
- Strong understanding of networking fundamentals is a must.
- Strong knowledge of infrastructure automation tools like Terraform and Ansible
- Experience with DevOps methodologies and CI/CD pipelines
- A keen understanding of cost optimization principles in AWS
- Excellent problem-solving and analytical skills
- Ability to work independently and as part of a cross-functional team
- Diligent and proactive approach to incident response
- Willingness to participate in on-call rotations
Good to have :
- Experience with container orchestration tools (Kubernetes)
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1582513
Interview Questions for you
View All