Posted on: 10/11/2025
Description :
Responsibilities :
- Perform production on-call and release operations.
- Implement production systems infrastructure to meet the company's growing needs.
- Automate and maintain our continuous delivery pipeline for consistent software releases.
- Scale infrastructure up as needed.
- Perform root cause analysis of production issues and report the findings, including recommendations for identifying future issues more quickly and for preventing future failures entirely, whether through process or technology improvements.
- Manage backups and disaster recovery, including backup monitoring and verification, and leading restoration tests and disaster recovery drills.
Requirements :
- At least 3 years of experience in software infrastructure and engineering.
- MS/BS degree in Computer Science, related technical field, or equivalent work experience.
- Knowledge of production operations and best practices.
- Experience with more than one end-to-end DevOps cycle in previous projects.
- Ability to prioritize work effectively under fast-changing requirements.
- Ability to work independently, i.e., without a lot of hand-holding.
- Strong background in managing Linux/Unix systems.
- Knowledge of and experience with scripting languages - Python, Bash shell.
- Excellent knowledge of Amazon Web Services products (EC2, ECS, ElastiCache, Route 53, VPC/private cloud configurations, and others).
- Experience with Infrastructure as Code - CloudFormation, Terraform [Must].
- Experience with MySQL, Nginx.
- Experience with CI/CD platforms such as Jenkins.
- Experience with Version Control systems - Git.
- Experience with configuration management systems - Ansible, Chef [Must], etc.
- Experience with monitoring platforms - Nagios, Grafana, EFK, New Relic, etc.
- Experience maintaining and running large-scale web apps.
- Experience with microservices and container technologies - Docker [Must].
- Experience with secrets management tools - Vault.
- Experience with service discovery and configuration tools - Consul.
- Experience handling and analyzing large volumes of logs and performing anomaly detection.
- Experience analyzing and resolving complex infrastructure resource and application deployment issues.
Skills :
- Linux, Unix, AWS, Docker, CI/CD, MySQL, Jenkins, Git, Python.
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1571952