Posted on: 15/10/2025
Description :
What You'll Do :
Key Responsibilities :
- DevSecOps & System Reliability : Contribute to a robust DevSecOps culture by building and maintaining large-scale, massively distributed, and fault-tolerant systems.
- Operational Excellence : Collaborate with cross-functional development and operations teams to build and operate highly available, cost-effective systems with exceptional uptime.
- Incident Management : Serve as a first responder in a 24/7, follow-the-sun model for incident and problem management. This includes providing Level 1 and Level 2 support, resolving trouble tickets, and managing PagerDuty alerts.
- Continuous Improvement : Conduct in-depth root cause analysis (RCA), implement remediations for incidents, and establish comprehensive end-to-end monitoring and alerting on all critical aspects of our systems.
- Documentation & Knowledge Transfer : Actively document operational tasks and contribute to knowledge transfer within the team.
- Change Management : Maintain a strong focus on all changes deployed to production environments.
What experience you need :
- B.Tech
- 2+ years of experience in software engineering, systems administration, database administration, or networking.
- 1+ years of experience developing or administering software in a public cloud environment.
- Proven experience in monitoring infrastructure and applications to ensure functional and performance objectives are met.
- Proficiency in languages and technologies such as Python, Bash, and Linux.
- Demonstrable cross-functional knowledge across systems, storage, networking, security, and databases.
- Experience with automation and orchestration tools such as Terraform, Chef, or Ansible, or with container technologies such as Docker and Kubernetes.
- Proficiency with continuous integration and continuous delivery (CI/CD) tooling and practices.
- Cloud certification is strongly preferred.
What could set you apart :
- Proven experience designing, analyzing, and troubleshooting large-scale distributed systems.
- A proactive, systems-oriented approach to problem-solving, coupled with strong communication skills and a sense of ownership.
- Experience with Infrastructure as Code (IaC) tools, including Terraform or CloudFormation.
- A passion for automation and a desire to eliminate manual toil whenever possible.
- Experience building or maintaining systems in a highly secure, regulated, or compliant industry.
- A passion for and experience working within a DevOps culture as a collaborative team member.
Posted in : DevOps / SRE
Functional Area : DevOps / Cloud
Job Code : 1561108