Posted on: 29/09/2025
Job Description :
We are seeking a talented and experienced DevOps/SRE (Site Reliability Engineering) Senior Software Engineer to join our dynamic team. The ideal candidate will have a strong background in DevOps practices, cloud infrastructure management, automation, and team leadership skills. If you have a consistent track record of architecting and building large-scale systems, enjoy solving intriguing system challenges at internet-scale, are innovative at heart, and have a great balance of skills in learning, organizing, building, and enjoy making an impact, this role might be a great fit for you!
Responsibilities :
- Oversee the design, implementation, and maintenance of scalable and resilient cloud infrastructure on platforms spanning AWS and GCP. Ensure high availability, reliability, and performance of critical systems.
- Collaborate with your peers to be responsible for the entire software lifecycle, seek the right problem to solve, and strive for excellence.
- Manage individual project priorities, deadlines, and deliverables related to your technical expertise and assigned domains.
- Lead incident response efforts, working closely with cross-functional teams to resolve issues quickly and minimize downtime. Implement effective incident management processes and post-incident reviews.
- Collaborate with security teams to ensure the integrity and security of infrastructure and applications. Implement security best practices and compliance standards.
- Identify performance bottlenecks and optimize system resources for maximum efficiency. Conduct regular performance tuning and capacity planning exercises.
- Drive continuous improvement initiatives within the team and across the organization. Proactively identify areas for enhancement and implement solutions to address them.
- Maintain comprehensive documentation of systems, processes, and procedures. Foster a culture of knowledge sharing and contribute to the collective learning of the team.
- Participate in a 24x7 on-call rotation and be available to work with global teams in the event of critical outages.
Requirements :
- 8+ years of experience in DevOps/SRE roles.
- Experience in cloud-focused software development, preferably in Go, Python, or other object-oriented programming
languages.
- Experience with a number of the following : ECS, Docker, Kubernetes, Envoy, Istio, Linkerd, and Solo.
- Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation.
- Strong understanding of distributed systems, microservices architecture, and cloud-native technologies.
- The drive and self-motivation to understand the intricate details of a complex infrastructure environment.
- Strong proficiency in cloud platforms such as AWS, Azure, or GCP.
- Solid understanding of networking, security, and compliance principles.
- Proven track record of driving results and delivering high-quality solutions in a fast-paced environment.
- Demonstrated ability to communicate clearly with both technical and non-technical project stakeholders, with the ability to work effectively in a cross-functional team environment.
- Certifications in relevant technologies, such as Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer, or Certified Information Systems Security Professional (CISSP), are preferred.
- BS Degree in Computer Science or Equivalent.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1553985
Interview Questions for you
View All