HamburgerMenu
hirist

Job Description

Job Description :


We are seeking a talented and experienced DevOps/SRE (Site Reliability Engineering) Senior Software Engineer to join our dynamic team. The ideal candidate will have a strong background in DevOps practices, cloud infrastructure management, automation, and team leadership skills. If you have a consistent track record of architecting and building large-scale systems, enjoy solving intriguing system challenges at internet-scale, are innovative at heart, and have a great balance of skills in learning, organizing, building, and enjoy making an impact, this role might be a great fit for you!


Responsibilities :


- Oversee the design, implementation, and maintenance of scalable and resilient cloud infrastructure on platforms spanning AWS and GCP. Ensure high availability, reliability, and performance of critical systems.


- Collaborate with your peers to be responsible for the entire software lifecycle, seek the right problem to solve, and strive for excellence.


- Manage individual project priorities, deadlines, and deliverables related to your technical expertise and assigned domains.


- Lead incident response efforts, working closely with cross-functional teams to resolve issues quickly and minimize downtime. Implement effective incident management processes and post-incident reviews.


- Collaborate with security teams to ensure the integrity and security of infrastructure and applications. Implement security best practices and compliance standards.


- Identify performance bottlenecks and optimize system resources for maximum efficiency. Conduct regular performance tuning and capacity planning exercises.


- Drive continuous improvement initiatives within the team and across the organization. Proactively identify areas for enhancement and implement solutions to address them.


- Maintain comprehensive documentation of systems, processes, and procedures. Foster a culture of knowledge sharing and contribute to the collective learning of the team.


- Participate in a 24x7 on-call rotation and be available to work with global teams in the event of critical outages.


Requirements :


- 8+ years of experience in DevOps/SRE roles.


- Experience in cloud-focused software development, preferably in Go, Python, or other object-oriented programming

languages.


- Experience with a number of the following : ECS, Docker, Kubernetes, Envoy, Istio, Linkerd, and Solo.


- Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation.


- Strong understanding of distributed systems, microservices architecture, and cloud-native technologies.


- The drive and self-motivation to understand the intricate details of a complex infrastructure environment.


- Strong proficiency in cloud platforms such as AWS, Azure, or GCP.


- Solid understanding of networking, security, and compliance principles.


- Proven track record of driving results and delivering high-quality solutions in a fast-paced environment.


- Demonstrated ability to communicate clearly with both technical and non-technical project stakeholders, with the ability to work effectively in a cross-functional team environment.


- Certifications in relevant technologies, such as Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer, or Certified Information Systems Security Professional (CISSP), are preferred.


- BS Degree in Computer Science or Equivalent.


info-icon

Did you find something suspicious?