HamburgerMenu
hirist

Job Description

Job Description :


Function : IT Operations and Support DevOps / Cloud


DevOps :


We are seeking a highly skilled and motivated DevOps Engineer with 8+ years of experience to join our dynamic team. This role is critical to enhancing the reliability, scalability, and efficiency of our infrastructure as we grow our AI and SaaS offerings. If you are passionate about building robust systems and optimizing deployment processes, we'd love to hear from you.


The core responsibilities for the job include the following :


Cloud Infrastructure Management (MUST) :


- Design, implement, and manage scalable cloud infrastructure on AWS (primary), GCP, and Azure.


- Design, develop, and maintain auto-scaling solutions to support high-availability and performance demands.


- Ground-up design of multi-region network architectures for enterprise, including securing these networks in layered trust zones.


- Design and build subnets, VPC peers/inks, and NAT gateways at scale.


- Ground-up experience setting up high availability and disaster recovery systems aligned with specified RPOs and RTOs.


- Automate infrastructure management using Infrastructure-as-code tools.


Containerization and Orchestration (MUST) :


- Set up, monitor, and manage large-scale Kubernetes clusters for seamless application deployment.


- Set up and management of persistent volumes for different PODs with prior experience managing these at scale.


- Set up experience and understanding of best practices on setting up network policies within the Kubernetes cluster.


- Certified Kubernetes administrator (CKA) - preferred 3 CI/CD Pipeline (MUST):


CI/CD Pipeline (MUST) :


- Design, build, and maintain CI/CD pipelines with a focus on improving developer productivity.


- Very good understanding of key Devops metrics (ex, DORA) and had experience improving the same.


Collaboration and Support :


- Work closely with software engineering teams to align DevOps strategies with development goals.


- Provide technical guidance to developers on best practices for deploying and scaling applications.


Performance Monitoring and Troubleshooting :


- Implement robust monitoring/observability tools to track system health and application performance.


- Able to troubleshoot critical issues and develop proactive solutions to prevent downtime.


Security and Compliance (MUST) :


- Ensure security measures (SAST, DAST, etc) are integrated into DevOps processes and workflows.


Requirements :


- Prior experience designing deployment architectures (compute, storage, and network) for complex systems (like microservices, data warehouses/lakes, etc).


- Prior experience in designing for compliance standards. Manage security audits and planning for their renewals.


- 8+ years of experience in DevOps, cloud infrastructure, or related roles for medium or large-scale startups.


- Proven track record in designing and managing large-scale cloud services across AWS, GCP, and Azure platforms, including design of HA and DR setups.


- Strong hands-on technical experience, someone who can lead by example.


- Prior experience working in a medium-tier startup or in AI/SaaS companies is a strong advantage.


- Excellent troubleshooting skills with a focus on root cause analysis and prevention.


info-icon

Did you find something suspicious?