Posted on: 20/07/2025
Key Responsibilities :
- Maintain and optimize cloud infrastructure (AWS/GCP/Azure) using Infrastructure as Code (IaC) tools like Terraform or Pulumi.
- Automate infrastructure provisioning, configuration, and deployments using Ansible, Chef, or Helm.
- Monitor and maintain high availability, performance, and security of all environments.
- Set up and manage containerized environments using Docker and Kubernetes (EKS/GKE/AKS).
- Collaborate with development teams to ensure systems are designed for scalability, reliability, and cost efficiency.
- Implement and manage robust observability practices including logging (ELK, Loki), monitoring (Prometheus, Grafana, Datadog), and alerting.
- Manage secrets and credentials securely using tools like Vault, AWS Secrets Manager, or SOPS.
- Ensure robust security practices, vulnerability management, and compliance alignment (SOC 2, ISO 27001, etc.).
- Conduct root cause analysis and troubleshoot production issues with minimal downtime.
- Support disaster recovery and business continuity planning.
Required Skills & Qualifications :
- Strong expertise in cloud platforms (AWS, GCP and Azure).
- Deep understanding of Linux systems, networking, DNS, firewalls, and security practices.
- Proficient in Infrastructure as Code Terraform (preferred), Pulumi, or AWS CDK.
- Experience with CI/CD tools like GitHub Actions, GitLab CI, CircleCI, or Jenkins.
- Proficient in Docker and Kubernetes with experience deploying scalable microservices.
- Experience with monitoring, logging, and alerting stacks (Prometheus, Grafana, ELK, Loki, Datadog, New Relic).
- Familiarity with scripting languages (Bash, Python, or Go) for automation and tooling.
- Hands-on experience with GitOps workflows (ArgoCD, FluxCD) is a plus.
- Knowledge of application and infrastructure security principles.
- Solid understanding of software development lifecycles, Agile methodologies, and DevSecOps culture.
- Excellent communication and collaboration skills.
Good-to-Have Skills :
- Exposure to service mesh tools (Istio, Linkerd).
- Familiarity with cost optimization strategies in cloud infrastructure.
- Knowledge of compliance standards (SOC 2, HIPAA, GDPR) and relevant security tooling.
Why Join Us?
- Exposure to modern DevOps stacks and continuous learning opportunities.
- Collaborative, inclusive, and high-growth environment.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1516392
Interview Questions for you
View All