Posted on: 09/09/2025
Responsibilities:
- Design, build, and manage scalable infrastructure on cloud platforms (GCP, AWS, Azure, or OCI)
- Administer and optimize Kubernetes clusters and container runtimes (Docker, containerd)
- Develop and maintain CI/CD pipelines for multiple services and environments
- Manage infrastructure as code using tools like Terraform and/or Pulumi
- Automate operations with Python and shell scripting for deployment, monitoring, and maintenance
- Ensure high availability and performance of production systems and troubleshoot incidents effectively
- Monitor system metrics and implement observability best practices using tools such as Prometheus, Grafana, and ELK
- Collaborate with development, security, and product teams to align infrastructure with business needs
- Apply best practices in cloud networking, Linux administration, and configuration management
- Support compliance and security audits; assist with implementation of cloud security measures (e.g., firewalls, IDS/IPS, IAM hardening)
- Participate in on-call rotations and incident response activities
Required Qualifications:
- Hands-on experience with major cloud platforms (GCP, AWS, Azure, OCI); experience with more than one is a plus
- Proficient in Kubernetes administration and container technologies (Docker, containerd)
- Strong Linux fundamentals
- Scripting skills in Python and shell
- Knowledge of infrastructure as code with hands-on experience in Terraform and/or Pulumi (mandatory)
- Experience in maintaining and troubleshooting production environments
- Solid understanding of CI/CD concepts with hands-on experience in tools like Jenkins, GitLab CI, GitHub Actions, ArgoCD, Devtron, GCP Cloud Build, or Bitbucket Pipelines
Required Skills: Python, Kubernetes, Cloud platforms (AWS, Azure, GCP)
Posted in: DevOps / SRE
Functional Area: DevOps / Cloud
Job Code: 1543500