Posted on: 22/07/2025
Requirements:
- Extensive experience (3+ years) with cloud-native infrastructure (Kubernetes, Docker) in hybrid or multi-cloud environments.
- Expertise in building and operating large-scale infrastructure in public cloud environments (AWS, GCP, Azure).
- Expert in Infrastructure as Code (Terraform, Ansible, CloudFormation), with hands-on experience in automating infrastructure across multiple clouds.
- Familiarity with cloud-native DevSecOps practices and tools, integrating security and automation across the SDLC.
- Strong automation skills using CI/CD pipelines in a multi-cloud environment.
- Proficiency in Go (mandatory) and Python, and experience working with both NoSQL databases (MongoDB, Cassandra) and relational databases (Postgres).
- Expertise in monitoring, logging, and alerting frameworks (EFK/ELK stack, Kafka, Prometheus) for ensuring operational reliability.
- Strong knowledge of distributed systems, Linux systems, cloud computing, networking, and distributed storage technologies for building and optimizing infrastructure.
Key Responsibilities:
Cloud-Native Infrastructure Management:
- Design, implement, and manage robust cloud-native infrastructure using Kubernetes and Docker in complex hybrid or multi-cloud environments (AWS, GCP, Azure).
- Build and operate large-scale infrastructure components within public cloud environments.
Automation & Infrastructure as Code (IaC):
- Serve as the team's Infrastructure as Code (IaC) expert, automating infrastructure deployment and management across multiple clouds with tools such as Terraform, Ansible, and CloudFormation.
- Develop and maintain robust CI/CD pipelines to automate deployments and infrastructure changes in a multi-cloud setting.
Observability & Reliability Engineering:
- Design, implement, and optimize comprehensive monitoring, logging, and alerting frameworks.
- Use tools such as the EFK stack (Elasticsearch, Fluentd/Logstash, Kibana), Prometheus, and Kafka to ensure high operational reliability and provide deep insight into system performance.
- Proactively identify and address performance bottlenecks and potential issues within distributed systems.
DevSecOps & Security Integration:
- Apply cloud-native DevSecOps practices and tools, integrating security and automation throughout the Software Development Lifecycle (SDLC).
Technical Proficiency & Collaboration:
- Use Go (mandatory) and Python for automation and tool development.
- Work with both NoSQL databases (MongoDB, Cassandra) and relational databases (Postgres).
- Apply strong knowledge of distributed systems, Linux systems, cloud computing, networking, and distributed storage technologies to build and optimize infrastructure.
- Collaborate closely with development, operations, and security teams to ensure seamless integration and operational excellence.
Posted in: DevOps / SRE
Functional Area: DevOps / Cloud
Job Code: 1517482