Posted on: 01/08/2025
Role Overview:
We are seeking a seasoned Senior DevOps Engineer and Infrastructure Architect to lead the design, deployment, and governance of our enterprise-grade infrastructure. This individual will serve as a critical enabler of operational excellence, driving automation, scalability, and resiliency across a multi-cloud and hybrid landscape with a strong emphasis on AWS and on-premise environments.
The ideal candidate is both a strategic architect and a hands-on technologist, capable of influencing cross-functional engineering teams, enforcing DevSecOps practices, and enabling robust platforms for application delivery, data infrastructure, and AI/ML systems.
Having Very good BFSID Domain project experience
Key Responsibilities :
Cloud & Infrastructure Architecture :
- Design and implement secure, resilient, and high-performance cloud architectures on AWS, while supporting integration with GCP, Azure, and on-premise infrastructure (e.g., VMware, OpenStack).
- Define hybrid cloud strategies that address security, network segmentation, identity federation, and data governance across environments.
- Develop infrastructure blueprints and reference architectures that align with business and technical requirements.
Infrastructure as Code & Automation :
- Champion Infrastructure as Code (IaC) using Terraform, CloudFormation, or Pulumi for scalable and repeatable provisioning.
- Automate environment creation, configuration management, and orchestration workflows using Ansible, Helm, or equivalents.
- Establish GitOps-based pipelines for environment consistency, change tracking, and governance.
DevOps & Continuous Delivery :
- Architect, implement, and manage enterprise-grade CI/CD pipelines using tools like GitHub Actions, GitLab CI, Jenkins, or ArgoCD.
- Drive DevSecOps adoption by embedding security, compliance checks, and observability into the software delivery lifecycle.
- Enable release strategies such as blue/green, canary deployments, and feature flagging.
Hybrid Infrastructure & On-Premise Integration :
- Lead the integration and optimization of on-prem systems with cloud-native services, ensuring seamless connectivity, policy alignment, and resource efficiency.
- Manage infrastructure for container platforms, virtualized environments, and legacy applications within private datacenters.
- Enforce standardized disaster recovery (DR), backup, and failover strategies across hybrid deployments.
Monitoring, SRE, and Reliability Engineering :
- Define and monitor SLAs, SLIs, and SLOs across services; implement proactive alerting and auto-remediation strategies.
- Operationalize observability using platforms like Prometheus, Grafana, ELK, CloudWatch, and Datadog.
- Drive incident response, root cause analysis (RCA), and post-mortem processes to ensure continuous improvement.
AI/ML Platform Enablement (Preferred) :
- Collaborate with data engineering and machine learning teams to provision infrastructure optimized for AI/ML pipelines, GPU workloads, and data lakes/pools.
- Support orchestration frameworks such as Kubeflow, MLflow, Airflow, and cloud-native ML services (e.g., SageMaker, Vertex AI).
- Optimize infrastructure for data ingestion, feature engineering, and real-time inference workflows.
Required Qualifications :
- 8+ years of experience in DevOps, Site Reliability Engineering, or Infrastructure Architecture roles.
- Deep technical proficiency with AWS, coupled with working experience across GCP, Azure, or private cloud stacks.
- Expert-level skills in Linux systems, containerization (Docker, Kubernetes), and networking/security best practices.
- Hands-on experience with infrastructure automation, CI/CD tools, and scripting (e.g., Python, Go, Shell).
- Strong foundation in cloud security (IAM, VPC, KMS, WAF, GuardDuty), encryption, identity, and compliance frameworks.
Preferred Qualifications (Added Advantages) :
- AWS Certifications (DevOps Engineer Professional, Solutions Architect Professional).
- Experience managing data platforms, AI/ML pipelines, and high-volume data lake architectures.
- Familiarity with enterprise ITIL, SRE principles, and compliance mandates (e.g., ISO 27001, SOC2, GDPR).
- Experience in cost optimization, cloud spend governance, and FinOps best practices.
Did you find something suspicious?
Posted By
Posted in
DevOps / SRE
Functional Area
DevOps / Cloud
Job Code
1522993
Interview Questions for you
View All