Job Description

Role Overview:

We are seeking a seasoned Senior DevOps Engineer and Infrastructure Architect to lead the design, deployment, and governance of our enterprise-grade infrastructure. This individual will serve as a critical enabler of operational excellence, driving automation, scalability, and resiliency across a multi-cloud and hybrid landscape with a strong emphasis on AWS and on-premise environments.

The ideal candidate is both a strategic architect and a hands-on technologist, capable of influencing cross-functional engineering teams, enforcing DevSecOps practices, and enabling robust platforms for application delivery, data infrastructure, and AI/ML systems.

The candidate should have very good project experience in the BFSID domain.

Key Responsibilities :

Cloud & Infrastructure Architecture :

- Design and implement secure, resilient, and high-performance cloud architectures on AWS, while supporting integration with GCP, Azure, and on-premise infrastructure (e.g., VMware, OpenStack).

- Define hybrid cloud strategies that address security, network segmentation, identity federation, and data governance across environments.

- Develop infrastructure blueprints and reference architectures that align with business and technical requirements.

Infrastructure as Code & Automation :

- Champion Infrastructure as Code (IaC) using Terraform, CloudFormation, or Pulumi for scalable and repeatable provisioning.

- Automate environment creation, configuration management, and orchestration workflows using Ansible, Helm, or equivalents.

- Establish GitOps-based pipelines for environment consistency, change tracking, and governance.

DevOps & Continuous Delivery :

- Architect, implement, and manage enterprise-grade CI/CD pipelines using tools like GitHub Actions, GitLab CI, Jenkins, or ArgoCD.

- Drive DevSecOps adoption by embedding security, compliance checks, and observability into the software delivery lifecycle.

- Enable release strategies such as blue/green, canary deployments, and feature flagging.

Hybrid Infrastructure & On-Premise Integration :

- Lead the integration and optimization of on-prem systems with cloud-native services, ensuring seamless connectivity, policy alignment, and resource efficiency.

- Manage infrastructure for container platforms, virtualized environments, and legacy applications within private datacenters.

- Enforce standardized disaster recovery (DR), backup, and failover strategies across hybrid deployments.

Monitoring, SRE, and Reliability Engineering :

- Define and monitor SLAs, SLIs, and SLOs across services; implement proactive alerting and auto-remediation strategies.

- Operationalize observability using platforms like Prometheus, Grafana, ELK, CloudWatch, and Datadog.

- Drive incident response, root cause analysis (RCA), and post-mortem processes to ensure continuous improvement.

AI/ML Platform Enablement (Preferred) :

- Collaborate with data engineering and machine learning teams to provision infrastructure optimized for AI/ML pipelines, GPU workloads, and data lakes/pools.

- Support orchestration frameworks such as Kubeflow, MLflow, Airflow, and cloud-native ML services (e.g., SageMaker, Vertex AI).

- Optimize infrastructure for data ingestion, feature engineering, and real-time inference workflows.

Required Qualifications :

- 8+ years of experience in DevOps, Site Reliability Engineering, or Infrastructure Architecture roles.

- Deep technical proficiency with AWS, coupled with working experience across GCP, Azure, or private cloud stacks.

- Expert-level skills in Linux systems, containerization (Docker, Kubernetes), and networking/security best practices.

- Hands-on experience with infrastructure automation, CI/CD tools, and scripting (e.g., Python, Go, Shell).

- Strong foundation in cloud security (IAM, VPC, KMS, WAF, GuardDuty), encryption, identity, and compliance frameworks.

Preferred Qualifications (Added Advantages) :

- AWS Certifications (DevOps Engineer Professional, Solutions Architect Professional).

- Experience managing data platforms, AI/ML pipelines, and high-volume data lake architectures.

- Familiarity with enterprise ITIL, SRE principles, and compliance mandates (e.g., ISO 27001, SOC2, GDPR).

- Experience in cost optimization, cloud spend governance, and FinOps best practices.
