Posted on: 15/08/2025
Position Summary :
We are looking for a Lead Site Reliability Engineer to join our Cloud Infrastructure Engineering Team (CIE) in Bangalore R&D.
Cloud Infrastructure Engineering ensures the continuous availability of the technologies and systems that are the foundation of athenahealth's services.
We are directly responsible for thousands of servers, petabytes of storage, and thousands of web requests per second, all while sustaining growth at a meteoric rate.
We enable an operating system for the medical office that abstracts away administrative complexity, leaving doctors free to practice medicine.
The Team :
We are a bunch of Site Reliability Engineers who are passionate about reliability, automation, and scalability.
We use an agile-based framework to execute our work, ensuring we are always focused on the most important and impactful needs of the business.
We support systems in a hybrid cloud and make data-driven decisions about which environment best suits the needs of the business.
We are relentless in automating away manual, repetitive work so we can focus on projects that help move the business forward.
Ideal Qualifications :
- 8 to 12 years of experience building, scaling, and supporting highly available systems and services hosted on a diverse set of platforms, e.g. physical hosts, VMs, EC2, Kubernetes, and containers.
- Hands-on experience integrating on-premises infrastructure with public and private clouds.
- Strong understanding of distributed systems, networking, and both cloud-agnostic and on-premises hosted applications.
- Strong experience with enterprise-grade middleware and core infrastructure, e.g. web servers, MQ, caching, Apache, and load balancers (NetScaler) hosted on a virtual machine cluster.
- Strong expertise in configuration management tools like Puppet or Ansible.
- Experience with Infrastructure-as-Code, Linux, VMware, and API integration.
- Hands-on experience with Terraform.
- Experience with microservices architectures and containerization technologies.
- Hands-on experience with CI/CD pipelines and automation practices.
- Proficiency in at least one scripting or programming language (Ansible, Python, Go, Ruby, etc.).
- Expertise in the delivery, maintenance, and support of Linux systems and infrastructure.
- Experience with cloud platforms (AWS), containerization (Docker), and orchestration (Kubernetes).
- Familiarity with observability tools (e.g., Prometheus, Grafana, ELK stack, CloudWatch, Splunk).
- Familiarity with telemetry and the latest monitoring and visualization tools.
- Expertise in promoting and driving system visibility to aid in the rapid detection and resolution of issues.
Job Responsibilities :
Ownership of Hybrid Cloud Infrastructure.
We are seeking a Hybrid Cloud SME to lead the technical design, architecture, and implementation of hybrid cloud solutions.
- The ideal candidate will possess deep technical knowledge of cloud platforms and systems integration, and a comprehensive understanding of both on-premises (data center) and cloud (AWS) environments.
- Hybrid Cloud Architecture & Design: Lead the design, implementation, and management of hybrid cloud solutions, ensuring seamless integration of on-premises data centers with public and private cloud environments (AWS).
- Technical Leadership: Provide expert-level guidance on hybrid cloud technologies and best practices, including cloud migrations, cloud-agnostic application development and management, security, networking, and cost optimization strategies.
- Consultation: Act as the primary technical advisor to clients, translating business requirements into scalable, cost-effective hybrid cloud solutions.
- Assist in navigating complex cloud transformation projects (e.g. EC2 to EKS).
- Hybrid Cloud Strategy & Roadmap: Develop and refine hybrid cloud strategies, including future scalability, security, compliance, and governance.
- Provide thought leadership on emerging hybrid cloud trends and technologies.
- Collaboration: Work closely with cross-functional teams including Developers, SREs, Architects, and Product teams to ensure successful hybrid cloud implementations and operational excellence.
- Security & Compliance: Ensure hybrid cloud solutions meet security and compliance standards (e.g. HIPAA, SOC 2, and HITRUST).
- Work with security teams to establish robust cloud security frameworks, including identity and access management (IAM) and encryption strategies.
- Continuous Improvement: Evaluate new hybrid cloud technologies and tools, driving continuous improvement in cloud solutions.
- Recommend enhancements to architecture, tooling, and processes to optimize cloud performance and cost.
Behaviors & Abilities Required :
- Strong leadership and mentoring abilities, with a track record of developing high-performance engineering teams.
- Excellent problem-solving, troubleshooting, and diagnostic skills.
- Ability to work in a cross-functional, collaborative environment.
- Effective communication skills, with the ability to translate technical concepts to non-technical stakeholders.
Posted By
Judah Muthiah
Senior Recruiting Manager at athenaHealth Technology Private Limited.
Last Active: 5 Dec 2025
Posted in
DevOps / SRE
Functional Area
Site Reliability Engineering
Job Code
1530273