Prama - DevOps/Site Reliability Engineer

PRAMA INNOVATIONS INDIA PRIVATE LIMITED
5 - 10 Years
Gurgaon/Gurugram

Posted on: 16/04/2026

Job Description

Company Overview :

Prama.ai specializes in AI-powered and Generative AI solutions across Data, Cloud, and APIs, helping businesses build intelligent platforms and scalable AI-driven products.

With 500+ projects delivered globally and 30+ engagements currently in progress, Prama.ai brings deep expertise in Data Engineering, AI/ML, and Advanced Analytics across industries such as SaaS, FinTech, Healthcare, and EdTech. We have also built 6 proprietary products, reflecting our strong product engineering DNA.

Finomics.ai is Prama.ai's in-house financial intelligence and analytics platform, reinforcing our commitment to building practical, real-world solutions that deliver measurable business impact.

Prama.ai is ISO 27001 certified and SOC 2 Type II compliant, demonstrating our strong commitment to security and compliance. We are a trusted partner with AWS, Google Cloud Platform (GCP), Snowflake, and Databricks.

Headquartered in Phoenix, Prama.ai operates globally with offices in the USA (Chandler, AZ), Canada (Toronto), and India (Ahmedabad and Bangalore).

Location : Gurgaon | Work from Office

Benefits : 5-Day Work Week | Career Growth | Flexible Working | Medical Insurance

Primary skills : Linux, Containers & Kubernetes, Observability / Monitoring Tools, ELK Stack (Elasticsearch, Logstash, Kibana), Splunk / Dynatrace, Elasticsearch, Networking Basics, Troubleshooting & Incident Handling, ITSM Processes

Experience : 5+ Years

Role Overview :

We are looking for a highly skilled SRE / DevOps Engineer with strong expertise in Linux systems, containerization, and observability platforms. The ideal candidate will be responsible for ensuring system reliability, performance, and scalability across production environments.

Responsibilities :

- Manage and maintain Linux-based infrastructure ensuring high availability and performance

- Deploy, manage, and scale applications using containers and Kubernetes

- Implement and maintain observability and monitoring solutions

- Work extensively with the ELK Stack (Elasticsearch, Logstash, Kibana) for logging and analytics

- Configure and optimize tools like Splunk / Dynatrace for monitoring and incident detection

- Perform deep analysis and tuning of Elasticsearch clusters (critical skill)

- Troubleshoot production issues and drive quick resolution, including incident handling and root cause analysis (RCA)

- Collaborate with development and operations teams for system improvements

- Follow and implement ITSM processes (Incident, Problem, Change Management)

- Apply networking fundamentals, including DNS, TCP/IP, and load balancing

Required Skills :

- Strong expertise in Linux (core system administration)

- Hands-on experience with Docker & Kubernetes

- Strong experience in Elasticsearch (must-have)

- Experience with ELK Stack for centralized logging

- Knowledge of Splunk / Dynatrace / other monitoring tools

- Solid understanding of networking concepts

- Strong troubleshooting and debugging skills

- Experience in incident management & production support

Good to Have :

- Experience with cloud platforms (AWS / Azure / GCP)

- Knowledge of CI/CD pipelines

- Exposure to automation/scripting (Python, Bash)
