HamburgerMenu
hirist

Job Description

Description :

We are seeking an experienced Cloudera Hadoop Administrator / Lead Platform Support Specialist to manage, support, and optimize large-scale data platforms built on Hadoop, Spark, and Airflow. The ideal candidate will have strong hands-on experience in platform support, monitoring, troubleshooting, and automation, along with the ability to lead operational support activities in production environments.

This role is critical in ensuring high availability, performance, and reliability of big data platforms across production and staging environments, while collaborating closely with development, QA, and infrastructure teams.

Key Responsibilities :

Platform Operations & Support :

- Provide end-to-end platform support for Cloudera-based Hadoop ecosystems, including Hadoop, Spark, and Airflow.

- Monitor and maintain platform health, availability, and performance across production and non-production environments.

- Proactively identify system bottlenecks and capacity issues and recommend optimization strategies.

Incident Management & Troubleshooting :

- Troubleshoot and resolve complex technical issues related to :

1. Platform functionality and performance

2. Application integrations

3. User access and security

- Lead root cause analysis (RCA) for recurring or high-severity incidents.

- Participate in on-call rotations and respond to critical incidents outside regular business hours when required.

Collaboration & Deployment Support :

- Work closely with development, QA, and infrastructure teams to ensure smooth deployment and operation of applications.

- Support release management activities and validate platform readiness for deployments.

- Act as a point of escalation for platform-related issues.

Automation & Continuous Improvement :

- Design and implement automation scripts and tools to improve platform reliability and reduce manual operational tasks.

- Optimize monitoring, alerting, and incident response processes.

- Contribute to continuous improvement initiatives focused on operational excellence.

Documentation & Knowledge Management :

- Document incidents, resolutions, configuration changes, and operational procedures.

- Maintain and enhance internal knowledge bases and runbooks to support faster issue resolution and onboarding.

Required Skills & Qualifications :

- 5+ years of experience in platform or production support, with strong exposure to Hadoop-based ecosystems.

- Hands-on experience administering and supporting :

1. Cloudera Hadoop

2. Apache Spark

3. Apache Airflow

- Strong experience in monitoring and maintaining large-scale production platforms.

- Proven troubleshooting skills across distributed systems and integrations.

- Experience with automation and scripting (Shell, Python, or similar).

- Familiarity with Linux/Unix environments.

- Strong understanding of incident management, SLAs, and support processes.

- Excellent communication, coordination, and documentation skills.


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in