AI/ML

Artificial Intelligence

Machine Learning

Security Architect - AI

Cloud Architect - ML/AI

Emerging Technologies

DevOps / SRE

CyberSecurity

Quality Assurance

Platform Engineering / SAP/Oracle

Lead Data Engineer - Apache Spark

HiringEye

Hyderabad

8 - 12 Years

4.9

6+ Reviews

Data Engineering Apache Spark ETL Data Ingestion Kafka SQL Python Spark Data Warehousing Datamart OLTP

Posted on: 09/12/2025

Job Description

Description :

Responsibilities :

- Lead the design and development of end-to-end data pipelines using Apache Spark (Batch and Streaming).

- Architect and implement real-time data ingestion frameworks using Kafka.

- Build scalable ETL/ELT workflows to support analytics, reporting, and data science initiatives.

- Develop and maintain data models (conceptual, logical, physical) for enterprise data platforms.

- Optimize Spark jobs for performance, reliability, and scalability.

- Ensure data quality, governance, and security across all data flows.

- Drive best practices for coding standards, CI/CD, and cloud-based data architecture.

- Mentor junior engineers and collaborate with cross-functional teams (Data Science, DevOps, Product).

- Troubleshot complex data processing issues and provided technical leadership during incidents.

Requirements :

- 7+ years of hands-on experience in Data Engineering.

- Strong working knowledge of Spark, Python, SQL, and API Integration frameworks is a must.

- Working experience in Modern data architecture and modeling concepts, including Cloud data lakes, data warehouses, and data marts.

- Familiarity with dimensional modeling, star schemas, and real-time/batch ETL pipelining, including experience with data streaming (Kafka).

- In-depth experience with Kafka for real-time data ingestion and streaming.

- Strong proficiency in SQL (analytical, performance tuning).

- Solid understanding of data modeling principles (OLTP, OLAP, dimensional modeling, star/snowflake schemas).

- Experience building large-scale distributed data processing systems.

- Hands-on experience with cloud platforms such as AWS / Azure / GCP (any).

- Knowledge of CI/CD, containerization (Docker), and orchestration tools (Airflow, Jenkins, etc. ).

- Strong problem-solving, debugging, and leadership skills.

- Bachelor's or Master's degree in Computer Science, Engineering, or related field.

Preferred Qualifications :

- Experience with Delta Lake, Lakehouse architecture, or cloud-native data platforms.

- Exposure to NoSQL databases (Cassandra, MongoDB, DynamoDB).

- Knowledge of data governance, metadata management, and cataloging tools.

- Prior experience leading a technical team or project.

Did you find something suspicious?

Similar jobs that you might be interested in

Posted by

Rajeev Reddy

Founder at HiringEye

Last Active: 27 Jan 2026

Job Views:
20

Applications: 15

Recruiter Actions: 0

Posted in

Data Engineering

Functional Area

Data Engineering

Job Code

1586715

Jobs by location

Interview Questions for you

View All

How to Write Leave Application for Urgent Work: Format & Samples (2025)

Top 90+ Machine Learning Interview Questions and Answers

Top 40+ Deep Learning Interview Questions and Answers