Posted on: 19/09/2025
Position Overview:
We are seeking a Data Engineer to design, build, and maintain scalable data pipelines that bring data from diverse sources into centralized warehouses and lakes for analytics.
Key Responsibilities:
- Design & Implement Data Pipelines: Build and maintain robust, scalable, and efficient data pipelines to process large volumes of data from diverse sources.
- Data Integration: Integrate data from multiple sources into a centralized data warehouse or data lake for analytical purposes.
- Data Modeling: Collaborate with stakeholders to design and implement data models that optimize storage, retrieval, and processing.
- Optimization & Tuning: Ensure the performance, scalability, and reliability of data systems by optimizing queries, processing workflows, and data storage.
- Data Processing Frameworks: Develop and manage batch and real-time data processing frameworks using big data technologies such as Hadoop, Spark, or Kafka.
- Collaboration: Work with data scientists, analysts, and software engineers to define data requirements, understand business needs, and design technical solutions.
- Monitoring & Troubleshooting: Monitor the health of data pipelines, troubleshoot issues, and ensure data quality and integrity.
Skills & Qualifications:
- Experience with Big Data Technologies: Proficient in Hadoop, Spark, Kafka, Hive, Flink, and related technologies.
- Data Warehousing: Familiarity with data warehousing concepts and technologies (e.g., Redshift, Snowflake, BigQuery).
- Programming Languages: Strong proficiency in languages such as Python, Java, Scala, or SQL.
- ETL Development: Experience designing and implementing ETL processes for data integration and transformation.
- Cloud Platforms: Experience working with cloud platforms such as AWS, GCP, or Azure, especially their data-related services.
- Distributed Systems: Strong understanding of distributed systems and cloud-native architectures.
- Data Modeling & Design: Solid understanding of relational and NoSQL databases, as well as best practices in database design.
- Problem-Solving & Analytical Skills: Strong problem-solving ability and an analytical mindset to identify data issues and optimize workflows.
Preferred Qualifications:
- Real-Time Data Processing: Experience with tools such as Apache Flink, Kafka Streams, or Apache Pulsar.
- Machine Learning Experience: Knowledge of integrating machine learning models or AI systems into the data pipeline.
- Advanced Degree: A Master's degree or PhD in Computer Science, Engineering, Data Science, or a related field is a plus.
Benefits:
- Health, dental, and vision insurance
- Generous PTO and sick leave
- Retirement plan options (401(k), etc.)
- Professional development and training opportunities
- Flexible working hours and remote work options
- A dynamic, collaborative, and inclusive work environment
Posted in: Data Engineering
Functional Area: Big Data / Data Warehousing / ETL
Job Code: 1548927