Posted on: 03/03/2026
Note : Would prefer Female candidates for this role.
Job Description :
We are looking for a Principal Data Engineer to join our growing team of Advanced Data Analytics experts. This role calls for a seasoned professional with deep expertise in data engineering, data modeling, and analytics, along with strong technical leadership and architectural thinking. The ideal candidate thrives on solving complex data challenges-ensuring data quality, integrity, and scalability-and driving data initiatives that enable analytics, machine learning, and evidence-based decision-making.
The Data Engineer will work closely with System and Enterprise Architects, Technical Project Managers, Product Owners, and Software Engineers on data-driven initiatives, supporting the data needs of multiple teams, systems, and products. The perfect fit for this role is self-motivated, collaborative, and enthusiastic about advancing Roche's next generation of products and data initiatives. As Roche advances toward AI-enabled diagnostics, hands-on experience with Generative AI (GenAI) POCs will be considered a plus, especially in leveraging data assets for automation, insight generation, and feature engineering. Location- Baner, Pune
Key Responsibilities :
- Design, implement, and optimize data architectures and pipelines using AWS services for scalable, high-performance, and reliable data systems.
- Lead data modeling, exploratory data analysis (EDA), and transformation activities, ensuring high-quality, well-structured data for analytics and machine learning.
- Collaborate with data scientists to support feature extraction and data preparation for AI/ML models.
- Partner with software engineers and architects to integrate data-driven capabilities into Roche's SaaS and digital health solutions.
- Ensure data governance, lineage, and compliance across cloud environments.
- Stay current with advancements in data engineering, analytics, and emerging AI/GenAI trends, evaluating their applicability to Roche's data platforms.
- Present data strategies, insights, and technical solutions clearly to both technical and business audiences.
Required Experience, Skills & Qualifications :
- 9 + years of experience in data engineering and analytics.
- Expertise in AWS services (e.g., S3, Redshift, Glue, Athena, EMR).
- Strong in data modeling, SQL, Python, and distributed frameworks like Apache Spark.
- Hands-on experience in exploratory data analysis (EDA), data quality improvement, and feature engineering.
- Experience developing SaaS and cloud-native data applications.
- Sound understanding of software architecture patterns and agile methodologies.
- Proven ability to work closely with cross-functional teams and drive data-centric initiatives.
- Excellent communication skills to bridge technical and business discussions.
Desired Experience, Skills & Qualifications :
- Experience in the Healthcare / Diagnostics domain is a plus.
- Familiarity with healthcare data standards (HIPAA, HL7, FHIR).
- Experience designing data lakes, data warehouses, or data mesh architectures.
- Strong knowledge of data governance, security, and compliance in cloud ecosystems.
- Generative AI (GenAI) Experience : Hands-on exposure to GenAI, including POCs for data summarization, feature extraction, or AI-driven insights, leveraging LLMs and AI pipelines in production or experimental setups.
The job is for:
Did you find something suspicious?
Posted by
Posted in
Data Engineering
Functional Area
Data Engineering
Job Code
1617698