BCSS is seeking a Databricks Data Engineer to support its enterprise-wide Sustainability initiative. The engineer will be responsible for building data pipelines and models to support product-level carbon footprint analysis.
This role involves integrating structured engineering, manufacturing, and supplier data into a unified model using Databricks on AWS.
Key Responsibilities :
- Develop and optimize ETL/ELT pipelines on Databricks using PySpark and SQL to support carbon footprint analytics.
- Build data models that combine engineering (EBOM), manufacturing (MBOM), supplier, and factory operations data to generate emissions metrics.
- Integrate and transform data from :
MBOM from SAP, including multilevel BOM explosion logic.
EBOM from Oracle-based systems (no explosion required).
Supplier environmental data (e.g., material-level emissions).
Factory data (e.g., energy consumption, material usage).
- Collaborate with sustainability analysts, engineering teams, and supply chain stakeholders to translate carbon calculation logic into data transformations.
- Ensure high data quality, lineage, and governance using Delta Lake, Unity Catalog, and standard best practices.
- Leverage AWS services such as S3, Glue, and Athena for orchestration and storage.
- Document data logic and workflows for traceability and compliance with ESG standards.
Technical Requirements:
- Hands-on experience with Databricks (Delta Lake, Unity Catalog, Jobs, Workflows).
- Strong skills in PySpark and SQL.
- Experience with SAP MBOM structures, especially multilevel BOM explosion.
- Understanding of Oracle-based EBOM systems and ability to integrate structured data.
- Familiarity with AWS data ecosystem (S3, Glue, Lambda, Athena).
- Strong knowledge of data modeling, pipeline optimization, and performance tuning.
Preferred Qualifications :
- Experience in carbon accounting, lifecycle analysis, or sustainability-focused data projects.
- Understanding of manufacturing data and supply chain operations.
- Exposure to SAP ECC/S/4HANA and Oracle-based PLM or engineering systems.
- Experience with version control, CI/CD (e.g., Git, Databricks Repos).