- Analyze and understand existing data warehouse implementations to support migration and consolidation efforts.
- Reverse-engineer legacy stored procedures (PL/SQL, SQL) and translate business logic into scalable Spark SQL code within Databricks notebooks.
- Design and develop data lake solutions on AWS using S3 and Delta Lake architecture, leveraging Databricks for processing and transformation.
- Build and maintain robust data pipelines using ETL tools with ingestion into S3 and processing in Databricks.
- Collaborate with data architects to implement ingestion and transformation frameworks aligned with enterprise standards.
- Evaluate and optimize data models (Star, Snowflake, Flattened) for performance and scalability in the new platform.
- Document ETL processes, data flows, and transformation logic to ensure transparency and maintainability.
- Perform foundational data administration tasks including job scheduling, error troubleshooting, performance tuning, and backup coordination.
- Work closely with cross-functional teams to ensure smooth transition and integration of data sources into the unified platform.
- Participate in Agile ceremonies and contribute to sprint planning, retrospectives, and backlog grooming
- Triage, debug and fix technical issues related to Data Lakes
- Maintain and Manage Code repositories like Git
You Must Have :
- 5+ years of experience working with Databricks, including Spark SQL and Delta Lake implementations.
- 3 + years of experience in designing and implementing data lake architectures on Databricks.
- Strong SQL and PL/SQL skills with the ability to interpret and refactor legacy stored procedures.
- Hands-on experience with data modeling and warehouse design principles.
- Proficiency in at least one programming language (Python, Scala, Java).
- Bachelors degree in Computer Science, Information Technology, Data Engineering, or related field.
- Experience working in Agile environments and contributing to iterative development cycles. Experience working on Agile projects and Agile methodology in general
We Value :
- Databricks cloud certification is a big plus
- Exposure to enterprise data governance and metadata management practices.