HamburgerMenu
hirist

Principle Data Scientist - Big Data Technologies

Talent Acceleration Corridor
Multiple Locations
5 - 8 Years

Posted on: 25/07/2025

Job Description

Job summary :


Our client is a leading Software as a Service (SaaS) company, revolutionizing the US healthcare industry by leveraging cutting-edge Artificial Intelligence (AI) solutions to transform and manage data. They are looking for highly motivated and skilled Lead/Sr. Data Scientist focused on the application of emerging technologies. The candidates must be well versed with Python, Scala, Spark, SQL and AWS platform.


The individuals who will join the new Evolutionary Platform team should be continually striving to advance AI/ML excellence and technology innovation. The mission is to power the next generation of the digital product and services through innovation, collaboration, and transparency. You will be a technology leader and doer who enjoys working in a dynamic, fast- paced environment.


Responsibilities :


- Leverage AI/ML techniques and solutions to identify and mathematically interpret complex healthcare problems.


- Full-stack development of data pipelines involving Big Data.

- Design and development of robust application/data pipelines using Python, Scala, Spark, and SQL

- Lead a team of Data Scientists, developers as well as clinicians to strategize, design and evaluate AI based solutions to healthcare problems.

- Increase efficiency and improve the quality of solutions offered.

- Managing the complete ETL pipeline development process from conception to deployment

- Collaborating with and guiding the team on writing, building, and deployment of data software

- Following best design and development practices to ensure high quality code.

- Design, build and maintain efficient, secure, reusable, and reliable code

- Perform code reviews, testing, and debugging


Desired Profile :


- Bachelor's or Masters degrees in computer science, Mathematics, or any other quantitative discipline from Premium/Tier 1 institutions

- 5 to 7 years of experience in developing robust ETL data pipelines and implementing advanced AI/ML algorithms (GenAI is a plus).

- Strong experience working with technologies like Python, Scala, Spark, Apache Solr, MySQL, Airflow, AWS etc.

- Experience working with Relational databases like MySQL, SQLServer, Oracle etc.

- Good understanding of large system architecture and design

- Understands the core concepts of Machine Learning and the math behind it.

- Experience working in AWS/Azure cloud environment

- Experience using Version Control tools such as Bitbucket/GIT code repository

- Experience using tools like Maven/Jenkins, JIRA

- Experience working in an Agile software delivery environment, with exposure to continuous integration and continuous delivery tools


- Great collaboration and interpersonal skills

- Ability to work with team members and lead by example in code, feature development, and knowledge sharing


info-icon

Did you find something suspicious?