Data Scientist - Numpy/Pandas

VY Systems Pvt Ltd.
Anywhere in India/Multiple Locations
6 - 15 Years

Posted on: 15/05/2025

Job Description

Job Summary :

We are seeking a highly skilled and innovative Data Scientist with a strong background in AWS services, Python scripting, data preprocessing, and LLMs (Large Language Models). The ideal candidate will also have working knowledge of SQL and experience solving real-world problems using advanced data science techniques.


Key Responsibilities :


- Build and deploy scalable data science solutions leveraging AWS cloud services.

- Develop and maintain efficient Python scripts for data analysis, transformation, and modeling.

- Perform data wrangling, preprocessing, and feature engineering on both structured and unstructured data.

- Apply machine learning and LLM-based models to solve business problems and drive insights.

- Design experiments, conduct hypothesis testing, and validate model performance.

- Collaborate with engineering, product, and business teams to integrate models into production systems.

- Write clean, maintainable code, keep model documentation up to date, and ensure results are reproducible.

- Ensure data privacy and compliance standards are maintained across all processes.


Required Skills :


- Strong experience in Python for data science, including libraries such as Pandas, NumPy, Scikit-learn, and Transformers.

- Proficiency in AWS services such as S3, Lambda, SageMaker, EC2, and Redshift.

- Hands-on experience with LLMs and related tooling (e.g., OpenAI, HuggingFace Transformers, LangChain), including fine-tuning or applying them.

- Solid understanding of data preprocessing techniques for both structured and unstructured data.

- Working knowledge of SQL for querying and manipulating large datasets.

- Experience with model deployment and production-grade data pipelines.

