Data Scientist - Python/SQL

YO IT Group
Multiple Locations
2 - 4 Years

Posted on: 30/05/2025

Job Description


Title : Data Scientist

Experience : 2 to 4 years

Location : Bengaluru , Chennai , Pune

Must Have :


- Relevant 2 years of relevant hands-on experience in Data Scientist.


- Hands-on experience in developing and deploying ML models using Python, PyTorch, and Scikit-learn.


- Practical knowledge of NLP tasks like entity recognition and text classification, and working with GenAI models such as GPT or LLaMA.


- Experience with tools like LangChain or LlamaIndex for building and optimizing RAG (Retrieval-Augmented Generation) pipelines.


- Understanding of MLOps practices and ability to handle big data in both batch and streaming modes.

Position Overview :


Tata Communications is looking for a Data Scientist to join our AI Center of Excellence (AI-CoE). This role demands a deep understanding of Machine Learning (ML), Natural Language Processing (NLP), Generative AI, and Retrieval-Augmented Generation (RAG). You will work on cutting-edge projects that transform business needs into AI-enabled solutions, contributing directly to product innovation and the development of monetizable differentiators for our products and services.

Key Responsibilities :


- Design, develop, test, and deploy ML models for diverse telecom and business use cases.


- Conduct data preprocessing, feature engineering, and model evaluation.


- Optimize ML/DL models for scalability and performance.


- Apply NLP techniques for tasks such as :


1. Named Entity Recognition (NER)


2. Text classification


3. Language modeling (e.g., GPT, LLaMA, Claude, Grok)


- Build and refine RAG-based models to improve information retrieval and response generation.


- Integrate RAG systems into existing applications to enhance user experience and data accessibility.


- Collaborate with cross-functional teams, including software engineers, product managers, and subject matter experts.


- Present complex technical findings to non-technical stakeholders clearly and effectively.


- Maintain detailed documentation of model development, processes, and best practices.

Required Skills :


- Strong foundation in Probability and Statistics.


- Hands-on experience with Machine Learning and Deep Learning techniques.


- Proficiency in :


1. Programming languages : Python, SQL


2. ML/DL frameworks : PyTorch, Scikit-learn, NumPy


3. GenAI libraries/tools : LangChain, LlamaIndex


- Knowledge of MLOps principles and working with Big Data in both batch and streaming contexts.


- Strong analytical and problem-solving skills.


- Excellent communication and collaboration abilities.


- Ability to manage multiple projects in a fast-paced environment.

- Bachelors or Masters degree in Data Science, Computer Science, Statistics, Mathematics, or a related field from a reputed institute.

Preferred : 2 years of professional experience in data science or a related domain.


info-icon

Did you find something suspicious?