Posted on: 07/11/2025
Description :
Role : Lead in AI (Data Science)
Location : Delhi
Project : Central Government
JOB SUMMARY :
The Organization is seeking a highly skilled and experienced Data Scientist to join the team and contribute to the development of data-driven solutions under the Digital India initiative. The ideal candidate will have a strong background in data science, natural language processing, and Generative AI, with the ability to extract insights from large datasets, develop predictive models, and build scalable data processing systems. This role is critical in leveraging data to enhance e-governance services and improve decision-making processes within government agencies.
KEY RESPONSIBILITIES :
Data Analysis and Modeling :
- Analyze large and complex datasets to identify patterns, trends, and insights.
- Develop predictive models and machine learning algorithms to support various e-governance projects.
- Perform data mining and statistical analysis to inform policy decisions.
- Build natural language processing (NLP) pipelines for document analysis, entity extraction, and semantic search across government datasets.
- Optimize embedding models and vector databases for multilingual government content (Hindi, English, and regional languages).
- Design and implement Retrieval-Augmented Generation (RAG) systems with hybrid search capabilities combining dense and sparse retrieval methods.
Data Engineering :
- Design, build, and maintain scalable data pipelines and ETL processes.
- Ensure data quality, integrity, and security across all data processing activities.
- Optimize data architectures for performance and scalability.
- Design and maintain scalable ML pipelines using frameworks like Apache Airflow, Kubeflow, or MLflow for continuous model training and deployment.
- Implement model monitoring, drift detection, and automated retraining processes for production AI systems.
Technology Evaluation and Implementation :
- Stay updated with the latest data science tools, frameworks, and technologies.
- Evaluate and implement emerging tools to enhance data processing, modeling, and analytics capabilities.
Model Deployment and Production :
- Deploy machine learning models into production environments with proper monitoring, logging, and alerting systems.
- Implement MLOps practices including continuous integration, continuous deployment, and automated testing for AI systems.
Data Pipeline Development :
- Design, develop, and maintain scalable data pipelines and ETL processes to handle large volumes of data.
- Ensure the timely and accurate movement of data from various sources to data warehouses or data lakes.
Collaboration and Stakeholder Engagement :
- Work closely with product teams, developers, and government stakeholders to understand requirements and design technical solutions.
- Present analytical findings and recommendations to non-technical audiences.
- Collaborate across departments to integrate AI-driven insights into e-governance services.
Education :
Essential : Bachelor's degree in Computer Science, Data Science, Artificial Intelligence, or a related field.
Desirable : Master's degree or specialization in AI, Machine Learning, or a related discipline.
Experience :
- Minimum of 5 years of hands-on experience in data science, machine learning, or AI roles.
- Proven experience in developing predictive models, NLP systems, and generative AI applications.
- Experience with building and maintaining scalable data pipelines and production AI systems.
- Prior exposure to e-governance or public sector AI initiatives is desirable.
Technical Proficiency :
- Strong programming skills in Python (preferred) and proficiency with ML libraries such as TensorFlow, PyTorch, scikit-learn, or Hugging Face.
- Hands-on experience with data engineering frameworks like Apache Spark, Airflow, Kafka, or Flink.