HamburgerMenu
hirist

Job Description

Description :

About Us :


DataWeave is a cutting-edge SaaS-based digital commerce analytics platform that empowers retailers with competitive intelligence and equips consumer brands with digital shelf analytics on a global scale.

By harnessing the power of DataWeave, retailers gain the ability to make smarter pricing and merchandising decisions, while consumer brands can optimize their digital shelf performance across key performance indicators such as share of search, content quality, price competitiveness, and stock availability.

At the heart of DataWeave's capabilities lies its state-of-the-art AI-powered proprietary technology, which aggregates and analyzes 500+ billion data points, covering over 400,000 brands, 4,000+ websites, and spanning across more than 20 industry verticals.

Our distinguished clientele includes some of the world's largest and most renowned retailers and brands, among them Nordstrom, Overstock, The Home Depot, Mars, Bush Brothers, Mondelez, Pernod Ricard, and many others.

We are a globally distributed team, composed of over 220 talented engineers, product managers, and eCommerce experts located across San Francisco, Seattle, Austin, and Toronto in North America, complemented by our technology-focused offices in Bangalore.

What Do We Offer ?


- Cutting-Edge Research & Applied ML Opportunities : Tackle some of the most challenging problems in NLP, Computer Vision, and Large Language Models (LLMs).


- Experiment with state-of-the-art algorithms and AI agents on massive text and image datasets. Your work will shape the future of AI!


- Immediate Impact : See the direct impact of your innovations on our customers.

- Your contributions wont just sit on a shelftheyll drive real-world change in retail/ecommerce.


- End-to-End Ownership : Experience the thrill of owning projects from concept to deployment.

- Youll have the freedom to explore, build, and optimize solutions, ensuring theyre as impactful as possible.

- Culture of Openness and Collaboration : We believe in a flat hierarchy, fostering open communication and organization-wide visibility.


- Your voice matters, and your ideas are heard.


- Fun and Flexible Work Environment : Enjoy a vibrant workplace with flexible working hours that let you balance work and life seamlessly.

- Continuous Learning and Growth : Stay at the forefront of technology with access to learning opportunities, courses, and tech conferences.

- Plus, benefit from mentorship by experienced team members who are leaders in their fields.

- Competitive Rewards and Growth : We offer competitive salary packages and fast-paced career growth opportunities.

- Your hard work and contributions will be recognized and rewarded.

- Ready to shape the future of AI with us? Read more!

Who are we looking for? :

The ideal candidate is a skilled AI Developer (4+ years of ML experience) with research acumen and experience building/fine tuning ML/DL models from scratch, developing backends, and deploying scalable AI applications in production environments.

You thrive in translating business problems into AI solutions and driving productization opportunities.

You are a team player with strong collaboration skills and the ability to lead and inspire a team towards achieving impactful results.

Qualifications :

- Bachelors degree in Computer Science, Data Science, Mathematics, or a related field, with strong fundamentals in programming paradigms and software engineering.

- 6+ years of hands-on experience in AI/ML development, including end-to-end model lifecycle ownership. (3+ years acceptable with exceptional depth in GenAI/LLMs.)

- Expert-level Python proficiency, with strong experience using PyTorch or TensorFlow for building, training, and deploying deep learning models.

- Strong grounding in Generative AI, including experience working with Large Language Models (LLMs), vision-language models, and multimodal systems.

- Proficiency in data preprocessing, feature engineering, and data augmentation workflows for structured, unstructured, image, audio, and text data.

Hands-on experience with core NLP and CV libraries : SpaCy, NLTK, HuggingFace Transformers, OpenCV, and experience with key tasks like :

1. Object Detection, Classification, Segmentation

2. OCR and Document AI

3. Text Classification, Summarization, NER, Sequence Labeling

4. Embedding models and semantic similarity

- Proven experience training, fine-tuning, quantizing, evaluating, and deploying transformer-based models such as BERT, GPT, T5, LLaMA, Mistral, Phi, Claude, and Falcon.

- Deep understanding of LLM ecosystems including OpenAI, Anthropic, Meta, Google, Mistral, and AWS Bedrock models.

- Familiarity with model optimization, serving and scalability techniques such as : quantization (GPTQ, AWQ), distillation, pruning, ONNX export, TensorRT-LLM, vLLM, DeepSpeed, and FlashAttention.

- Experience designing experiments, evaluating models, and applying statistical rigor, A/B testing, prompt evaluation, and offline/online metrics.

- Strong leadership qualitiescapable of mentoring teams, driving AI strategy, and collaborating in high-velocity product environments.

Preferred / Good to Have :


- Masters or PhD in Computer Science, AI/ML, Applied Math, or related fields.

- Startup or high-growth tech experience; ability to operate in ambiguity with an ownership mindset.

- Experience building full MLOps pipelines using MLFlow, Kubeflow, Airflow, SageMaker, or Vertex AI, with CI/CD for ML.

- Strong experience with LLM fine-tuning and parameter-efficient training (PEFT) such as : LoRA, QLoRA, DoRA, Adapters, Prefix Tuning, Reward-model training, SFT and RLHF pipelines.

- Practical exposure to LangChain, LangGraph, LlamaIndex, and advanced workflow orchestration frameworks for multi-agent or tool-using systems.

- Expertise in building Retrieval-Augmented Generation (RAG) pipelines using vector DBs like Pinecone, Chroma, Qdrant, Weaviate, or FAISS, including :

1. Chunking strategies & embedding models

2. Hybrid search (sparse + dense)

3. Guardrails, reranking, hallucination checks, evaluation frameworks

- Experience with LLM application evaluation, including Ragas, DeepEval, Promptfoo, or custom eval frameworks.

Understanding of modern state-of-the-art research in :

- Transformer optimizations (Mamba, state-space models, multi-modal models)

- Self-supervised learning (SSL, contrastive learning)

- Agentic AI and tool-use systems

- Efficient training frameworks (FSDP, ZeRO)

- Contributions to open-source ML/AI repositories, publications, or participation in ML research communities is a plus.

Role and responsibilities :


As a Senior / lead Data Scientist specializing in LLMs, NLP, Computer Vision, and Deep Learning, you will play a critical role in driving AI innovation and solving complex business problems within the Retail domain.

You will be responsible for building robust AI capabilities that align with our product strategy, leading research initiatives, and developing scalable, production-ready models.

Additionally, you will mentor junior team members, lead by example, and contribute to the growth of our AI team.

- Model Development and Research : Build robust ML models using state-of-the-art architectures and solve some of the complex problems in the retail domain like product matching, attribute extraction, price optimization etc.

- Optimization and Scalability : Think scale and automationmeasure everything and optimize proactively for performance and efficiency with MLOps best practices.

- Team Growth and Mentorship : Be a tech thought leader, fostering a culture of innovation, passion, and collaboration & mentor and train junior team members, enabling them to grow their skills.

- Ownership and Leadership : Take end-to-end ownership of projects, from ideation to deployment, with minimal supervision


info-icon

Did you find something suspicious?

Similar jobs that you might be interested in