Posted on: 19/08/2025
Exp : 6+ years
Location : Gurgoan
Job Type : 6 months contract + ext.
1. Core Responsibilities :
Design, develop, and optimize ML systems :
- Implement model behavior evaluation frameworks : design, test, and debug experimental pipelines in both research and product contexts
Infrastructure & Systems Engineering :
- Tune system performance, manage resource utilization (GPU/CPU/memory), profile system behavior, and deploy optimization tooling
ML Infrastructure & Data Platforms :
- Design and manage data pipelines, orchestration (e.g., Kafka, Spark, Airflow), and large-scale storage systems (Iceberg, Delta) to enable ML workflows
- Craft and maintain data ingestion, streaming, feature engineering, and operational reliability practices.
Model Deployment & Monitoring :
- Build telemetry and failure-analysis systems to detect model drift, calibration issues, or adversarial impacts
- Support on-call rotations and incident response for critical production infrastructure.
Cross-functional Collaboration & Mentorship :
- Work alongside ML researchers, software engineers, hardware teams, and product groups to translate AI research into scalable, user-safe implementations
- Mentor junior engineers, conduct code reviews, and lead by example in building reliable ML systems
2. Key Qualifications & Skills :
Technical Proficiency :
- Strong coding skills in languages such as Python, C++, and sometimes Rust
- Deep experience with ML frameworks (e.g., PyTorch, TensorFlow), and familiarity with LLMs or multimodal models
- Hands-on knowledge with GPU programming (CUDA, kernel optimization), performance profiling, or architectures like HBM, warp utilization, low-precision formats
Systems & Infrastructure Mastery :
- Experience designing simulation frameworks, benchmarking tools, or architectural modeling stacks
ML Engineering Foundations :
- Background in computer vision, audio ML, or real-time systems for authentication or perception, when relevant
Soft Skills :
- Strong analytical thinking, problem-solving, and ability to work with ambiguous or loosely defined challenges
- Proactive communicator, able to collaborate efficiently across diverse teams, and resilient in fast-paced environments
Did you find something suspicious?