Posted on: 15/10/2025
Description :
- Build and optimize models for extracting insights from documents such as blueprints, scanned PDFs, SVG files, or text-heavy datasets.
- Develop and enhance multi-modal models that integrate vision with language-based features (NLP/LLMs).
- Follow best data science and machine learning practices, including data-centric development, experiment tracking, model validation, and reproducibility.
- Collaborate with cross-functional teams, including software engineers, ML researchers, and product teams, to convert research ideas into real-world applications.
- Write clean, scalable, and production-ready code using Python and frameworks like PyTorch, TensorFlow, or HuggingFace.
- Stay updated with the latest research in computer vision, NLP, and machine learning, and evaluate applicability to construction industry challenges.
Requirements :
- Solid understanding of image processing, visual document understanding, OCR, and feature extraction, as well as text processing, embeddings, and transformer-based architectures.
- Familiarity with SVG graphics, LLM-based models, or multimodal architectures is a plus.
- Deep understanding of unsupervised learning techniques like clustering, dimensionality reduction, and representation learning.
- Proficiency in Python and ML frameworks such as PyTorch, OpenCV, TensorFlow, and HuggingFace Transformers.
- Hands-on experience with model optimization techniques (e. g., quantization, pruning, knowledge distillation) - good to have.
- Experience with version control systems (Git), project tracking tools (JIRA), and cloud environments (GCP, AWS, or Azure).
- Familiarity with Docker, Kubernetes, and containerized ML deployment pipelines.
- Strong analytical and problem-solving skills, with a passion for building innovative solutions; ability to rapidly prototype and iterate.
- Comfortable working in a fast-paced, agile, startup-like environment, with excellent communication and collaboration skills.
Did you find something suspicious?