Posted on: 25/02/2026
Description :
- AI & ML Data Collection Leadership : Drive the execution of AI & ML initiatives related to data collection, ensuring alignment with overall business goals and strategies.
- Document Enrichment Ownership : Own and evolve enrichment models for all incoming documents, including OCR, document structure extraction, entity extraction, entity resolution, and duplicate detection to ensure high-quality downstream data consumption.
- Technical Oversight : Provide hands-on technical leadership in the engineering of ML models and services, focusing on unstructured document processing, NLP, classifiers, and enrichment models. Oversee and contribute to scalable, reliable, and efficient solutions.
- Team Leadership & Development : Lead, mentor, and develop a high-performing team of engineers and data scientists. Foster a culture of innovation, continuous improvement, and effective communication across geographically dispersed teams.
- NLP Technologies : Contribute to the development and application of NLP techniques, including OCR post-processing, classifiers, transformers, LLMs, and other methodologies to process and enrich unstructured documents. Ensure seamless integration into the broader AI/ML infrastructure.
- Data Pipeline Engineering : Design, develop, and maintain advanced document ingestion and enrichment pipelines using orchestration, messaging, database, and data platform technologies. Ensure scalability, performance, and reliability.
- Cross-functional Collaboration : Work closely with other AI/ML teams, data collection engineering teams, and product management to ensure enrichment efforts support broader AI/ML and product objectives.
- Innovation & Continuous Improvement : Continuously explore and implement new technologies and methodologies to improve the efficiency, accuracy, and quality of document enrichment systems.
- System Integrity & Security : Ensure all data collection and enrichment systems meet high standards of integrity, security, and compliance. Implement best practices for data governance and model transparency.
- Talent Acquisition & Retention : Actively recruit, train, and retain top engineering talent. Foster an environment where team members feel valued, encouraged to innovate, and supported in reaching their full potential.
- Process Improvement : Apply Agile, Lean, and Fast-Flow principles to improve team efficiency and the delivery of high-quality data collection and enrichment solutions.
- Support Company Vision and Values : Model and promote behaviors aligned with the companys vision and values. Participate in company-wide initiatives and projects as required.
Experience, Skills, and Qualifications :
- Bachelors, Masters, or PhD in Computer Science, Mathematics, Data Science, or a related field.
- 6+ years of experience in software engineering, with a focus on AI & ML technologies, particularly in data collection and unstructured data processing.
- 3+ years of experience managing individual contributors in a leadership role.
- Strong expertise in NLP and machine learning applied to document understanding and enrichment, including classifiers, LLMs, GenAI, RAG, and/or Agentic AI.
- Hands-on experience building and operating document enrichment systems, including OCR, NER, entity resolution, document classification, and duplicate detection.
- Experience with data pipeline and messaging technologies such as Apache Kafka, Airflow, and cloud data platforms (e.g., Snowflake) is preferred.
- Proficiency in Python, Java, SQL, and other relevant programming languages and tools.
- Strong understanding of cloud-native technologies and containerization (e.g., Kubernetes, Docker), with experience managing these systems globally.
- Demonstrated ability to solve complex technical challenges and deliver scalable solutions.
- Excellent communication skills and a collaborative approach to working with global teams and stakeholders.
- Experience in fast-paced, data-intensive environments; fintech experience is highly desirable.
Working Conditions :
This position is based in a standard office environment. Employees use PCs and phones throughout the day. Limited corporate travel may be required to remote offices, meetings, or events. Morningstar is an equal opportunity employer.
Did you find something suspicious?