Muhammad Azlan Yahaya

Muhammad Azlan Yahaya

Senior AI Engineer · Malaysia

AI Engineer with 8+ years of deep expertise in LLMs, NLP, and Python-based AI application development. Skilled in building intelligent systems like NLP search platforms, RAG systems, AI Agents and Chatbots. Experienced in end-to-end solutions using FastAPI/Flask, vector databases, and cloud deployment.

About

AI Engineer with deep expertise in LLMs, NLP, and Python-based AI application development. Skilled in building intelligent systems like NLP search platforms, Retrieval-Augmented Generation (RAG) systems, AI Agents and Chatbots. Experienced in developing end-to-end solutions using FastAPI/Flask, integrating vector databases, and deploying models on Cloud. Looking to apply cutting-edge AI/ML techniques in a dynamic and impactful engineering team.

8+
Years Experience
AI/ML
Primary Focus
Malaysia
Based In
Remote
Work Style

Experience

AI Engineer

The Right Contact

02/2023 – 10/2025 New York, United States

NLP Search Platform for Medical Database

  • Developed NLP search platform enabling clinicians and non-technical users to query a complex medical database using plain English
  • Engineered transformer-based model to translate natural language queries into optimized SQL statements
  • Applied ML algorithms to personalize and improve search accuracy by learning from user interactions
  • Architected scalable, cloud-native microservices deployed on AWS (Lambda, API Gateway, ECS)
  • Implemented advanced NLP features like synonym recognition and contextual query suggestions

Software Engineer

Enso Consulting

03/2021 – 12/2022 American Fork, United States
  • Developed full-stack application enabling users to interact conversationally with multiple documents through intelligent retrieval
  • Designed and implemented advanced retrieval pipeline combining structured and unstructured data sources for improved relevance and accuracy
  • Built scalable backend architecture for efficient query processing and high-performance operation
  • Created intuitive and visually engaging user interface for seamless navigation and interaction

Data Engineer

Commune

01/2019 – 12/2020 San Mateo, United States
  • Designed and implemented ML system to recognize and extract structured information from accounting documents
  • Built scalable, reliable, and distributed backend services for real-time data processing under high workloads
  • Developed advanced document analysis to detect and classify QR codes, tables, headings, logos, and signatures

Python Developer

Aimesoft

08/2016 – 02/2019 Hanoi, Vietnam
  • Designed and trained deep learning models using PyTorch and TensorFlow for classification, regression, and sequence modeling
  • Built ML pipelines using scikit-learn with feature engineering, model selection, and hyperparameter tuning
  • Deployed models to production via ONNX, TorchScript, or TensorFlow Serving with cloud-native inference pipelines
  • Specialized in model optimization, data preprocessing, and scalable training/inference strategies

Education

Master of Computer Science

Nilai University

2012 – 2016 Negeri Sembilan, Malaysia

Selected Projects

Transformer-based NL-to-SQL and image-triggered search for clinicians. ML personalization, synonym/context features. AWS microservices.

NLP Transformers FastAPI AWS
Multi-Document Conversational System Software Engineer

Full-stack application with intelligent retrieval system for conversational document interaction. Advanced retrieval pipeline combining structured and unstructured data sources.

RAG LangChain ChromaDB FastAPI
Document Information Extraction Data Engineer

ML system for recognizing and extracting structured information from accounting documents. Scalable distributed backend with advanced document analysis capabilities.

Deep Learning Document AI FastAPI MongoDB
DL Model Training & Production Deployment Python Developer

PyTorch/TensorFlow models for NLP and CV. Production deployment via ONNX, TorchScript, TF Serving; cloud-native inference pipelines.

PyTorch TensorFlow ONNX Production ML

Contact

Open to AI/ML roles and collaborations. Get in touch.