Hi, I'm Saqlain
Engineering AI products at scale and creating practical solutions across domains.

About

I focus on designing and refining AI systems, with a strong emphasis on clarity, stability, and rigorous reasoning. My work covers machine learning, NLP, and language models, where I take initial concepts and develop them into reliable, well-structured systems. I examine prototypes from the ground up, isolate weak points, and rebuild their logic until the behavior holds under varied conditions. I work with models, data, and algorithms in a direct, methodical way that keeps the system predictable and controlled. I write clean code, question assumptions early, and depend on solid algorithmic structure. Outside technical work, I spend my time playing cricket or training in martial arts (Black Belt, Dan 1).

Work Experience

Research Intern - Machine Learning
June 2025 - Present
Working with the NMCAD Lab on an eVTOL project as part of the machine learning team, contributing to data-driven models for improving system performance and operational efficiency.
AI Engineer Intern
June 2025 - September 2025
Designed and deployed RESTful APIs for a compliance platform, reducing response latency by over 40%. Developed a RAG-based chatbot that uses hybrid retrieval to answer ABPI case- and clause-related queries more precisely.
Research Intern
June 2024 - August 2024
Built a clinical RAG pipeline using LLMs and a FAISS vector store to extract symptoms and answer medical queries with reduced hallucinations. Developed a live-streaming ASR system using open-source models, with a substantial accuracy gain.

Open Source Contributions

Added Differential Diffusion to Kolors

HuggingFace Diffusers 🤗
Python
Diffusion Models
Image Generation
PyTorch

Optimized ALBERT Test Model Size

HuggingFace Transformers 🤗
Python
Transformers
Model Optimization
NLP

Projects

Retro Reels - Short-Form Content Generation Pipeline

Built a fully automated content pipeline that generates and uploads short-form videos using LLMs and the YouTube API. Integrated GitHub Actions to schedule daily creation and publishing workflows. Automated the generation of video scripts, titles, hashtags, and metadata for optimized reach.

Python
LLMs
GitHub Actions
YouTube API

Presently - Web to Presentation Video Tool

Developed a tool to transform web content into presentation videos with synchronized narration and music. Automated slide creation, narration, and audio generation using Gemini and text-to-speech models. Streamlined content curation through web scraping and AI-driven summarization to shape the presentation flow.

Python
MoviePy
Gemini
python-pptx

PACLI - Personal Assistant CLI

Built a command-line LLM assistant to manage and modify calendar events via natural language. Integrated an agentic RAG framework for reasoning over contextual information about scheduled events. Automated the discovery and scheduling of upcoming Codeforces contests based on user intent.

Python
LangChain
LLMs
Agentic AI

ReTweetify - Turkish Hate Speech Classification

Fine-tuned multilingual BERT for Turkish hate speech classification using parameter-efficient LoRA adapters. Implemented token-level explainability using SHAP to visualize word contributions to predictions. Built a rewriting pipeline to paraphrase hateful text while retaining tone and semantic meaning.

Python
Transformers
BERT
LoRA
SHAP

TopiQ - Topic Modelling Using Large Language Models

Built a hybrid topic generation framework using KeyBERT, Llama2, UMAP, and HDBSCAN. Constructed topic similarity graphs and performed community detection and k-core analysis for deeper insights.

Python
Transformers
NetworkX
KeyBERT
Llama2
UMAP
HDBSCAN

ML Research Papers Implementation

A collection of machine learning research papers implemented from scratch, including reproducible code, experiment logs, and detailed explanations.

Python
PyTorch
NumPy

Research and Publications

Reading Between the Lines: LLM-Powered Topic Modelling and Graph-Based Insights from Research Abstracts

ICIVC 2025 • Paper (Coming Soon)
Accepted
This study presents an approach to research abstract analysis that combines topic modelling with large language models (LLMs) and graph-based analytics. We employ BERTopic for topic extraction, Llama2 for semantic enhancement, and graph neural networks to uncover hidden patterns and relationships within academic literature. Our methodology demonstrates significant improvements in identifying emerging research trends, cross-disciplinary connections, and knowledge evolution patterns compared to traditional approaches.
Topic Modelling
BERTopic
Llama2
Graph Analytics
LLM

Comparative Analysis of Traffic Accident Detection with Emphasis on Explainability of DL Models

ICMBDC 2024 • Paper
Accepted
This paper investigates traffic accident detection using deep learning models, with a focus on explainability through SHAP. The study demonstrates how explainable AI makes traffic monitoring systems more trustworthy and interpretable for real-world applications.
Deep Learning
Computer Vision
Explainable AI
SHAP

Skills

Python
Java
C
JavaScript
SQL
R
TensorFlow
PyTorch
Pandas
NumPy
Scikit-learn
Keras
spaCy
NLTK
🤗HuggingFace
LangChain
LangGraph
Langfuse
AWS
Azure
Docker
Git/GitHub
FastAPI

Education

Bachelor of Technology, Computer Science and Engineering
Specialization: Artificial Intelligence and Machine Learning
2022 - 2026

Achievements

Knight at LeetCode

Achieved the title of Knight at LeetCode with a rating of 2000+.

First Runner-Up, Flipr Hackathon 26.1

Secured the First Runner-Up position in the ML/AI track at Flipr Hackathon 26.1 for building an AI-powered personal finance tracker.

Black Belt Dan 1 in Karate

Attained the rank of Black Belt Dan 1 in Karate, reflecting dedication and discipline in martial arts.