Research & Architecture

Yashraj Motwani

I go deep on the models everyone else treats as black boxes.

Yashraj Motwani

Five years at IIT Kharagpur, then R&D at EXL building NLP systems for Energy, Healthcare, and Travel clients. I've fine-tuned LLMs when fine-tuning wasn't trendy. Built intent clustering systems, NER pipelines, and retrieval architectures that actually worked at scale. The difference between a demo and a deployment is in the details — and I've spent years learning those details.

At ContextJet, I bring research depth to production problems. GraphRAG, hybrid retrieval, custom embeddings — I build the systems that make AI answers trustworthy, not just plausible.

Highlights

B.Tech + M.Tech, IIT Kharagpur
NLP R&D at EXL (Energy, Healthcare, Travel)
Research at Sungkyunkwan University, Seoul
Research at Johannes Gutenberg University, Mainz
Based in Bengaluru, India

Expertise

01

LLM Fine-tuning

Custom models with LoRA, Q-LoRA for domain-specific tasks. Not just prompting — actual model optimization.

02

RAG & Retrieval

State-of-the-art retrieval with ColBERT, SPLADE, hybrid search. Benchmarked, not guessed.

03

NLP Pipelines

Intent clustering, NER, summarization, question answering. Production-grade, not research-grade.

Selected Work

Enterprise Hierarchical RAG System
01

Enterprise Hierarchical RAG System

System architecture, LLM optimization, deployment

Architected production-scale RAG platform handling millions of documents across 160+ client deployments. Hierarchical document ingestion, multimodal search with image analysis, integrated mobile chatbot. Optimized for 100K+ requests/second throughput.

Nexo GraphRAG Engine
02

Nexo GraphRAG Engine

GraphRAG architecture, retrieval optimization

Engineered knowledge-graph-powered RAG combining Neo4j with vector embeddings for context-aware retrieval. Entity extraction, relationship mapping, multi-hop reasoning. Sub-second query latency at scale.

Unsupervised Intent Clustering
03

Unsupervised Intent Clustering

Architecture, clustering algorithms, visualization

Built transcript analysis system using GPT-3.5, LLaMA 2/3 embeddings with K-Means, DBSCAN, HDBSCAN for hierarchical intent clustering. Deployed for Energy, Healthcare, Travel clients.

04

LLM Fine-tuning Framework

Model optimization, custom training pipelines

Fine-tuned GPT-3.5, Llama 2, flan-t5 using LoRA, Q-LoRA, few-shot learning for domain-specific generation across multiple industries.

05

Advanced NER Pipeline

Pipeline design, model stacking

Stacked BERT-CRF and reverse question answering models with iterative RoBERTa fine-tuning. Cutting-edge accuracy for entity extraction.

Recognition

B.Tech + M.Tech from IIT Kharagpur
Published research: GNN for energy reconstruction (46% MAE improvement)
2 gold medals at Inter IIT Cultural events
Led ML project team of 15+ at spAts IIT Kharagpur
Headed group of 50+ for NSSC (India's Largest Space Tech Fest)

Currently

  • Building LLM-powered applications at ContextJet
  • Experimenting with RAG pipelines and retrieval optimization
  • Fine-tuning models for client-specific use cases
  • Growing ContextJet's NLP capabilities

Want to work together?

Reach out through ContextJet or connect directly.