Hi, I'm Yashraj.
NLP Engineer & AI Researcher.
Co-founder at ContextJet.ai.

B.Tech + M.Tech, IIT Kharagpur ('17–'22)Gen AI & NLP R&D SpecialistCo-founder, ContextJet.aiEx-EXL, Assistant Manager (AI-NLP R&D)Ex-Senior AI Engineer, Sasken Technologies + HoneywellBased in Bengaluru, India

What I build

LLM Fine-tuning & Optimization

Custom models with LoRA, Q-LoRA for domain-specific tasks.

RAG & Retrieval Systems

State-of-the-art retrieval with ColBERT, SPLADE, hybrid search.

NLP Pipelines

Intent clustering, NER, summarization, question answering at scale.

Selected builds

Enterprise Hierarchical RAG System for Hardware MNC

Architected production-scale RAG platform handling millions of documents across 160+ client deployments. Built hierarchical document ingestion, multimodal search with image analysis, integrated mobile chatbot. Optimized Google Gemini and OpenAI endpoints achieving 100,000+ requests/second throughput with cost-efficient LLM routing.

Role: system architecture, LLM optimization, deployment

Nexo GraphRAG Engine

Engineered knowledge-graph-powered RAG system combining Neo4j graph database with vector embeddings for context-aware retrieval. Implemented entity extraction, relationship mapping, and multi-hop reasoning for complex queries. Deployed for enterprise knowledge management with sub-second query latency at scale.

Role: GraphRAG architecture, knowledge graphs, retrieval optimization

Unsupervised Intent Clustering System

Built transcript analysis system using GPT-3.5, LLaMA 2/3 embeddings with K-Means, DBSCAN, HDBSCAN for hierarchical intent clustering. Deployed for Energy, Healthcare, Travel clients.

Role: architecture, clustering algorithms, visualization

LLM Fine-tuning Framework

Fine-tuned GPT-3.5, Llama 2, flan-t5 using LoRA, Q-LoRA, few-shot learning techniques for domain-specific generation across multiple industries.

Role: model optimization, custom training

Advanced NER Pipeline

Stacked BERT-CRF and reverse question answering models with iterative RoBERTa fine-tuning, achieving cutting-edge accuracy for entity extraction.

Role: pipeline design, model stacking

LangChain Competitive Analysis Tool

Built AI-powered analysis tool with Streamlit + LangGraph agent. Engineered robust RAG pipeline benchmarking ColBERT, SPLADE, DPR, BM25-BERT for optimal retrieval.

Role: full-stack development, RAG architecture

AI-powered Product Discovery Assistant

Built knowledge graph using NetworkX, RDF, Schema.org, migrated to Neo4j. Created LangChain query processor translating natural language to structured KG queries.

Role: knowledge graph design, query processing

Customer Sentiment Analysis (Bosch India)

Achieved 93.22% accuracy using BERT models and TF-IDF feature engineering. Built CSAT Django web app with automated scraping and real-time sentiment analysis.

Role: research, web app development

Writing & thinking

Articles

Notes

Automations die without ownership.
Guardrails are product decisions.
Fewer actions, clearer wins.

Current

  • Building LLM-powered applications at ContextJet.
  • Experimenting with RAG pipelines and retrieval optimization.
  • Fine-tuning models for client-specific use cases.
  • Growing ContextJet's NLP capabilities.

Signals

  • B.Tech + M.Tech from IIT Kharagpur
  • Published research: GNN for energy reconstruction (46% MAE improvement)
  • 2 gold medals at Inter IIT Cultural events
  • Led ML project team of 15+ at spAts IIT Kharagpur
  • Headed group of 50+ for NSSC (India's Largest Space Tech Fest)

Want to build something?

Talk to ContextJet