BTech CSE | GenAI & RAG Engineer

Hi, I'm Samarth Singh

Building production-ready GenAI systems with RAG and LLMOps. Specialized in semantic search, fine-tuning transformers, and deploying intelligent LLM applications with retrieval architectures.

View My Work Contact Me

Who I Am

About Me

Samarth Singh

GenAI & RAG Engineer

GenAI Engineer & Research Enthusiast

BTech Computer Science student at VIT Bhopal (CGPA 8.45) specializing in Generative AI, RAG systems, and LLMOps. Proven expertise in building production-ready LLM applications with semantic search, hybrid retrieval (BM25 + dense vectors), and parameter-efficient fine-tuning (LoRA/PEFT). Experienced in deploying transformer models with evaluation frameworks achieving 89%+ correctness.

Passionate about making LLMs more accurate and grounded through intelligent retrieval architectures. Published fine-tuned models on Hugging Face with comprehensive evaluation (ROUGE, BERTScore, METEOR). Seeking GenAI/LLMOps internships to build scalable AI systems that solve real-world problems with measurable impact.

Years Experience

10+

Projects Completed

8.45/10

CGPA

Download Resume

Technical Arsenal

My Skills & Technologies

The tools, languages, and frameworks I leverage to build powerful and innovative web applications.

LLMOps & GenAI

LangChain

Proficiency Level

85%Advanced

LangGraph

Proficiency Level

75%Proficient

Agno

Proficiency Level

70%Proficient

Vector Databases

Proficiency Level

80%Advanced

FastAPI

Proficiency Level

85%Advanced

Groq

Proficiency Level

82%Advanced

Hugging Face

Proficiency Level

90%Expert

Prompt Engineering

Proficiency Level

85%Advanced

MLflow & W&B

Proficiency Level

75%Proficient

Gradio/Streamlit

Proficiency Level

85%Advanced

ML/DL & AI Engineering

PyTorch

Proficiency Level

88%Advanced

TensorFlow

Proficiency Level

75%Proficient

Scikit-learn

Proficiency Level

82%Advanced

Transformers

Proficiency Level

90%Expert

Pandas/NumPy

Proficiency Level

88%Advanced

Jupyter

Proficiency Level

85%Advanced

Programming Languages

Python

Proficiency Level

92%Expert

C++

Proficiency Level

80%Advanced

JavaScript

Proficiency Level

88%Advanced

TypeScript

Proficiency Level

90%Expert

SQL

Proficiency Level

85%Advanced

Full-Stack Development

Next.js

Proficiency Level

90%Expert

Node.js

Proficiency Level

85%Advanced

PostgreSQL

Proficiency Level

82%Advanced

React

Proficiency Level

90%Expert

Tailwind CSS

Proficiency Level

95%Expert

Tools & DevOps

Git/GitHub

Proficiency Level

90%Expert

Docker

Proficiency Level

80%Advanced

AWS

Proficiency Level

70%Proficient

Linux/CUDA

Proficiency Level

85%Advanced

Academic Background

Education & Certifications

My academic journey and professional development

2023 - 2027

Bachelor of Technology in Computer Science and Engineering

VIT Bhopal University•Bhopal, Madhya Pradesh

CGPA: 8.45/10 | Focus on Artificial Intelligence, Machine Learning, and Cloud Computing

Key Courses

Data Structures & AlgorithmsOperating SystemsObject-Oriented ProgrammingComputer NetworksDatabase Management SystemsCloud ComputingSoftware Engineering

Development Metrics

Coding Activity

My development metrics and problem-solving statistics

GitHub Stats

Active days this year

75%

LocationJoined: Following: 0

LeetCode Stats

@Sam_9415

My Work

Featured Projects

Explore my latest work showcasing my skills and expertise

⭐ Featured

Argus — Autonomous Research Engine

Production-grade multi-agent research pipeline using LangGraph supervisor pattern with 5 specialist agents (planner, researcher, critic, writer, supervisor) orchestrated via LLM-driven routing. Async job architecture with SQLite persistence and LangGraph checkpointing — research jobs survive agent failures and every LLM call is traced end-to-end in LangSmith. Integrates Tavily, ArXiv, and Wikipedia to synthesize cited markdown reports in 30–90 seconds.

Agents

Report Time

30–90s

Tools

LangGraphFastAPIGroq+3

View Live

Argus — Autonomous Research Engine

⭐ Featured

DoCopilot - RAG Document Q&A System

Production-grade RAG application with hybrid search (BM25 + dense vectors) using Qdrant and reranking. Achieved 89.2% correctness, 90.5% relevance, 100% source grounding on 40-query evaluation with guardrails for PII redaction and prompt injection detection. Built full-stack with Next.js frontend and FastAPI backend, processing PDFs/TXT with 2.86s average latency.

Correctness

89.2%

Relevance

90.5%

Avg Latency

2.86s

Next.jsFastAPIQdrant+3

View Code

DoCopilot - RAG Document Q&A System

⭐ Featured

FLAN-T5 Dialogue Summarizer

Fine-tuned FLAN-T5-base with LoRA on SAMSum dataset (14.7K dialogues), achieving 49.01 ROUGE-1, 72.25 BERTScore F1, and 42.51 METEOR scores. Implemented parameter-efficient training updating only 2% of parameters with FP16 mixed precision. Deployed interactive Gradio app on Hugging Face Spaces with configurable beam search and published model with reproducible evaluation.

ROUGE-1

49.01

BERTScore

72.25

Params Updated

PythonLoRAPEFT+3

View Live

FLAN-T5 Dialogue Summarizer

⭐ Featured

RoBERTa Banking Intent Classifier

Fine-tuned RoBERTa-base on Banking77 dataset (77 intents, 13K queries) achieving 93.7% accuracy and 93.6% macro-F1. Implemented standard transformer fine-tuning with AdamW optimizer, weight decay, and FP16 training on GPU. Added experiment hygiene with fixed seeds, consistent tokenization, epoch-level metrics tracking, and best-checkpoint selection for robust evaluation.

Accuracy

93.7%

Macro-F1

93.6%

Intents

PyTorchTransformersCUDA+2

View Live

RoBERTa Banking Intent Classifier

⭐ Featured

Project Loom

Full-stack project-sharing platform with Next.js leveraging SSR and ISR, reducing page load times by 50%. Architected scalable backend with Sanity.io headless CMS managing 1,000+ project entries. Implemented secure authentication with NextAuth.js and PostgreSQL, enabling users to manage profiles, post projects, and interact with content.

Next.jsTypeScriptSanity.io+2

View Live

Project Loom

Modern Portfolio

Personal portfolio website built with Next.js, TypeScript, and Tailwind CSS featuring modern UI elements, smooth animations with Framer Motion, and optimized SEO for GenAI/RAG internships. Implements dark mode, responsive design, and accessibility best practices with Lighthouse scores 90+ across all metrics.

Next.jsTypeScriptTailwind CSS+1

View Live

Modern Portfolio

Dexplorer

Interactive Pokémon discovery web application for searching and exploring the original 150 Pokémon with detailed information, stats, and type filtering. Built with React and modern JavaScript, featuring responsive design and smooth user experience.

JavaScriptReactTailwind CSS+1

View Live

Dexplorer

View All Projects on GitHub

Tech Writing

Blog & Writing

Deep-dives into production AI systems, architecture decisions, and lessons learned the hard way.

Medium·5 min read·Feb 2026

DoCopilot: Building a Production-Grade RAG System with Hybrid Search, Reranking, and Safety Guardrails

How I went from a basic chatbot to an 89%+ accurate document QA system — with ablation studies to prove it. Most RAG tutorials stop at 'embed your PDF, do cosine similarity, feed to GPT.' That's fine for a demo. It breaks in production.

Correctness

89.2%

Source Rate

100%

Avg Latency

2.86s

RAGLLMQdrantFastAPINext.jsAI Safety

Read on Medium

Medium·7 min read·Sep 2025

The Power of Normalization: How Feature Scaling Transforms Neural Network Performance on Tabular Data

A simple standardization step can transform a completely broken model (R² ≈ 0) into a high-performing one (R² > 0.8) — while dramatically reducing training time. The 160,800x improvement in R² score proves normalization is not optional.

R² Gain

160,800x

RMSE Drop

55.7%

Speed Up

2–5x

Neural NetworksMachine LearningNormalizationTensorFlowPython

Read on Medium

View all posts on Medium

Recognition & Growth

Achievements & Certifications

Awards, certifications, and milestones in my journey

Certification

Google IT Support Professional Certificate

Google Career Certificates

Comprehensive 5-course program covering troubleshooting, networking, operating systems, system administration, and security. Credential ID: whvAjzYf

Jan 2026

Certification

Applied Machine Learning in Python

University of Michigan - Coursera

Completed specialization in supervised/unsupervised learning, feature engineering, model evaluation, and scikit-learn for practical ML applications.

2025

Publication

Published Fine-Tuned Models on Hugging Face

Hugging Face Hub

FLAN-T5 Summarizer with reproducible evaluation achieving 49.01 ROUGE-1 and 72.25 BERTScore F1 on SAMSum dataset

Oct 2025

Academic

VIT Bhopal Academic Excellence

VIT Bhopal University

Maintained 8.45 CGPA with focus on AI/ML coursework including DSA, Cloud Computing, and Software Engineering

2023-Present

My Approach

Planning & Strategy

We'll collaborate to map out your website's goals, target audience, and key functionalities. We'll discuss site structure, navigation, and content requirements.

Development & Progress

Once we agree on the plan, I cue my lofi playlist and dive into coding. From initial sketches to polished code, I keep you updated every step of the way.

Deployment & Launch

After thorough testing and your final sign-off, I deploy your project to production with CI/CD pipelines, monitoring, and a smooth launch. Post-launch support ensures everything runs flawlessly from day one.

Roadmap

Future Enhancements

Planned features and improvements — turning this portfolio into a continuously evolving product.

Agentic Portfolio Assistant

A LangGraph-powered chat agent embedded in the portfolio — visitors can ask questions about projects, get technical details, or request a tailored summary. RAG over project READMEs + resume as the knowledge base.

LangGraphRAGNext.jsStreaming

Live Project Dashboards

Real-time evaluation dashboards for deployed models — live ROUGE/BERTScore trends for the FLAN-T5 summarizer, accuracy drift detection for the RoBERTa classifier, and latency telemetry for Argus.

MLflowW&BRechartsFastAPI

Multi-Language Support (i18n)

Internationalise the portfolio with next-intl — Hindi, Spanish, Japanese. Auto-detect browser locale and serve locale-specific OG images and structured data for broader reach.

next-intlSEOnext/navigation

Interactive ML Playground

Embed live Hugging Face Spaces iframes directly into project cards — let visitors run the FLAN-T5 summarizer or RoBERTa classifier without leaving the portfolio.

Hugging FaceGradioiframeStreaming

Auth & Guestbook

Add NextAuth.js GitHub OAuth so visitors can leave verified notes in a public guestbook. Messages stored in Postgres via Prisma — a small but memorable personal touch.

NextAuth.jsPostgreSQLPrismaOAuth

CMS-Backed Projects & Blog

Move projects and blog posts to a headless CMS (Sanity.io or Contentlayer) so new content can be published without code deployments. MDX support for rich blog articles.

Sanity.ioContentlayerMDXISR

Open to collaboration — reach out if any of these interest you

Get In Touch

Contact Me

Have a project in mind or want to collaborate? Feel free to reach out!

samarthsin2006@gmail.com

Phone

+91 9452026413

Location

Pratapgarh, U.P.

Send a Message

Fill out the form below and I'll respond as soon as possible.