
•
•
•
•
•
•
1
•
•
2
•
•
3
•
•
4
•
•
5
•
•
6
•
•
7
•
•
8
•
•
9
•
•
10
•
•
11
•
•
12
•
•
13
•
•
14
•
•
15
•
•
16
•
•
•
•
•
•
•
•
•
•
1
•
•
2
•
•
3
•
•
4
•
•
5
•
•
6
•
•
7
•
•
8
•
•
9
•
•
10
•
•
11
•
•
12
•
•
13
•
•
14
•
•
15
•
•
16
•
•
•
•
•
•
•
•
1
•
•
2
•
•
3
•
•
4
•
•
5
•
•
6
•
•
7
•
•
8
•
•
9
•
•
10
•
•
11
•
•
12
•
•
13
•
•
14
•
•
15
•
•
16
•
•
•
•
•
•
•
•
•
•
1
•
•
2
•
•
3
•
•
4
•
•
5
•
•
6
•
•
7
•
•
8
•
•
9
•
•
10
•
•
11
•
•
12
•
•
13
•
•
14
•
•
15
•
•
16
•
•
17
•
•
18
•
•
19
•
•
20
•
•
21
•
•
22
•
•
23
•
•
24
•
•
25
•
•
•
•
•
•
•
•
•
•
•
1
•
•
2
•
•
3
•
•
4
•
•
5
•
•
6
•
•
7
•
•
8
•
•
9
•
•
10
•
•
11
•
•
12
•
•
13
•
•
14
•
•
15
•
•
16
•
•
17
•
•
18
•
•
19
•
•
20
•
•
21
•
•
22
•
•
23
•
•
24
•
•
25
•
•
26
•
•
27
•
•
28
•
•
29
•
•
30
•
•
•
•
•
•
My Builds
10+
End-to-End
Projects Owned
AppSageAI 2.0
Privacy-First AI Resume Analyzer

Technologies Used: RAG, LangChain, Gemini APIs, Explainable AI, Guardrails, CI/CD
Integrated Gemini 2.5 Pro (VertexAI) with RAG pipeline (FAISS + HF embedding) to handle 20K+ daily chats with chat persistence
Architected Full-stack application (TypeScript + FastAPI), with SSE streaming, JWT auth, and automated CI/CD on Cloud Run
Encrypted user data with AES-256 and rotating keys (GDPR compliant) in Firebase while maintaining <1s p99 TTFT for 600+ DAU
AppSageAI 2.0
Privacy-First AI Resume Analyzer

Technologies Used: RAG, LangChain, Gemini APIs, Explainable AI, Guardrails, CI/CD
Integrated Gemini 2.5 Pro (VertexAI) with RAG pipeline (FAISS + HF embedding) to handle 20K+ daily chats with chat persistence
Architected Full-stack application (TypeScript + FastAPI), with SSE streaming, JWT auth, and automated CI/CD on Cloud Run
Encrypted user data with AES-256 and rotating keys (GDPR compliant) in Firebase while maintaining <1s p99 TTFT for 600+ DAU
AppSageAI 2.0
Privacy-First AI Resume Analyzer

Technologies Used: RAG, LangChain, Gemini APIs, Explainable AI, Guardrails, CI/CD
Integrated Gemini 2.5 Pro (VertexAI) with RAG pipeline (FAISS + HF embedding) to handle 20K+ daily chats with chat persistence
Architected Full-stack application (TypeScript + FastAPI), with SSE streaming, JWT auth, and automated CI/CD on Cloud Run
Encrypted user data with AES-256 and rotating keys (GDPR compliant) in Firebase while maintaining <1s p99 TTFT for 600+ DAU
Freemoji
Apple Genmoji for WhatsApp

Technologies Used: Distributed Systems/Training, CUDA, Fine-tuning, vLLM, Ollama
Fine-tuned Flux.1-dev on 1,646 WhatsApp emojis using LoRA with DeepSpeed Zero-2 on 2 Nvidia A100 GPUs, cutting model loss to 0.245
Designed Prompt Assist (2-shot Prompting + Llama 3.3) to refine user inputs before fine-tuned model, improving emoji quality
Boosted performance by 40% via GPU-accelerated inference (vLLM/MLX + Diffusers/Mflux + Streamlit) and Int8 Quantization
Freemoji
Apple Genmoji for WhatsApp

Technologies Used: Distributed Systems/Training, CUDA, Fine-tuning, vLLM, Ollama
Fine-tuned Flux.1-dev on 1,646 WhatsApp emojis using LoRA with DeepSpeed Zero-2 on 2 Nvidia A100 GPUs, cutting model loss to 0.245
Designed Prompt Assist (2-shot Prompting + Llama 3.3) to refine user inputs before fine-tuned model, improving emoji quality
Boosted performance by 40% via GPU-accelerated inference (vLLM/MLX + Diffusers/Mflux + Streamlit) and Int8 Quantization
Freemoji
Apple Genmoji for WhatsApp

Technologies Used: Distributed Systems/Training, CUDA, Fine-tuning, vLLM, Ollama
Fine-tuned Flux.1-dev on 1,646 WhatsApp emojis using LoRA with DeepSpeed Zero-2 on 2 Nvidia A100 GPUs, cutting model loss to 0.245
Designed Prompt Assist (2-shot Prompting + Llama 3.3) to refine user inputs before fine-tuned model, improving emoji quality
Boosted performance by 40% via GPU-accelerated inference (vLLM/MLX + Diffusers/Mflux + Streamlit) and Int8 Quantization
Verta
A Personal Shopping Copilot

Technologies Used: HuggingFace, OpenAI APIs, Microservices, LLM Optimization, BigData
Architected a Chatbot with an end-to-end multi-agent RAG leveraging 4 LLMs using LangGraph, Guardrails, FAISS, and BigQuery
Built the backend with FastAPI, deployed on Google Cloud Run (Docker + Git Actions CI/CD), capable of handling 20k queries/sec
Evaluated with DeepEval, achieving Correctness 0.81 and Answer Relevancy 0.87 for high-quality, relevant responses
Tracked experiments with MLflow and monitored traces with LangSmith, improving model performance via human-feedback loops
Optimized state management via layered Redis caching, enabling 3× faster responses (1.5s p95 E2E and 250 ms p95 TTFT) under 6× load
Verta
A Personal Shopping Copilot

Technologies Used: HuggingFace, OpenAI APIs, Microservices, LLM Optimization, BigData
Architected a Chatbot with an end-to-end multi-agent RAG leveraging 4 LLMs using LangGraph, Guardrails, FAISS, and BigQuery
Built the backend with FastAPI, deployed on Google Cloud Run (Docker + Git Actions CI/CD), capable of handling 20k queries/sec
Evaluated with DeepEval, achieving Correctness 0.81 and Answer Relevancy 0.87 for high-quality, relevant responses
Tracked experiments with MLflow and monitored traces with LangSmith, improving model performance via human-feedback loops
Optimized state management via layered Redis caching, enabling 3× faster responses (1.5s p95 E2E and 250 ms p95 TTFT) under 6× load
Verta
A Personal Shopping Copilot

Technologies Used: HuggingFace, OpenAI APIs, Microservices, LLM Optimization, BigData
Architected a Chatbot with an end-to-end multi-agent RAG leveraging 4 LLMs using LangGraph, Guardrails, FAISS, and BigQuery
Built the backend with FastAPI, deployed on Google Cloud Run (Docker + Git Actions CI/CD), capable of handling 20k queries/sec
Evaluated with DeepEval, achieving Correctness 0.81 and Answer Relevancy 0.87 for high-quality, relevant responses
Tracked experiments with MLflow and monitored traces with LangSmith, improving model performance via human-feedback loops
Optimized state management via layered Redis caching, enabling 3× faster responses (1.5s p95 E2E and 250 ms p95 TTFT) under 6× load
End-to-End Kidney Tumor Classification
Technologies Used: CNN, Clinical Data, Compute Engine, Docker, Airflow, CI/CD
Built an end-to-end image classification pipeline using a modified VGG16 CNN model using PyTorch, achieving an F1 score of 0.96
Developed interpretable ML workflows with feature importance analysis, SHAP-based insights, and calibration for bias reduction
Automated data ingestion, model training, and evaluation pipelines using Airflow, MLflow, and DVC for reproducibility and tracking
Containerized and deployed the trained model as a Flask web app on AWS EC2 via Docker and Git Actions for real-time inference
End-to-End Kidney Tumor Classification
Technologies Used: CNN, Clinical Data, Compute Engine, Docker, Airflow, CI/CD
Built an end-to-end image classification pipeline using a modified VGG16 CNN model using PyTorch, achieving an F1 score of 0.96
Developed interpretable ML workflows with feature importance analysis, SHAP-based insights, and calibration for bias reduction
Automated data ingestion, model training, and evaluation pipelines using Airflow, MLflow, and DVC for reproducibility and tracking
Containerized and deployed the trained model as a Flask web app on AWS EC2 via Docker and Git Actions for real-time inference
End-to-End Kidney Tumor Classification
Technologies Used: CNN, Clinical Data, Compute Engine, Docker, Airflow, CI/CD
Built an end-to-end image classification pipeline using a modified VGG16 CNN model using PyTorch, achieving an F1 score of 0.96
Developed interpretable ML workflows with feature importance analysis, SHAP-based insights, and calibration for bias reduction
Automated data ingestion, model training, and evaluation pipelines using Airflow, MLflow, and DVC for reproducibility and tracking
Containerized and deployed the trained model as a Flask web app on AWS EC2 via Docker and Git Actions for real-time inference
Self-Driving Solar Vehicle

Technologies Used: TensorFlow, Image Processing, OpenCV, Computer Vision, CNN
Built a track-following autonomous solar vehicle by identifying 2 types of cones using OpenCV and a custom CNN model
Improved model mAP to 0.93 by using data augmentation (daylight shifts and motion blurs) and K-Means for anchor-box selection
Deployed on-edge to Nvidia Jetson Nano and integrated STM32 microcontroller using custom low-level drivers (Embedded C) for 24 FPS real-time steering with a patented steering system
Optimized end-to-end inference via pruning and quantization, achieving 120ms latency and 96.7% autonomous navigation accuracy
Self-Driving Solar Vehicle

Technologies Used: TensorFlow, Image Processing, OpenCV, Computer Vision, CNN
Built a track-following autonomous solar vehicle by identifying 2 types of cones using OpenCV and a custom CNN model
Improved model mAP to 0.93 by using data augmentation (daylight shifts and motion blurs) and K-Means for anchor-box selection
Deployed on-edge to Nvidia Jetson Nano and integrated STM32 microcontroller using custom low-level drivers (Embedded C) for 24 FPS real-time steering with a patented steering system
Optimized end-to-end inference via pruning and quantization, achieving 120ms latency and 96.7% autonomous navigation accuracy
Self-Driving Solar Vehicle

Technologies Used: TensorFlow, Image Processing, OpenCV, Computer Vision, CNN
Built a track-following autonomous solar vehicle by identifying 2 types of cones using OpenCV and a custom CNN model
Improved model mAP to 0.93 by using data augmentation (daylight shifts and motion blurs) and K-Means for anchor-box selection
Deployed on-edge to Nvidia Jetson Nano and integrated STM32 microcontroller using custom low-level drivers (Embedded C) for 24 FPS real-time steering with a patented steering system
Optimized end-to-end inference via pruning and quantization, achieving 120ms latency and 96.7% autonomous navigation accuracy
Fantasy Team Recommendation for IPL 2024
Technologies Used: Fine-tuning, QLoRA, Prompt Engineering, NLTK, Transformers
Fine-Tuned Gemma 2 using QLoRA on cricket dataset resulting in 10% improvement in ROUGE score for cricket-specific text generation tasks
Established algorithm using NLTK to extract structured data from unstructured IPL historical data including player stats and match scorecards
Leveraged Prompt engineering (2-shot) to enhance prediction accuracy resulting in 85% accurate team prediction for the IPL matches
Fantasy Team Recommendation for IPL 2024
Technologies Used: Fine-tuning, QLoRA, Prompt Engineering, NLTK, Transformers
Fine-Tuned Gemma 2 using QLoRA on cricket dataset resulting in 10% improvement in ROUGE score for cricket-specific text generation tasks
Established algorithm using NLTK to extract structured data from unstructured IPL historical data including player stats and match scorecards
Leveraged Prompt engineering (2-shot) to enhance prediction accuracy resulting in 85% accurate team prediction for the IPL matches
Fantasy Team Recommendation for IPL 2024
Technologies Used: Fine-tuning, QLoRA, Prompt Engineering, NLTK, Transformers
Fine-Tuned Gemma 2 using QLoRA on cricket dataset resulting in 10% improvement in ROUGE score for cricket-specific text generation tasks
Established algorithm using NLTK to extract structured data from unstructured IPL historical data including player stats and match scorecards
Leveraged Prompt engineering (2-shot) to enhance prediction accuracy resulting in 85% accurate team prediction for the IPL matches
Rent the Runway Fashion Recommender System
Technologies Used: Collaborative filtering, SVD, Cosine Matrix, Web Scraping
Scraped product data and user reviews via BeautifulSoup, creating structured datasets with detailed attributes and feedback metrics
Engineered preprocessing pipelines for text cleaning, tokenization, and feature extraction to enhance review relevance
Developed hybrid recommendation using matrix factorization for collaborative and cosine similarity for content-based filtering
Incorporated incremental SVD updates to address cold-start users and products, cutting system update time by 40%
Rent the Runway Fashion Recommender System
Technologies Used: Collaborative filtering, SVD, Cosine Matrix, Web Scraping
Scraped product data and user reviews via BeautifulSoup, creating structured datasets with detailed attributes and feedback metrics
Engineered preprocessing pipelines for text cleaning, tokenization, and feature extraction to enhance review relevance
Developed hybrid recommendation using matrix factorization for collaborative and cosine similarity for content-based filtering
Incorporated incremental SVD updates to address cold-start users and products, cutting system update time by 40%
Rent the Runway Fashion Recommender System
Technologies Used: Collaborative filtering, SVD, Cosine Matrix, Web Scraping
Scraped product data and user reviews via BeautifulSoup, creating structured datasets with detailed attributes and feedback metrics
Engineered preprocessing pipelines for text cleaning, tokenization, and feature extraction to enhance review relevance
Developed hybrid recommendation using matrix factorization for collaborative and cosine similarity for content-based filtering
Incorporated incremental SVD updates to address cold-start users and products, cutting system update time by 40%
Advanced Performance Metrics for Ultimate Frisbee Athletes
Technologies Used: Predictive Modeling, Bayesian analysis, ML Pipelines
Engineered player rating models by combining XGBoost and mixed-effects linear modeling for context-aware performance evaluation
Designed on/off plus-minus models with Bayesian modeling, increasing predictive accuracy 20% for offensive and defensive impact
Developed composite scoring metrics integrating multiple player features, improving ranking robustness by 15%
Built reproducible pipelines for model training and validation using DVC and MLFlow, ensuring scalability and consistent evaluation
Advanced Performance Metrics for Ultimate Frisbee Athletes
Technologies Used: Predictive Modeling, Bayesian analysis, ML Pipelines
Engineered player rating models by combining XGBoost and mixed-effects linear modeling for context-aware performance evaluation
Designed on/off plus-minus models with Bayesian modeling, increasing predictive accuracy 20% for offensive and defensive impact
Developed composite scoring metrics integrating multiple player features, improving ranking robustness by 15%
Built reproducible pipelines for model training and validation using DVC and MLFlow, ensuring scalability and consistent evaluation
Advanced Performance Metrics for Ultimate Frisbee Athletes
Technologies Used: Predictive Modeling, Bayesian analysis, ML Pipelines
Engineered player rating models by combining XGBoost and mixed-effects linear modeling for context-aware performance evaluation
Designed on/off plus-minus models with Bayesian modeling, increasing predictive accuracy 20% for offensive and defensive impact
Developed composite scoring metrics integrating multiple player features, improving ranking robustness by 15%
Built reproducible pipelines for model training and validation using DVC and MLFlow, ensuring scalability and consistent evaluation
End-to-End SQL Database Chatbot
Technologies Used: Agent, ORM, SQL, LangChain, Groq
Developed an AI Agent using Langchain, SQL, and Gemma 2, allowing users to query databases with Natural Language Queries
Leveraged SQLAlchemy ORM and custom connection handling to support both SQLite and MySQL databases dynamically
Created SQL Database Toolkit with Gemma 2 zero-shot question answering, enhancing the chatbot's ability to handle complex SQL queries accurately
End-to-End SQL Database Chatbot
Technologies Used: Agent, ORM, SQL, LangChain, Groq
Developed an AI Agent using Langchain, SQL, and Gemma 2, allowing users to query databases with Natural Language Queries
Leveraged SQLAlchemy ORM and custom connection handling to support both SQLite and MySQL databases dynamically
Created SQL Database Toolkit with Gemma 2 zero-shot question answering, enhancing the chatbot's ability to handle complex SQL queries accurately
End-to-End SQL Database Chatbot
Technologies Used: Agent, ORM, SQL, LangChain, Groq
Developed an AI Agent using Langchain, SQL, and Gemma 2, allowing users to query databases with Natural Language Queries
Leveraged SQLAlchemy ORM and custom connection handling to support both SQLite and MySQL databases dynamically
Created SQL Database Toolkit with Gemma 2 zero-shot question answering, enhancing the chatbot's ability to handle complex SQL queries accurately
Get in touch with me at
henil.gajjar2102@gmail.com
42.3555° N, 71.0565° W
Thursday, Nov 13, 2025
Get in touch with me at
henil.gajjar2102@gmail.com
42.3555° N, 71.0565° W
Thursday, Nov 13, 2025
Get in touch with me at
henil.gajjar2102@gmail.com
42.3555° N, 71.0565° W
Thursday, Nov 13, 2025
