About
Hi, I’m Henil Gajjar, a data scientist and machine learning engineer based in Boston. With a strong background in AI and technology, I specialize in building intelligent systems, deploying scalable cloud solutions, and optimizing multimodal models for impactful applications. I am deeply passionate about leveraging cutting-edge technologies to create innovative solutions, collaborating on meaningful projects that drive data-driven insights and real-world transformations.
Hi, I’m Henil Gajjar, a data scientist and machine learning engineer based in Boston. With a strong background in AI and technology, I specialize in building intelligent systems, deploying scalable cloud solutions, and optimizing multimodal models for impactful applications. I am deeply passionate about leveraging cutting-edge technologies to create innovative solutions, collaborating on meaningful projects that drive data-driven insights and real-world transformations.


Highlights
Highlights
30+
Projects
Completed
30+
Projects
Completed
30+
Projects
Completed
6+
Years of
Python, SQL, AWS, GCP
6+
Years of
Python, SQL, AWS, GCP
6+
Years of
Python, SQL, AWS, GCP
2+
Years of
Industry Experience
2+
Years of
Industry Experience
5+
Years of
ML, DL, NLP
5+
Years of
ML, DL, NLP
30+
Projects
completed
30+
Projects
completed
2+
Years
of industry experience
2+
Years
of industry experience
2+
Years of
LLMs, Agents, RAGs
2+
Years of
LLMs, Agents, RAGs
2+
Years of
LLMs, Agents, RAGs
2+
Years of
LLMs, Agents, RAGs
5+
Years of
ML, DL, NLP
5+
Years of
ML, DL, NLP
10+
End-to-End
Deployments
10+
End-to-End
Deployments
Tech Stack
Tech Stack
I am dedicated to expanding my knowledge and expertise in my field. Throughout my career, I've acquired various skills, which I continue to perfect.
I am dedicated to expanding my knowledge and expertise in my field. Throughout my career, I've acquired various skills, which I continue to perfect.
Languages
Python, C++, R, SQL, Dart, MATLAB
Languages
Python, C++, R, SQL, Dart, MATLAB
Languages
Python, C++, R, SQL, Dart, MATLAB
Frameworks
Langgraph, Langchain, Hugging Face, Transformers, NLKT, TensorFlow, PyTorch, Flask, Keras, Pandas, Django
Frameworks
Langgraph, Langchain, Hugging Face, Transformers, NLKT, TensorFlow, PyTorch, Flask, Keras, Pandas, Django
Frameworks
Langgraph, Langchain, Hugging Face, Transformers, NLKT, TensorFlow, PyTorch, Flask, Keras, Pandas, Django
(Vector) Databases
FaissDB, ChromaDB, DataStax AstraDB, MySQL, PostgresDB, MongoDB, Google Firebase, Apache Spark, Redis, Hadoop
(Vector) Databases
FaissDB, ChromaDB, DataStax AstraDB, MySQL, PostgresDB, MongoDB, Google Firebase, Apache Spark, Redis, Hadoop
(Vector) Databases
FaissDB, ChromaDB, DataStax AstraDB, MySQL, PostgresDB, MongoDB, Google Firebase, Apache Spark, Redis, Hadoop
Cloud Operations
Google Cloud Platform (Cloud Run, SQL, VertexAI, Artifact Registry, Big Query), Amazon Web Services (Lambda, Bedrock,Sagemaker, ECR, S3) Microsoft Azure (Databricks, AzureML)
Cloud Operations
Google Cloud Platform (Cloud Run, SQL, VertexAI, Artifact Registry, Big Query), Amazon Web Services (Lambda, Bedrock,Sagemaker, ECR, S3) Microsoft Azure (Databricks, AzureML)
Cloud Operations
Google Cloud Platform (Cloud Run, SQL, VertexAI, Artifact Registry, Big Query), Amazon Web Services (Lambda, Bedrock,Sagemaker, ECR, S3) Microsoft Azure (Databricks, AzureML)

CI, CD, CT, CM
MLFlow, WandB Weave, Dagshub, Airflow, Github Actions, FastAPI, Langserve, Docker, Kubernetes, Langsmith, LangFuse
CI, CD, CT, CM
MLFlow, WandB Weave, Dagshub, Airflow, Github Actions, FastAPI, Langserve, Docker, Kubernetes, Langsmith, LangFuse
CI, CD, CT, CM
MLFlow, WandB Weave, Dagshub, Airflow, Github Actions, FastAPI, Langserve, Docker, Kubernetes, Langsmith, LangFuse

Techniques
Generative AI, RAG, Fine-Tuning LLM, Agents and Tools, NLP, Statistical Modeling, Data Modeling, Predictive Modeling
Techniques
Generative AI, RAG, Fine-Tuning LLM, Agents and Tools, NLP, Statistical Modeling, Data Modeling, Predictive Modeling
Techniques
Generative AI, RAG, Fine-Tuning LLM, Agents and Tools, NLP, Statistical Modeling, Data Modeling, Predictive Modeling
Experience
Experience
MY CAREER JOURNEY
I’ve worked with companies and teams, both in corporate environments and as an independent contributor. I thrive on collaborating with organizations that value the power of data-driven solutions and cutting-edge technology.
Edlight PBC
Machine Learning Intern
Sept 2024 – Now
Edlight PBC
Machine Learning Intern
Sept 2024 – Now
Created Ember, a multi-agent chatbot designed to assist middle school teachers with curriculum-related queries. Built using LangGraph and FastAPI, and deployed on AWS Lambda, Ember integrates advanced RAG techniques, boosting faithfulness by 13.4%. Developed a custom SQL-Agent to retrieve curriculum data from PostgresDB, enhancing query relevance by 25% and streamlining access to teaching resources.
Created Ember, a multi-agent chatbot designed to assist middle school teachers with curriculum-related queries. Built using LangGraph and FastAPI, and deployed on AWS Lambda, Ember integrates advanced RAG techniques, boosting faithfulness by 13.4%. Developed a custom SQL-Agent to retrieve curriculum data from PostgresDB, enhancing query relevance by 25% and streamlining access to teaching resources.
Impacter AI
AI Developer Intern (Unpaid)
Sept 2024 – Now
Impacter AI
AI Developer Intern (Unpaid)
Sept 2024 – Now
Built SalesPal, an intelligent sales copilot leveraging Langchain and Azure Databricks to enhance sales strategies through data-driven insights. Developed an automation agent to streamline real-estate listing management on PropStream and integrated an AI-driven email and call agent, boosting sales effectiveness by analyzing customer interactions.
Built SalesPal, an intelligent sales copilot leveraging Langchain and Azure Databricks to enhance sales strategies through data-driven insights. Developed an automation agent to streamline real-estate listing management on PropStream and integrated an AI-driven email and call agent, boosting sales effectiveness by analyzing customer interactions.
Hyperlab Sportech Pvt. Ltd.
Head of AI/ML
Jan 2022 - Aug 2023
Hyperlab Sportech Pvt. Ltd.
Head of AI/ML
Jan 2022 - Aug 2023
Engineered Helios, an ML-driven training app for athletes, achieving 5k+ downloads and a 4.7-star rating within its first week on the App Store and Play Store. Developed HyperAI, a multimodal AI coach powered by AWS Bedrock, Firebase, and AstraDB, providing personalized insights based on athlete history. Built a multimodal RAG pipeline with GPT-4 and Llama 3, integrating vision, text, and audio for tailored coaching. Designed an LSTM model for timeout drills, contributing to a $25M Shark Tank investment.
Engineered Helios, an ML-driven training app for athletes, achieving 5k+ downloads and a 4.7-star rating within its first week on the App Store and Play Store. Developed HyperAI, a multimodal AI coach powered by AWS Bedrock, Firebase, and AstraDB, providing personalized insights based on athlete history. Built a multimodal RAG pipeline with GPT-4 and Llama 3, integrating vision, text, and audio for tailored coaching. Designed an LSTM model for timeout drills, contributing to a $25M Shark Tank investment.
SAE Nirma Collegiate Club
Team Lead
Dec 2020 - Jun 2022
SAE Nirma Collegiate Club
Team Lead
Dec 2020 - Jun 2022
Developed an OpenCV algorithm with 85% accuracy for real-time cone detection and lane navigation. Designed a CNN-based steering model achieving 93% accuracy at 24 FPS, winning the Best Autonomous Round Award. Deployed on Jetson Nano and STM32 Discovery, enabling precise vehicle control. Led the team to victory in Asia’s Largest Electric Solar Vehicle Competition 2021, earning 8/11 trophies and outperforming competitors by 500+ points. Managed the team for SAEINDIA EBAJA 2022, securing 3 trophies, including the Best Acceleration Award, and an overall 5th place finish among 47 teams.
Developed an OpenCV algorithm with 85% accuracy for real-time cone detection and lane navigation. Designed a CNN-based steering model achieving 93% accuracy at 24 FPS, winning the Best Autonomous Round Award. Deployed on Jetson Nano and STM32 Discovery, enabling precise vehicle control. Led the team to victory in Asia’s Largest Electric Solar Vehicle Competition 2021, earning 8/11 trophies and outperforming competitors by 500+ points. Managed the team for SAEINDIA EBAJA 2022, securing 3 trophies, including the Best Acceleration Award, and an overall 5th place finish among 47 teams.
Nirma University
Student Researcher
May 2021 - Dec 2021
Nirma University
Student Researcher
May 2021 - Dec 2021
Curated a comprehensive 150k-row dataset of Li-ion battery packs to extract actionable insights. Implemented a Random Forest regression model to predict optimal charging/discharging temperatures, enhancing battery health by 12%. Applied clustering techniques to analyze current rates and temperature trends across 1,300 charge-discharge cycles, driving data-informed optimization strategies.
Curated a comprehensive 150k-row dataset of Li-ion battery packs to extract actionable insights. Implemented a Random Forest regression model to predict optimal charging/discharging temperatures, enhancing battery health by 12%. Applied clustering techniques to analyze current rates and temperature trends across 1,300 charge-discharge cycles, driving data-informed optimization strategies.
Education
Education
THE LEARNING PILLARS
THE PILLARS
THE LEARNING PILLARS
My education has equipped me with the tools to navigate complex huddles and the curiosity to keep learning.
My education has equipped me with the tools to navigate complex huddles and the curiosity to keep learning.

M.S. in Data Science
Northeastern University
Boston, USA
Expected: Dec 2025
GPA: 4.0
M.S. in Data Science
Northeastern University
Boston, USA
Expected: Dec 2025
GPA: 4.0
M.S. in Data Science
Northeastern University
Boston, USA
Expected: Dec 2025
GPA: 4.0
Related Coursework: Large Language Models, Machine Learning Operations, Supervised and Unsupervised Machine Learning
Related Coursework: Large Language Models, Machine Learning Operations, Supervised and Unsupervised Machine Learning
Part-Times:
Head Teaching Assistant for CS2810 - Mathematics of Data Modelling
Teaching Assistant for DS3000 - Foundations of Data Science
Teaching Assistant for DS4400 - Machine Learning and Data Mining 1
Teaching Assistant for DS4420 - Machine Learning and Data Mining 2
Part-Times:
Head TA for CS2810 - Mathematics of Data Modelling
TA for DS3000 - Foundations of Data Science
TA for DS4400 - Machine Learning and Data Mining 1
TA for DS4420 - Machine Learning and Data Mining 2
Part-Times:
Head Teaching Assistant for CS2810 - Mathematics of Data Modelling
Teaching Assistant for DS3000 - Foundations of Data Science
Teaching Assistant for DS4400 - Machine Learning and Data Mining 1
Teaching Assistant for DS4420 - Machine Learning and Data Mining 2

B.Tech in Electronics and Communication Engg.
Nirma University
Ahmedabad, India
May 2023
GPA: 3.9
B.Tech in Electronics and Comm. Engg.
Nirma University
Ahmedabad, India
May 2023
GPA: 3.9
B.Tech in Electronics and Communication Engg.
Nirma University
Ahmedabad, India
May 2023
GPA: 3.9
Related Coursework: Database Management, Applied Statistics, Machine Learning, Computer Vision, Big Data
Related Coursework: Database Management, Applied Statistics, Machine Learning, Computer Vision, Big Data
Achievements and Awards
Achievements and Awards
1
Certificate of Honour
Nirma University
October 2022
1
Certificate of Honour
Nirma University
October 2022
2
SAEINDIA EBAJA 2022 (National Electric ATV Competition)
SAEINDIA - 3 Awards
June 2022
2
SAEINDIA EBAJA 2022 (National Electric ATV Competition)
SAEINDIA - 3 Awards
June 2022
3
Asia's Biggest Electric Solar Vehicle Championship 2021
ISIE ESVC - 7 Awards
December 2021
3
Asia's Biggest Electric Solar Vehicle Championship 2021
ISIE ESVC - 7 Awards
December 2021
Patents and Publications
Patents and Publications
1
Steering System for Autonomous Solar Electric Vehicle
The Patent Office, Govt. of India
May 2022
1
Steering System for Autonomous Solar Electric Vehicle
The Patent Office, Govt. of India
May 2022
2
A comprehensive study on lane detecting autonomous car using computer vision
Expert Systems with Applications 233, 120929
2023
2
A comprehensive study on lane detecting autonomous car using computer vision
Expert Systems with Applications 233, 120929
2023
3
Comparative Analysis of CNN models for Self-Driving Cars in a Simulated Environment
2023 3rd International Conference on Range Technology (ICORT)
2023
3
Comparative Analysis of CNN models for Self-Driving Cars in a Simulated Environment
2023 3rd International Conference on Range Technology (ICORT)
2023
4
A Comparative Analysis of Various Deep-Learning Models for Noise Suppression
EAI Endorsed Transactions on Internet of Things
2024
4
A Comparative Analysis of Various Deep-Learning Models for Noise Suppression
EAI Endorsed Transactions on Internet of Things
2024
5
Enhancing Home Automation: A Smart System with M5 Stack and Multiple Control Interfaces
2023 IEEE 11th Region 10 Humanitarian Technology Conference (R10-HTC), 335-340
2023
5
Enhancing Home Automation: A Smart System with M5 Stack and Multiple Control Interfaces
2023 IEEE 11th Region 10 Humanitarian Technology Conference (R10-HTC), 335-340
2023
6
Low Voltage Systems for Electric ATVs
2023 IEEE 3rd International Conference on Sustainable Energy and Future
2023
6
Low Voltage Systems for Electric ATVs
2023 IEEE 3rd International Conference on Sustainable Energy and Future
2023
7
Comparative Analysis of CNN models for Self-Driving Cars in a Simulated Environment
2023 3rd International Conference on Range Technology (ICORT)
2023
7
Comparative Analysis of CNN models for Self-Driving Cars in a Simulated Environment
2023 3rd International Conference on Range Technology (ICORT)
2023