Ayush Gupta
AI & Deep Learning
Data Engineering
Cloud Computing
Machine Learning
Big Data
NLP
Duke University
3.96 / 4.0 GPA
4+ yrs
Experience
NASA
Project Lead
$200k+
Funding Secured (Scholarships & Grants)
About Me
I’m a Generative AI Engineer and Data Science graduate from Duke University, passionate about building end-to-end AI solutions and tackling complex problems with data. I currently lead a NASA-funded research project on methane emissions detection using satellite data, leveraging expertise in LLMs, Deep Learning, and Cloud Computing to drive real-world impact and innovation.
Education
Duke University
M.S. Data Science | GPA: 3.96/4.0 | Aug 2023 – May 2025
University of Petroleum & Energy Studies, India
Bachelor of Technology (B. Tech), Production Engineering | May 2015 – Apr 2019

Current Focus
Leading a NASA-funded research project on methane emissions detection using satellite imagery and deep learning techniques.
Experience
My professional journey in data science and AI engineering.
AI Engineer
Geolabe (NASA-partnered project) | May 2024 – Current
- • Developed an AI model to detect oil and gas assets from high-resolution Sentinel-2 satellite imagery, leveraging Google Earth Engine, GCP, and Kubernetes for large-scale data processing. Trained on 2M+ geospatial images using CNNs and parallel GPU pipelines, enabling the development of a U.S.-wide application for automated energy infrastructure monitoring.
- • Developed an end-to-end geospatial data pipeline using Python and BigQuery, significantly improving the pre-processing stage of the Methane Detection AI model. This led to an increase in detection accuracy from 85% to 97% and improved processing scalability by 1,000×.
- • Built an Agentic AI system that autonomously scans all SBIR/STTR solicitations across U.S. agencies, identifies geospatially relevant topics, verifies their status on official portals, and generates daily intelligence reports in Markdown and CSV. Assisted in securing multiple non-dilutive funding grants (SBIRs) using the AI agent totaling over $2,000,000, including a grant from NASA and DoD. The main application enables methane monitoring at 1/10th the cost of currently available methods.
Data Scientist
Citizens Bank | Sep 2024 – Current
- • Developed a predictive model (Marketing Mix Model) to analyze the impact of various marketing channels and improve sales for individual business products
- • Incorporated the time-lagged impact of each marketing channel (Adstock) and leveraged advanced statistical techniques (Mixed Models), integrating seasonality, holiday effects, and macroeconomic variables
Technical Project Manager
Duke Health (Data +) | May 2024 – Jul 2024
- • Led a project to boost the response rate of patient-entered questionnaires by 43% for Duke Health, using data analytics to pinpoint response time delays and optimize follow-up strategies, resulting in improved patient engagement
Sr. Production Data Analyst
Reliance Industries Limited | Jul 2019 – Jul 2023
- • Optimized error rates in flow meters (used to measure oil production) by 15% using regression modeling. Utilized Python for modeling and SQL for ETL operations
- • Improved production forecasting efficiency from 85% to 99%, thus making precise predictions
- • Saved ~ USD 500,000 in future penalties with these accurate predictions, received special recognition for project
- • Supported the commissioning of a $5 billion oil & gas project as Operations Analytics Lead, managing a team of 12 people
Projects
Showcasing my technical projects and implementations.
Detecting Safety Helmets in the Manufacturing Industry (Explainable AI Project)
Industrial Safety with AI | Apr 2025
Built an AI-based object detection model to identify safety helmets in manufacturing environments, using deep learning and Explainable AI (XAI) to visualize model focus and decision rationale.
Used Explainable AI (XAI) to see that the model’s layer focused not only on helmets but also showed bias toward safety vests. Developing an Agentic AI safety system that autonomously monitors manufacturing video streams, detects helmet compliance using YOLOv8, applies Explainable AI (Grad-CAM) for decision transparency, and generates safety compliance reports.
Out of Distribution detection in Large Language Problems (LLMs)
LLM & GenAI | Apr 2025
Fine-tuned Large Language Models (LLMs) for Out-of-Distribution (OOD) detection using synthetic adversarial datasets, achieving AUC scores of 0.99 for near- and far-OOD tasks, significantly improving model reliability.
Developed a modular evaluation pipeline to identify and benchmark LLM failures under distributional shifts, contributing to LLM observability, safety, and real-world deployment readiness.
Nutri AI
Generative AI Hackathon by Open AI | Nov 2023
Built a personalized nutrition app with a chatbot by leveraging scientific nutritional research and FDA data to provide personalized food recommendations based on users' dietary preferences, health goals, and nutritional needs.
Improved context with OpenAI API, Google Maps APIs and Prompt Engineering.
Stock Price Analysis Application
Infrastructure as Code Implementation | Sep 2023
Developed a stock price analysis application by using PySpark for Data Wrangling, Spark SQL for data querying, and ETL operations, and Delta Lake for reliable and scalable data storage and deployed using Kubernetes.
Skills
My technical expertise and professional capabilities.
Technical Skills
Languages & Libraries
Soft Skills
Awards
Recognition for my academic and professional achievements.
Rajiv Gandhi Scholarship
2024
For Extraordinary Research & Academic Excellence
Funding of $70,000
Dean's Research Award
2023
For Master's Students at Duke University
R-Samman Award
2021
Reliance Industries Limited
Get in Touch
Feel free to reach out if you want to collaborate or just say hello.
Tel: +1 919-638-2462