OM LAKHIA

DATA SCIENTIST

M.S. Statistical Data Science

SPECIALIZATION

Python • SQL • Machine Learning

ABOUT


I DELIVER RESULTS. CUT REPORTING TIME 30% BY AUTOMATING ETL PIPELINES AT UL SOLUTIONS. BUILT PREDICTIVE HEALTH MODELS THAT UNCOVER RISK PATTERNS. TRAINED 100+ STUDENTS IN TABLEAU AND POWER BI. M.S. STATISTICAL DATA SCIENCE CANDIDATE (3.7 GPA) WHO TRANSFORMS RAW DATA INTO BUSINESS ACTION.

PYTHON, SQL, MACHINE LEARNING. PROVEN TRACK RECORD TURNING COMPLEX DATASETS INTO CLEAR INSIGHTS THAT DRIVE DECISIONS. READY TO POWER YOUR DATA-DRIVEN SUCCESS.

EXPERIENCE

HANDS-ON EXPERIENCE AT UL SOLUTIONS, SFSU BLOOMBERG LAB, AND MULTIPLE INTERNSHIPS. DELIVERED PROJECTS THAT REDUCED REPORTING TIME BY 30%.

TRAINED 100+ STUDENTS IN DATA VISUALIZATION TOOLS. AUTOMATED ETL PIPELINES AND IMPROVED DATA QUALITY ACROSS ORGANIZATIONS.

30% REPORTING TIME REDUCTION
100+ STUDENTS TRAINED
3.7 GPA

EXPERIENCE

SAR DATA INTERN

UL Solutions
Fremont, CA
May 2025 – Aug 2025
Collaborated with cross-functional teams to evaluate and deploy Python and SQL models, cutting reporting time by 30%
Designed Power BI dashboards that translated data into actionable insights for compliance and operations leaders
Automated ETL and validation steps, improving data quality and supporting adoption of analytics workflows
Communicated findings and model outputs clearly, enabling non-technical stakeholders to act with confidence
INDUSTRY

BLOOMBERG LAB ASSISTANT

San Francisco State University
San Francisco, CA
Jan 2025 – Present
Partnered with faculty and students to define project requirements and deliver datasets for analysis
Developed SQL queries and Tableau dashboards that identified financial and operational patterns
Trained 100+ students in data visualization workflows, strengthening collaboration and communication skills
Ensured reproducibility and accuracy of datasets, preventing common analytics pitfalls
ACADEMIC

DATA SCIENTIST INTERN

Kintu Designs IT
Surat, India
Dec 2023 – May 2024
Migrated raw business data into PostgreSQL, applying preprocessing to ensure model-ready datasets
Built Tableau dashboards that revealed trends and informed client strategies
Applied regression and forecasting models in Python to support business planning
Standardized reporting processes, reducing turnaround time by 40% and improving adoption
INDUSTRY

DATA ANALYST INTERN

Brainy Beams
Ahmedabad, India
May 2023 – Jun 2023
Converted manual Excel workflows into SQL pipelines, improving scalability and accuracy
Built Power BI dashboards that simplified complex data for client decision-making
Conducted data validation checks, ensuring integrity before reporting
Recommended process improvements that enhanced long-term efficiency
INDUSTRY

SKILLS

PROGRAMMING & DATABASES

Python (Pandas, NumPy, Scikit-learn)
SQL
R
PostgreSQL
MySQL

ANALYTICS & ML

Data Modeling
Regression & Classification
Forecasting
Hypothesis Testing

VISUALIZATION

Tableau
Power BI
Excel (Pivot Tables, Charts)

DATA MANAGEMENT

ETL Pipelines
Data Preprocessing
Data Cleaning & Validation
Documentation

COLLABORATION

Stakeholder Engagement
Teamwork
Requirement Gathering
Presentation of Insights

Featured Projects

Explore my portfolio of data science projects with advanced analytics, interactive visualizations, and comprehensive performance metrics.

Machine Learning
Stock Price Prediction using SARIMAX

Advanced time series forecasting model using Seasonal AutoRegressive Integrated Moving Average with eXogenous variables to predict stock market trends with high accuracy.

Python • SARIMAX • Pandas • NumPy • Matplotlib • Time Series Analysis

Healthcare Analytics
Heart Disease Prediction

Machine learning classification model using clinical features to predict cardiovascular disease risk, enabling early intervention and preventive healthcare strategies.

Python • Random Forest • Scikit-learn • Pandas • ROC/AUC Analysis

Health Research
Seasonal Health Patterns Analysis

Comprehensive analysis of seasonal variations in aging-associated health measures, examining Alzheimer's risk factors and mental health patterns across different time periods.

Python • Statistical Analysis • Pandas • Seaborn • Time Series • Healthcare Data

Explore More Projects

Visit my GitHub profile to see additional projects, contributions, and ongoing research in data science and machine learning.


Education & Certifications

Education

In Progress
3.7 GPA

Master of Science in Statistical Data Science

San Francisco State University

San Francisco, CA
Aug 2024 – Present
Relevant Coursework
Advanced Statistical Methods • Machine Learning & Predictive Analytics • Big Data Analytics • Statistical Computing with R/Python • Time Series Analysis • Bayesian Statistics • Data Mining & Pattern Recognition • Experimental Design

Completed
3.96 GPA

Bachelor of Technology in Computer Science and Engineering

CHARUSAT University

India
Aug 2020 – May 2024
Relevant Coursework
Artificial Intelligence & Machine Learning • Data Structures & Algorithms • Database Management Systems • Computer Vision & Image Processing • Natural Language Processing • Software Engineering • Operating Systems • Computer Networks • Web Development • Object-Oriented Programming

Certifications

Recent • Apr 2025

Data Engineering on AWS – Foundations

AWS

Recent • Dec 2024

Storytelling with Data

DataCamp

Continuous Learning: I'm committed to staying current with the latest developments in data science and analytics. Currently pursuing additional certifications in cloud computing and advanced machine learning techniques.

Get In Touch

I'm always interested in discussing new opportunities, collaborations, or just connecting with fellow data enthusiasts. Feel free to reach out!

Contact Information

Location

San Francisco, CA

Ready to Collaborate?

Whether you're looking for a data scientist to join your team, need consultation on a project, or want to discuss the latest trends in machine learning, I'd love to hear from you.