🌿 UBC campus
🏔 Vancouver
💻 shipping code
🌊 Pacific coast
🤖 ML things
🏏 cricket always
🎓 MDS @ UBC
📍 Vancouver, BC
✦ 30+ clients
🚀 ships on time
👋 Hey, I'm Ojasv Issar 🧑‍💻
a data scientist who DELIVERS 🚀
let's build together!

Data Scientist · LLM Integration · RAG Pipelines · AWS Bedrock · ML Engineer · Computer Vision · Vancouver 🍁 · PyTorch · NLP · MDS @ UBC · 30+ Clients · Ships on time ✓ · Open to work · LangChain · FastText · FAISS
HuggingFace · Hackathon Winner 🏆 · AWS SAM · Docker · Published Research 📄 · XGBoost · Scikit-learn · Delhi → Vancouver · Snowflake · pgvector · Springer LNNS 🎓 · Pandas · R · Cricket & Code 🏏 · Transformers
⚡ currently

What I'm working on

Active
🎓 MDS @ UBC

Finishing my Master of Data Science at UBC — coursework in unsupervised ML, NLP, Bayesian stats, and more. Graduating June 2026.

🎓
Freelancing
🔧 Freelance

Taking on freelance ML, data science, and LLM integration projects alongside studies. 30+ clients served across edtech, fintech, and e-commerce — fully remote.

Work with me →
💼
Applying
🚀 Job Search

Actively looking for ML / LLM engineering roles in Canada. Available to start soon. Let's talk if you're hiring.

Get in touch →
🎯

📊 by the numbers
30+
happy clients
🤝
4+
years experience
📦
2M+
users impacted
👥
0
avg performance gains
📈
1
published paper 🎉
📄
0
unfinished projects
FUN FACT

I classified 958,524 asteroids using a NASA/JPL dataset. So yes, I've technically done planetary defense research. 🪐

🛠 what I do

Services that move the needle

🤖
ML System Design

End-to-end model development — problem scoping, training, evaluation, deployment. All production-ready.

PyTorch · TensorFlow · HuggingFace
🧠
LLM & GenAI Integration

RAG pipelines, vector search, multi-model architectures. Grounded, fast, cost-efficient — on Claude, GPT, Bedrock.

LangChain · FAISS · Bedrock
👁
Computer Vision

Object detection, classification, visual pipelines. QA, e-commerce, security. With explainability.

OpenCV · YOLO · Grad-CAM
🔤
NLP & Multilingual AI

Sentiment, classification, multilingual. Arabic NLP on XLM-RoBERTa across 15K+ reviews.

Transformers · XLM-RoBERTa · spaCy
⚙️
Data Engineering

Scalable ETL, warehouse migrations, real-time streaming. The backbone your ML needs to survive.

Kafka · Airflow · Snowflake
☁️
Cloud & MLOps

Lambda pipelines, containerised models, auto-scaling. Not done until it's live and monitored.

AWS Lambda · Docker · SAM

✦ skills

What's in my toolkit

Python · PyTorch · AWS Bedrock · LangChain · LLM Integration · TensorFlow · scikit-learn · RAG Pipelines · FAISS · HuggingFace · XGBoost · AWS Lambda · Docker · SQL · Snowflake · Kafka · Airflow · Computer Vision · OpenCV · spaCy · R · PySpark · Streamlit · NLP · XLM-RoBERTa · Grad-CAM · AWS SAM · LlamaIndex



🚀 selected projects

Things I've actually shipped

📡 Jan 2026
Serverless · GenAI · AWS
Reddit UBC Reporter
Dual-model Bedrock pipeline — Llama 3 for post categorisation with confidence scoring, Claude Sonnet for abstractive weekly summaries. 4 independent Lambda functions via AWS SAM, EventBridge scheduling, S3 persistence, Postmark email dispatch.
Bedrock · Llama 3 · Claude · Lambda · SAM · EventBridge
🌾 2025
Vision Transformer · Climate ML
Crop Yield Forecasting
Reimplemented MMST-ViT for satellite-weather fusion. Stress-tested across Heatwave, Drought, and Extreme Rainfall partitions. Measured RMSE degradation under distribution shift; retrained with climate augmentation — temperature offsets, rainfall scaling, extreme-year oversampling.
MMST-ViT · PyTorch · Satellite · ViT · Climate
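To make the three augmentation ideas named above concrete, here is a toy sketch in plain Python. It is not the project's code: the record schema (`year`, `temp_c`, `rain_mm`), the function name `augment_climate`, and every default value are assumptions chosen for illustration, and the real pipeline works on satellite and weather tensors rather than small dicts.

```python
import random


def augment_climate(records, temp_offset=1.5, rain_scale=0.8,
                    extreme_years=(), oversample=2, seed=0):
    """Return the original records plus perturbed and oversampled copies.

    Each record is a dict with 'year', 'temp_c', and 'rain_mm' keys
    (a hypothetical schema used only in this sketch).
    """
    rng = random.Random(seed)
    augmented = list(records)
    for rec in records:
        # Temperature offset: add a fixed shift to simulate a warmer season.
        augmented.append({**rec, "temp_c": rec["temp_c"] + temp_offset})
        # Rainfall scaling: multiply precipitation to mimic a drier or wetter season.
        augmented.append({**rec, "rain_mm": rec["rain_mm"] * rain_scale})
        # Extreme-year oversampling: repeat records from flagged years.
        if rec["year"] in extreme_years:
            augmented.extend(dict(rec) for _ in range(oversample))
    # Shuffle so duplicated records are not clustered together in training order.
    rng.shuffle(augmented)
    return augmented


# Made-up values, for illustration only.
records = [
    {"year": 2011, "temp_c": 24.0, "rain_mm": 100.0},
    {"year": 2012, "temp_c": 27.0, "rain_mm": 40.0},
]
out = augment_climate(records, extreme_years={2012})
```

With two input records and one flagged year, the result holds the two originals, four perturbed copies, and two extra copies of the flagged record.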
🌡 Dec 2025
Forecasting · Time-series
Global Temp Forecasting
222-year Berkeley Earth dataset. Benchmarked OLS, Random Forest, and kernel SVR on RMSE/MAE/R². SVR projected a 2030 land temperature of 10.56°C, about 2°C above the 1951–80 baseline. Containerised with Docker, validated via pytest, reproducible Quarto report.
SVR · Random Forest · Docker · Quarto · pytest
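For anyone less familiar with the three benchmark metrics named above, here is a minimal sketch of how RMSE, MAE, and R² are computed, in plain Python. The function name `regression_metrics` and the sample temperatures are illustrative assumptions and do not come from the project or the Berkeley Earth data.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return (rmse, mae, r2) for two equal-length sequences of numbers."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    # RMSE penalises large errors more heavily than MAE does.
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    mae = sum(abs(e) for e in errors) / n
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    # R² is undefined when the targets are constant; report NaN in that case.
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return rmse, mae, r2


# Made-up temperatures in °C, for illustration only.
observed = [8.0, 9.0, 10.0, 11.0]
predicted = [8.5, 9.0, 9.5, 11.0]
rmse, mae, r2 = regression_metrics(observed, predicted)
```

Running the same function over each model's held-out predictions puts them on one scale for comparison: lower RMSE and MAE, and R² closer to 1, indicate a better fit.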
🌋 2026
Dashboard · Data Viz · UBC MDS
DisasterDash
Interactive global disaster analytics dashboard (DSCI 532). Plotly Dash app visualising EM-DAT disaster records across event types, geographies, and time — built for researchers and policymakers. Fully reproducible data pipeline with automated preprocessing.
Dash · Plotly · EM-DAT · Python · UBC MDS
☄️ Nov 2025
Classification · Imbalanced Data
Asteroid Hazard Classifier
Binary PHO classifier on 958,524 NASA/JPL asteroid records. RFE + SMOTENC for severe class imbalance. XGBoost and RF ensembles — eccentricity and perihelion distance as top-2 risk predictors. Published ICICC 2026, Springer LNNS.
XGBoost · SMOTENC · RFE · NASA/JPL · Springer
🏙 2024
EDA · Geospatial Analysis
NYC Airbnb Analysis
EDA on 50,000+ NYC Airbnb listings from NYC Open Data. Choropleth and hex-bin geospatial visualisations surfacing borough-level pricing disparities. OLS and feature-importance analysis to identify key price drivers — room type, location entropy, host listing count.
Pandas · GeoPandas · Seaborn · OLS · Folium
✍ writing

When I write about it

📝
Medium · AI · Climate
AI in the Fight Against Climate Change
Published on Medium
Predicting environmental trends before it's too late — a deep dive into how machine learning is being applied to climate data, what the models are actually telling us, and why it matters beyond the research paper.
Climate ML · AI · Medium
Read on Medium →

👋 about me

I'm a Data Scientist & ML Engineer from Delhi, currently living in Vancouver and finishing my Master of Data Science at UBC.

I've been building ML systems professionally for 3.5+ years — everything from LLM pipelines on AWS to computer vision. 30+ clients across edtech, fintech, and e-commerce. Zero unfinished projects.

When I'm not writing Python I'm watching cricket, at the gym, or hiking somewhere in BC. Published researcher. Graduate TA. Occasional planetary defender.

from Delhi → Vancouver 🍁
2025 →
Master of Data Science 🎓
University of British Columbia · Vancouver
2024
Data Analyst Intern 📊
Spartan Poker · Gurugram
2022 →
Freelance Data Scientist 🔧
30+ clients · Remote
2020–24
B.Tech CS — Data Science 📚
Symbiosis Institute of Technology · Pune

📄 research

Published work

ICICC 2026
Published
SneakerXchange: Enhancing User Experience through HCI Principles
9th International Conference on Innovative Computing & Communication · Springer Lecture Notes in Networks and Systems · Top 15% acceptance rate
HCI · UX Research · User Studies · Springer LNNS · Top 15%

💬 what clients say

Real words, real people

★★★★★

Ojasv implemented BERT-based sequential sentence classification and LDA topic modelling pipelines for my NLP research at Radboud. The tokenisation logic, attention-mask handling, and hyperparameter sweep were all production-quality. Invaluable for my PhD thesis.

AP
Amna Pottarath
PhD Candidate · Radboud University, Netherlands
★★★★★

Ojasv delivered a full Boston Housing regression pipeline in R — feature engineering, multicollinearity diagnostics via VIF, stepwise AIC model selection, and residual analysis. He explained the OLS assumptions clearly enough that I could defend the methodology in my exam.

PJ
Pearl Jindal
HR Manager · CODICE USA · MBA Kogod School, American University
★★★★★

Ojasv handled cross-lingual scraping, bilingual Arabic-English preprocessing with custom tokenisation, and fine-tuned XLM-RoBERTa for multilingual sentiment classification. The F1 scores on Arabic test data exceeded our in-house baseline by a significant margin.

FM
Faisal Abdulrazaq
PhD Candidate · Maynooth University, Ireland
★★★★★

He walked me through logistic regression, decision trees, and ensemble methods in R for Business Analysis at SMU — tuning regularisation parameters, interpreting ROC-AUC curves, and writing reproducible R Markdown reports. Thorough, precise, and always on time.

AA
Ayman Aboobacker
MSc Management · Singapore Management University
★★★★★

Ojasv built custom Power BI DAX measures, parameterised SQL queries, and Python automation scripts tailored to State Street's analytical workflows — then walked me through each piece clearly enough to answer technical interview questions with confidence.

MP
Madhuri Piprotar
Investor Services · State Street, Poland
★★★★★

Ojasv independently delivered a deep learning pipeline for Stock Price Prediction — LSTM architecture with sliding-window sequence encoding, dropout regularisation, and backtesting — plus a full Real-Time Traffic Signal Optimisation thesis from scratch. Both were exceptional.

VJ
Vishal Jaiswal
Freelance Data Science Consultant · India



don't be a stranger 👋

Wanna work together?

I respond within 24h. Always honest about what's feasible.

ojasvissar4@gmail.com