Portfolio

A collection of my data science and machine learning projects — blending statistics, code, and creativity to build meaningful solutions.

  • All
  • LLMs
  • Machine Learning
  • Computer Vision
  • NLP
  • R / Statistics
  • Neural Networks
Aspect Sentiment BI

Aspect Sentiment BI

Reddit sentiment pipeline with NLP, ML models, Airflow, and BI dashboards.

PythonNLPAirflow
Smart Invoice Extractor

Smart Invoice Extractor

OCR + NLP pipeline with FastAPI, Streamlit, and Docker for invoice parsing.

OCRNLPFastAPI
smartcurvefit R package

smartcurvefit

R package for robust nonlinear curve fitting using Rcpp.

RRcpp
US Crime GAM

US Crime GAM

Violent crime analysis using Generalized Additive Models.

RStatisticsML
Heart Attack Prediction

Heart Attack Prediction

Scalable classification pipeline with PySpark and Hadoop.

PySparkHadoop
Cricket Analytics App

Cricket Analytics App

Interactive R Shiny app for cricket performance visualization.

RShiny
Neural Networks Applied

Neural Networks Applied

Deep learning experiments on structured and image datasets.

TensorFlowKeras
Spirals Parkinson Dataset

Spirals (Parkinson Dataset)

Clustering spiral dataset in R to distinguish Parkinson vs healthy patients (KMeans & Hierarchical clustering).

RClusteringML
Music Genre Classification

Music Genre Classification

Kaggle project predicting genres from audio features.

MLKaggle
Image Classification

Image Classification

Deep learning CNNs for image classification tasks.

PythonCNNVision