projects
Some projects.
2025
LLM from Scratch
Implemented BPE tokenizer, Transformer decoder architecture, and autoregressive pretraining pipeline end-to-end in PyTorch
code
Vision Transformer from Scratch
Implemented ViT architecture and training pipeline from scratch in PyTorch for image classification
code
2024
Semantic Segmentation
Built U-Net for urban scene understanding; achieved 88.5% test accuracy and 0.40 mean IoU
code
GAN for Image Generation
Implemented Generator and Discriminator from scratch for MNIST image synthesis
code
Human Activity Recognition
Benchmarked LSTM, BiLSTM, and 1D-CNN on multivariate time series; best F1 of 0.630 with 1D-CNN
code
Multivariate Time Series Forecasting
Two-layer BiLSTM on 7-feature sequential dataset; RMSE of 4.03
code
Clustering Benchmarking
Compared LDA, QDA, KNN, and SVM; best F1 0.800 with SVM. K-Means clustering with PCA on MNIST achieving 0.790 purity
code
Boston Housing Price Prediction
Built a feedforward neural network for regression; achieved very low MSE of 0.00457
code
Wine Quality Prediction
Developed neural network with regularization (dropout, early stopping); achieved MSE of 0.0127
code
Heart Disease Classification Benchmark
Compared LDA, QDA, and KNN models; achieved best accuracy of 0.7527
code
DILLEMA: Robustness Testing for Vision Models
Designed framework to uncover failure cases in deep vision systems; detected up to 53.3% misclassification in ImageNet models
code
Statistical Learning: Linear Regression
Implemented OLS regression with hypothesis testing; achieved R² score of 0.897
code
Iris Classification with SVM
Applied SVM and benchmarked against classical models; achieved F1 score of 0.7999
code
Leaf Image Classification
Trained deep CNN (Inception-ResNet) for 15-class classification; achieved PRC score of 0.8867
code
CNN for Image Classification (MNIST & CIFAR-10)
Implemented LeNet and VGG-like models; achieved F1 0.9899 (MNIST) and 0.7866 (CIFAR-10)
code
Transfer Learning with MobileNetV2
Improved CNN performance via fine-tuning vs retraining on limited datasets
code
Bias-Variance Trade-off Analysis
Analyzed impact of model complexity on generalization in regression tasks
code
Image Augmentation for CNNs
Improved model robustness using augmentation techniques (rotation, flip, brightness)
code
Feature Selection for MPG Dataset
Applied forward/backward selection with cross-validation; achieved R² of 0.8462
code
Fully Convolutional Network for Heatmaps
Built FCN to generate spatial heatmaps for object localization without dense layers
code