Projects.deploy()
The trifecta: Efficiency, Responsibility, and Hard Engineering.
SkinIQ
A dermatologist in your pocket — from model training to production deployment.
Fine-tuned EfficientNet-B3 on HAM10000 (about 10,000 dermoscopy images across 7 skin condition classes), using weighted cross-entropy loss and a WeightedRandomSampler to handle the 58:1 class imbalance. Deployed a Dockerized FastAPI inference server on Railway, with model weights hosted on HuggingFace Hub. The full product includes scan history, personalized skincare routines, progress visualization, and routine calendar tracking.
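A minimal sketch of how the imbalance handling could be set up, not the project's actual training code. It assumes inverse-frequency weighting and uses the commonly cited HAM10000 per-class counts; the helper names are hypothetical. The resulting class weights are the kind of values one would pass to a weighted cross-entropy loss, and the per-sample weights are what a weighted sampler draws from.

```python
# Commonly cited HAM10000 per-class image counts (assumption: not taken from this project).
CLASS_COUNTS = {
    "nv": 6705, "mel": 1113, "bkl": 1099, "bcc": 514,
    "akiec": 327, "vasc": 142, "df": 115,
}


def inverse_frequency_weights(counts):
    """Weight each class by total / (num_classes * class_count)."""
    total = sum(counts.values())
    n_classes = len(counts)
    return {name: total / (n_classes * n) for name, n in counts.items()}


def per_sample_weights(labels, class_weights):
    """One weight per training example, looked up from its class label."""
    return [class_weights[label] for label in labels]


class_weights = inverse_frequency_weights(CLASS_COUNTS)

# Ratio between the largest and smallest class (the "58:1" imbalance).
imbalance_ratio = max(CLASS_COUNTS.values()) / min(CLASS_COUNTS.values())
```

In a PyTorch training loop, `class_weights` would typically be converted to a tensor for `torch.nn.CrossEntropyLoss(weight=...)`, and the output of `per_sample_weights` would be given to `torch.utils.data.WeightedRandomSampler`.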
Prompt Optima
Reframing AI performance as a cost problem.
Team project with @Rushil1234 at HackHarvard 2024: middleware that removes low-entropy tokens before a request reaches a paid API endpoint, cutting token counts by 60–95% depending on input redundancy. My contribution: the compression scheduler, which decides what to strip and what to preserve based on a downstream task signal, and the Gemini API integration.
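One way to picture low-entropy token removal is the sketch below. It is an illustration under stated assumptions, not the project's scheduler: it estimates each token's surprisal from its frequency within the prompt itself, splits on whitespace instead of using a real tokenizer, and stands in for the downstream task signal with a hypothetical `protected` set of tokens that must survive.

```python
import math
from collections import Counter


def compress_prompt(text, min_surprisal_bits=2.0, protected=frozenset()):
    """Drop tokens whose in-prompt surprisal falls below a threshold.

    Surprisal is -log2(p), where p is the token's relative frequency
    in this prompt. Highly repeated tokens carry little information.
    """
    tokens = text.split()
    counts = Counter(tok.lower() for tok in tokens)
    total = len(tokens)
    kept = []
    for tok in tokens:
        key = tok.lower()
        surprisal = -math.log2(counts[key] / total)
        if surprisal >= min_surprisal_bits or key in protected:
            kept.append(tok)
    return " ".join(kept)


def token_reduction(original, compressed):
    """Fraction of whitespace tokens removed (0.0 for an empty prompt)."""
    n_original = len(original.split())
    if n_original == 0:
        return 0.0
    return 1 - len(compressed.split()) / n_original
```

For example, `compress_prompt("the the the the cat sat on the the mat")` keeps only the four tokens that appear once, a 60% reduction on this deliberately redundant input.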
Explainable AI Research
Improving algorithmic recourse beyond standard proximity-based CFEs.
Validated the Acceptability & Weighted Proximity (AWP) model to improve algorithmic recourse beyond standard proximity-based Counterfactual Explanations (CFEs). Engineered evaluation pipelines that model user-specific acceptability thresholds and align feature-change costs with human feasibility constraints.
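The idea of weighting proximity by feature-change cost and filtering by acceptability can be sketched as follows. This is an assumed simplification for illustration, not the AWP model's actual formulation: cost is a weighted L1 distance, acceptability is a per-feature cap on how much a user is willing to change, and all function names are hypothetical.

```python
def weighted_proximity(x, x_cf, weights):
    """Weighted L1 distance: costly-to-change features get larger weights."""
    return sum(weights[f] * abs(x_cf[f] - x[f]) for f in x)


def is_acceptable(x, x_cf, max_change):
    """True if no feature moves further than the user's stated threshold.

    Features missing from max_change are treated as unconstrained.
    """
    return all(
        abs(x_cf[f] - x[f]) <= max_change.get(f, float("inf")) for f in x
    )


def best_recourse(x, candidates, weights, max_change):
    """Cheapest acceptable counterfactual, or None if none is acceptable."""
    feasible = [c for c in candidates if is_acceptable(x, c, max_change)]
    return min(
        feasible,
        key=lambda c: weighted_proximity(x, c, weights),
        default=None,
    )
```

Under this sketch, two counterfactuals at the same raw distance can rank differently once one of them requires changing a feature the user finds hard to move.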
Distributed JBOD System
High-concurrency storage built from scratch.
Architected a distributed storage system supporting network-mounted volumes with strict data integrity. Uses custom socket protocols and mutex synchronization to serve concurrent clients at high throughput.
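As a rough illustration of the two mechanisms named above, the sketch below shows a length-prefixed frame with a checksum and a block store guarded by a mutex. Everything here is an assumption made for the example: the header layout, the CRC32 integrity check, the class and function names, and the use of Python, which may not be the language of the original system.

```python
import struct
import threading
import zlib

# Hypothetical frame header: opcode (1 byte), block id (4 bytes),
# payload length (4 bytes), all in network byte order.
HEADER = struct.Struct("!BII")
TRAILER = struct.Struct("!I")  # CRC32 of the payload


def encode_frame(opcode, block_id, payload=b""):
    """Serialize one request or response into a self-delimiting frame."""
    header = HEADER.pack(opcode, block_id, len(payload))
    return header + payload + TRAILER.pack(zlib.crc32(payload))


def decode_frame(frame):
    """Parse a frame and reject it if the payload checksum does not match."""
    opcode, block_id, length = HEADER.unpack_from(frame)
    payload = frame[HEADER.size:HEADER.size + length]
    (crc,) = TRAILER.unpack_from(frame, HEADER.size + length)
    if zlib.crc32(payload) != crc:
        raise ValueError("checksum mismatch")
    return opcode, block_id, payload


class BlockStore:
    """In-memory block store; a lock serializes access from client threads."""

    def __init__(self):
        self._blocks = {}
        self._lock = threading.Lock()

    def write(self, block_id, data):
        with self._lock:
            self._blocks[block_id] = bytes(data)

    def read(self, block_id):
        with self._lock:
            return self._blocks.get(block_id, b"")
```

A single coarse lock is the simplest correct choice; a per-disk or per-block lock would reduce contention at the cost of more bookkeeping.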
Healthcare Claims Adjudication
Hybrid rules-engine + LLM system for claims processing.
A production-ready healthcare claims adjudication system combining deterministic rules with LLM-based reasoning for ambiguous cases. Features RAG-powered retrieval from historical claims, human-in-the-loop review UI, and comprehensive audit logging with decision explanations.
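The hybrid flow can be sketched as a rules pass that either decides or abstains, with abstentions handed to an LLM and flagged for human review. This is a simplified illustration, not the system's real rule set: the rules, field names, and the `llm_reasoner` callable are hypothetical, and the retrieval step is omitted.

```python
from dataclasses import asdict, dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Decision:
    outcome: str        # "approve" or "deny"
    source: str         # "rules" or "llm"
    explanation: str
    needs_review: bool  # True when a human should confirm the outcome


def apply_rules(claim: dict) -> Optional[Decision]:
    """Deterministic checks; returns None when the claim is ambiguous."""
    if claim["amount"] <= 0:
        return Decision("deny", "rules", "non-positive amount", False)
    if not claim.get("member_active", False):
        return Decision("deny", "rules", "member not active", False)
    if claim["amount"] <= 100 and claim.get("in_network", False):
        return Decision("approve", "rules", "small in-network claim", False)
    return None


def adjudicate(
    claim: dict,
    llm_reasoner: Callable[[dict], Tuple[str, str]],
    audit_log: list,
) -> Decision:
    """Rules first, LLM fallback, and an audit entry for every decision."""
    decision = apply_rules(claim)
    if decision is None:
        outcome, explanation = llm_reasoner(claim)
        # LLM output is treated as a suggestion and queued for review.
        decision = Decision(outcome, "llm", explanation, True)
    audit_log.append({"claim_id": claim["id"], **asdict(decision)})
    return decision
```

Passing the reasoner in as a callable keeps the rules path testable without any model access, and it makes the audit entry identical in shape regardless of which path produced the decision.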
Mosaic-Ward
Privacy-first patient monitoring — no raw video stored.
Real-time patient monitoring system using MediaPipe pose/gesture detection and Gemini multimodal LLM for clinical assessment. WebSocket streaming delivers sub-100ms latency with zero raw video retention — HIPAA-conscious by design. Abstracts patient data to skeletal landmarks before any inference call.
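The landmark abstraction step might look like the sketch below, which turns one frame's pose landmarks into a compact JSON message containing coordinates only. It is an assumed shape for illustration: the `Landmark` type mirrors the normalized x, y, z, and visibility fields that pose estimators such as MediaPipe report, while the payload keys, threshold, and function name are hypothetical.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Landmark:
    x: float           # normalized horizontal position
    y: float           # normalized vertical position
    z: float           # relative depth
    visibility: float  # confidence that the point is visible


def to_payload(landmarks, timestamp_ms, min_visibility=0.5, precision=3):
    """Build the message sent downstream: landmark coordinates, no pixels."""
    points = [
        {
            "i": index,
            "x": round(lm.x, precision),
            "y": round(lm.y, precision),
            "z": round(lm.z, precision),
        }
        for index, lm in enumerate(landmarks)
        if lm.visibility >= min_visibility
    ]
    return json.dumps({"t": timestamp_ms, "points": points})
```

Because the payload is built only from landmark fields, the raw frame can be discarded as soon as pose estimation finishes, and each message stays small enough to stream over a WebSocket.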
More on GitHub: SF Urban Analytics · Nittany AI ML Portfolio · AI Secretary Agent