I pretrained a GPT-style transformer on consumer hardware, published empirical findings on context window degradation, and build full-stack AI systems end-to-end.
An empirical study of context window degradation in MiniLLM — a 57.5M parameter GPT-style transformer trained from scratch on ~150M tokens using a single NVIDIA RTX 3050 (6GB). Four fine-tuned checkpoints evaluated through identical protocols across positional recall probes and multi-turn perplexity measurements, producing five novel findings about small-scale LLM behaviour.
GPT-style language model pretrained from scratch on 150M tokens, deployed as a production web app with real-time streaming.
End-to-end retrieval-augmented generation system supporting PDF, DOCX, and TXT ingestion with fully local inference.
Deep learning app that converts handwritten mathematical expressions into LaTeX — real-time inference via FastAPI.
Web app that auto-extracts structured contact info from PDFs, DOCX, and images using NLP + OCR.
This chatbot has my full profile embedded as a knowledge base — resume, research paper findings, all projects. Ask it anything.
// Powered by Gemini · Secured via Cloudflare Worker · Knowledge base: resume + paper + projects
Actively looking for research internships at AI labs. If you're working on language models, NLP systems, or AI infrastructure — let's talk.