Hi there, I'm
Shubham Misra
AI Engineer
Former AI Research Intern at Maker's Lab. I combine a foundation in Electrical Engineering with independently mastered CS fundamentals to create scalable, data-driven solutions.
Experience
SDE Intern
Tech Mahindra
- Engineered a high-concurrency Python translation pipeline for 10+ Indic languages, handling 800+ requests/min with LRU caching.
- Implemented Wanda pruning on Llama-3.2-3B, reducing model weights by 50% while maintaining generation quality.
- Developed an autonomous AI agent for automotive control achieving 95% accuracy and reducing manual interaction by 40%.
- Built a multimodal RAG pipeline accelerating design cycles by 60% with 85% accuracy.
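To give a flavor of the LRU caching mentioned in the first bullet, here is a minimal sketch using Python's standard `functools.lru_cache`. It is not the production pipeline: `_translate_uncached` is a hypothetical stand-in for the real model call, and the cache size is an arbitrary assumption.

```python
from functools import lru_cache


def _translate_uncached(text: str, target_lang: str) -> str:
    # Hypothetical stand-in for the expensive translation model call.
    return f"[{target_lang}] {text}"


@lru_cache(maxsize=10_000)  # assumed size; tune to the memory budget
def translate(text: str, target_lang: str) -> str:
    # Repeated (text, target_lang) pairs are served from the cache,
    # and the least recently used entries are evicted once it is full.
    return _translate_uncached(text, target_lang)


translate("hello", "hi")   # miss: calls the stand-in
translate("hello", "hi")   # hit: served from the cache
print(translate.cache_info())
```

Because the cache key is the full argument tuple, the same sentence requested in two target languages is cached as two separate entries.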
Skills
Languages
Frameworks
AI & ML
Tools & DevOps
Featured Projects
AirSense
I built a way to control your Mac with just hand gestures. AirSense turns your webcam into a motion sensor—pinch to pause Spotify, slide to adjust volume, swipe to skip tracks. I added smart power-saving that cuts CPU usage by 80% when you're not moving, plus precision controls so accidental gestures won't mess things up. It's touch-free computing that actually works.
Wanda-Python
I built a tool to shrink massive language models without breaking them. Wanda-Python implements Wanda pruning, which removes up to 50% of a model's weights by scoring each weight on both its magnitude and the size of the activations flowing through it, rather than on magnitude alone. I've packaged this as a simple command-line tool that works with LLaMA and OPT models, turning bloated billions-of-parameter models into lean machines that run faster without tanking performance.
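The scoring idea fits in a few lines of NumPy. This is an illustrative sketch rather than the tool's actual code: the function name `wanda_prune`, the toy matrices, and the choice to compare weights within each output row are assumptions made for the example.

```python
import numpy as np


def wanda_prune(weight: np.ndarray, activations: np.ndarray,
                sparsity: float = 0.5) -> np.ndarray:
    """Zero the lowest-scoring weights in each output row.

    weight:      (out_features, in_features)
    activations: (n_samples, in_features) calibration inputs
    """
    # L2 norm of each input feature across the calibration samples.
    act_norm = np.linalg.norm(activations, axis=0)
    # Score = |weight| * input activation norm (broadcast over rows).
    score = np.abs(weight) * act_norm
    n_prune = int(weight.shape[1] * sparsity)
    # Column indices of the n_prune lowest scores in every row.
    prune_idx = np.argsort(score, axis=1)[:, :n_prune]
    pruned = weight.copy()
    np.put_along_axis(pruned, prune_idx, 0.0, axis=1)
    return pruned


weight = np.array([[0.1, -2.0, 0.5, 1.0]])
activations = np.array([[10.0, 0.1, 1.0, 1.0]])
pruned = wanda_prune(weight, activations)
print(pruned)
```

In this toy case the largest weight, -2.0, is zeroed because its input is almost inactive, while the tiny weight 0.1 survives because its input is very active. Magnitude-only pruning would have made the opposite choice.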
YTMusic-to-Spotify
I solved the playlist migration headache between YouTube Music and Spotify. This Streamlit web app connects to both platforms via OAuth, lets you pick any YouTube Music playlist, and automatically matches and transfers every song to a fresh Spotify playlist. It handles the messy parts—fuzzy matching titles and artists, tracking what transferred and what didn't, and giving you a downloadable list of songs that couldn't be found so you can add them manually.
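Fuzzy matching of titles and artists can be sketched with Python's standard `difflib`. This is a simplified illustration, not necessarily how the app does it: the helper names, the 0.6/0.4 weighting, and the 0.8 threshold are all assumptions, and `candidates` stands in for whatever a Spotify search returns.

```python
import re
from difflib import SequenceMatcher


def normalize(text: str) -> str:
    text = text.lower()
    # Drop bracketed extras such as "(Official Video)" or "[Remastered]".
    text = re.sub(r"\(.*?\)|\[.*?\]", " ", text)
    # Replace punctuation with spaces, then collapse whitespace.
    text = re.sub(r"[^\w\s]+", " ", text)
    return " ".join(text.split())


def similarity(a: str, b: str) -> float:
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def best_match(title, artist, candidates, threshold=0.8):
    """Return the best (title, artist) candidate, or None if too dissimilar."""
    best, best_score = None, 0.0
    for cand_title, cand_artist in candidates:
        # Assumed weighting: the title counts slightly more than the artist.
        score = (0.6 * similarity(title, cand_title)
                 + 0.4 * similarity(artist, cand_artist))
        if score > best_score:
            best, best_score = (cand_title, cand_artist), score
    return best if best_score >= threshold else None


results = [("Blinding Lights", "The Weeknd"), ("Blinding", "Someone Else")]
print(best_match("Blinding Lights (Official Video)", "The Weeknd", results))
```

Songs for which `best_match` returns `None` are the ones that would end up on the "couldn't be found" list for manual follow-up.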
Recent Insights
Optimizing LLM Inference
Techniques for reducing latency and cost when deploying large language models.
Read Article
The Future of AI Agents
Exploring how autonomous agents are changing software development.
Read Article
Next.js Performance Tips
Best practices for building high-performance web applications with Next.js App Router.
Read Article