Machine Learning Engineer

Hi, I'm Moksh Shah

I design and build intelligent systems that bridge the gap between cutting-edge research and real-world impact. Passionate about turning complex data into elegant, scalable solutions.

scroll

Building intelligence,
one model at a time

I'm a Machine Learning Engineer with a deep interest in developing production-grade ML systems. My work spans the full lifecycle — from exploratory analysis and model development to deployment and monitoring at scale.

I thrive at the intersection of research and engineering, translating state-of-the-art papers into robust, maintainable code. Whether it's building recommendation engines, fine-tuning large language models, or designing feature pipelines, I focus on solutions that are both technically sound and business-aware.

When I'm not training models, you'll find me contributing to open-source projects, reading papers on arXiv, or exploring the latest in AI tooling and infrastructure.

2+
Years of Experience
10+
Models in Production
5+
Open-Source Contributions
2
Degrees from UIUC (Master's and Bachelor's in CS)

ML & AI

PyTorch TensorFlow Scikit-learn Hugging Face LangChain NLP Computer Vision MLOps

Engineering

Python SQL Docker AWS Spark FastAPI Git Kubernetes

Where I've worked

A track record of delivering impactful solutions across industries.

2024 — Present
Machine Learning Engineer
Rational CyPhy · Champaign, Illinois
Building the brain behind autonomous aircraft — I develop reinforcement learning systems that enable drones to navigate complex environments using real-time computer vision. My work has driven a 65% improvement in autonomous mission success rates through deep learning model optimization and systematic hyperparameter tuning. I architect the MLOps infrastructure that takes models from research notebooks to production-grade deployment, and push the boundaries of training data quality using neural radiance fields (NeRF) and Gaussian Splatting to generate photorealistic synthetic environments. More recently, I've been building full 3D reconstruction pipelines using COLMAP and Gaussian Splats, creating detailed visual models of real-world environments that support mission planning and spatial reasoning for autonomous systems
2022 - 2022
Software Engineer Intern
Fiserv · Chicago, Illinois
Designed and shipped UI enhancements and gamification features across client-facing fintech products, resulting in a 40% lift in user satisfaction and engagement. I also built an internal Angular application for automated REST API endpoint validation — a tool that enabled real-time monitoring and continuous integration testing, replacing hours of manual QA with a single dashboard.
2021 — 2021
Software Engineer Intern
Fiserv · Chicago, Illinois
Took ownership of performance engineering for enterprise-scale distributed systems, executing comprehensive load testing that validated 99% reliability under peak traffic conditions. I also architected and maintained Spring Boot microservices within the company's core API infrastructure, delivering modular backend solutions that directly supported critical business operations.
2018 — 2020
Researcher
Northwestern University · Evanston, Illinois
Conducted applied research at the intersection of game theory and computational economics, developing Python-based simulation frameworks to model strategic behavior in duopoly markets — specifically analyzing how competing firms optimize advertising-sponsored Wi-Fi provisions. In parallel, I built algorithmic pricing models to study consumer surplus dynamics in ride-sharing platforms, work that sharpened my foundation in mathematical modeling, optimization, and translating complex economic theory into code — skills that now directly inform how I approach ML system design.

Selected work

Highlights from personal and professional projects.

🧠
LLM Fine-Tuning Pipeline
End-to-end pipeline for fine-tuning large language models on domain-specific data using LoRA and QLoRA. Includes data preparation, training orchestration, evaluation, and deployment via a REST API.
PyTorchHugging FacePEFTFastAPI
View on GitHub
📊
Real-Time Anomaly Detection
Streaming anomaly detection system for time-series sensor data. Uses autoencoders and statistical methods with a Kafka-based ingestion layer and Grafana dashboards for alerting.
PythonKafkaTensorFlowDocker
View on GitHub
🔍
Semantic Search Engine
Built a semantic search system using sentence transformers and vector databases. Supports multi-modal queries across text and image embeddings with sub-100ms retrieval latency.
Sentence TransformersFAISSPineconeReact
View on GitHub

Let's connect

I'm currently open to new opportunities and always happy to chat about ML, engineering challenges, or potential collaborations. Feel free to reach out through any of the channels here.