Software Engineer at Fireworks AI

May 2025 - Current | Redwood City, CA
Joining Fireworks AI as a Software Engineer, working on cutting-edge AI infrastructure and model serving technologies.
Key Responsibilities:
- Part of a Forbes AI 50 startup, working closely with the PyTorch team, who are powering the AI revolution. [Video]
- Automatically provisioning the latest GPUs across 10+ clouds and 15+ regions for high availability, consistent performance, and seamless scaling, so developers can focus on building.
- Building and optimizing high-performance inference infrastructure for large language models (akin to vLLM and SGLang).
- Developing scalable solutions for model deployment and serving.
- Working with state-of-the-art AI/ML frameworks.
- Contributing to the platform that powers AI applications for thousands of developers.
Technology Stack:
AI/ML: PyTorch, TensorFlow, ONNX, Triton Inference Server
Infrastructure: Kubernetes, Docker, GPU orchestration
Languages: Python, Go, C++
Cloud: Multi-cloud deployment strategies
About Fireworks AI:
Fireworks AI is revolutionizing how developers build and deploy generative AI applications. The platform provides the fastest and most reliable API for running and fine-tuning open-source models, making advanced AI capabilities accessible to developers worldwide.
More details coming soon as I embark on this exciting new journey...