Software Engineer at Fireworks AI

May 2025 - Current | Redwood City, CA

Joining Fireworks AI as a Software Engineer, working on cutting-edge AI infrastructure and model serving technologies.

Key Responsibilities:

Part of Forbes AI 50 startup, working closely with pytorch team who are powering the AI revolution. [Video]
Automatically provision the latest GPUs across 10+ clouds and 15+ regions for high availability, consistent performance, and seamless scaling—so you can focus on building.
Building and optimizing high-performance inference infrastructure for large language models (akin to vLLM and SGLang)
Developing scalable solutions for model deployment and serving at scale
Working with state-of-the-art AI/ML frameworks
Contributing to the platform that powers AI applications for thousands of developers.

Technology Stack:

AI/ML: PyTorch, TensorFlow, ONNX, Triton Inference Server

Infrastructure: Kubernetes, Docker, GPU orchestration

Languages: Python, Go, C++

Cloud: Multi-cloud deployment strategies

About Fireworks AI:

Fireworks AI is revolutionizing how developers build and deploy generative AI applications. The platform provides the fastest and most reliable API for running and fine-tuning open-source models, making advanced AI capabilities accessible to developers worldwide.

More details coming soon as I embark on this exciting new journey...

Back to Timeline