Other Impact Areas Full-time

Software Engineer, ML Platform (ML Serving)

Zoox

Location

Remote

Type

Full-time

Posted

Feb 03, 2026

Mission

What you will drive

Core responsibilities:

  • Build the off-vehicle inference service powering Foundational models (LLMs & VLMs) and models that improve rider experiences
  • Lead the design, implementation, and operation of robust and efficient ML serving infrastructure for serving and monitoring ML models
  • Collaborate closely with cross-functional teams including ML researchers, software engineers, and data engineers to define requirements and align on architectural decisions
  • Enable junior engineers in the team to grow their careers by providing technical guidance and mentorship

Impact

The difference you'll make

This role enables innovations in ML and AI to make autonomous driving as seamless as possible, reducing the time from ideation to productionization of cutting-edge AI innovation for autonomous robotaxis that are safe, reliable, clean, and enjoyable for everyone.

Profile

What makes you a great fit

Required qualifications:

  • 4+ years of ML model serving infrastructure experience
  • Experience building large-scale model serving using GPU and/or high QPS, low latency serving use cases
  • Experience with GPU-accelerated inference using RayServe, vLLM, TensorRT, Nvidia Triton, or PyTorch
  • Experience working with cloud providers like AWS and working with K8s

Benefits

What's in it for you

No specific benefits, compensation, or salary information mentioned in the job posting.

About

Inside Zoox

Zoox is on a mission to reimagine transportation and ground-up build autonomous robotaxis that are safe, reliable, clean, and enjoyable for everyone. They are developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market.