Zoox Logo

Zoox

Engineering Manager, ML Training Platform

Posted 9 Days Ago
Be an Early Applicant
Hybrid
Foster City, CA
230K-315K
Senior level
Hybrid
Foster City, CA
230K-315K
Senior level
Manage the ML Training Platform at Zoox, leading a team to enhance autonomous driving technology through effective ML tools and infrastructure.
The summary above was generated by AI
Zoox is on a mission to reimagine transportation and ground-up build autonomous robotaxis that are safe, reliable, clean, and enjoyable for everyone. We are still in the early stages of deploying our robotaxis on public roads, and it is a great time to join Zoox and have a significant impact in executing this mission. The ML Platform team at Zoox plays a crucial role in enabling innovations in ML and CV to make autonomous driving as seamless as possible. 

The Opportunity
Are you excited to manage our ML Training Platform that enables autonomous driving? You will get to work across all ML teams within Zoox - Perception, Prediction, Planner, Simulation, Collision Avoidance, Data Science, etc., and have the opportunity to significantly push the boundaries of how ML is practiced within Zoox.
This team builds and operates the core part of the ML platform that powers model training at scale. We are responsible for developing and operating ML tools, deep learning frameworks, and distributed model training infrastructure to support foundational models and reinforcement learning. This team also owns the model repository and model lifecycle management tools used by our applied research teams for in- and off-vehicle ML use cases. You will lead a team of strong software engineers and act as a force multiplier for our internal customers. This team has a lot of growth opportunities as we expand our robotaxi deployments and venture into new ML domains. If you want to learn more about our stack behind autonomous driving, please look here.

In this role, you will

  • Vision: Develop and execute a strategic vision for our ML training platform, ensuring scalability, reliability, and performance to support large-scale Foundation and RL models.
  • Technical acumen: Lead the design, implementation, and operation of a robust and efficient ML training platform to enable the training, experimentation, validation, and monitoring of ML models.
  • Hiring: Attract, hire, and inspire a diverse world-class engineering team, fostering a culture of innovation, collaboration, and excellence.
  • Partnership: Collaborate closely with cross-functional teams, including ML researchers, software engineers, data engineers, and hardware engineers to define requirements and align on architectural decisions.
  • Mentorship: Enable the engineers in the team to grow their careers by providing the right opportunities along with clear and timely feedback.

Qualifications

  • 8+ years of total experience, including 3+ years of engineering management experience.
  • Excellent leadership skills with a demonstrated ability to build and manage high-performing engineering teams.
  • Experience enabling large-scale, cost-efficient distributed model training and ML compute infrastructure.
  • Experience with training frameworks such as PyTorch, Hugging Face, Ray, DeepSpeed, JAX, etc., leveraging GPUs, TPUs, or Trainium.
  • Experience building model lifecycle management tools and managing AWS costs for our ML needs.

Compensation
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary range for this position is $230,000 to $315,000. A sign-on bonus may be offered as part of the compensation package. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position. Zoox also offers a comprehensive package of benefits, including paid time off (e.g., sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

Vaccine Mandate
Employees working in this position will be required to have received a vaccine approved by the U.S. Food and Drug Administration and/or the World Health Organization. In addition, employees who are eligible for a COVID-19 booster vaccine (“Booster”) will be required to receive a Booster. Employees will be required to show proof of vaccination status upon receipt of a conditional offer of employment. That offer of employment will be conditioned upon, among other things, an Applicant’s ability to show proof of vaccination status. Please note the Company provides reasonable accommodations in accordance with applicable state, federal, and local laws.

Top Skills

AWS
Cv
Deepspeed
Gpus
Hugging Face
Jax
Ml
PyTorch
Ray
Tpus
Trainium

Similar Jobs

2 Hours Ago
In-Office
Costa Mesa, CA, USA
143K-213K Annually
Senior level
143K-213K Annually
Senior level
Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
As a Sr. Reliability Engineer, you will ensure products meet performance requirements, utilize reliability processes, and support engineering teams throughout the product lifecycle.
Top Skills: Fault Tree AnalysisFmeaFmecaPredictive ModelingReliability Analysis TechniquesWeibull Analysis
3 Hours Ago
Remote or Hybrid
2 Locations
87K-140K Annually
Senior level
87K-140K Annually
Senior level
Consumer Web • Digital Media • Edtech • Information Technology • Social Impact • Software
The Founding Account Executive will drive sales into university research offices, manage the full sales cycle, and provide customer feedback for product development.
Top Skills: Crm SoftwareSaaS
3 Hours Ago
In-Office or Remote
Palo Alto, CA, USA
115K-220K Annually
Senior level
115K-220K Annually
Senior level
Aerospace • Artificial Intelligence • Computer Vision • Software • Analytics • Defense • Big Data Analytics
Lead estimating efforts for U.S. Government and Commercial contracts, develop compliant estimating practices, train teams, and enhance processes for financial forecasting.
Top Skills: Financial Database SoftwareExcelMs PowerpointMs WordUsg Approved Proposal Pricing Tools

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account