General Motors Logo

General Motors

Senior ML Inference Engineer - Platform

Posted 56 Minutes Ago
Be an Early Applicant
Remote or Hybrid
Hiring Remotely in Austin, TX
129K-261K Annually
Senior level
Remote or Hybrid
Hiring Remotely in Austin, TX
129K-261K Annually
Senior level
The Senior ML Inference Engineer will design and operate a deployment platform for ML models onto autonomous vehicle hardware, collaborating with teams to enhance tools and address deployment issues.
The summary above was generated by AI
Description
About the Team
The Model Deployment & Inference Solutions team in GM AV deploys machine learning models from training frameworks (e.g. PyTorch) onto autonomous vehicle hardware. Our mission is two-fold: build the ML deployment platform that makes model rollouts fast and predictable, and optimize models so they meet the real-time latency and memory budgets required to run on-vehicle. Our work is on the critical path of GM's publicly committed launch of eyes-off (hands-free, eyes-free) autonomous driving in 2028, debuting on the Cadillac Escalade IQ, building on Super Cruise's billion-plus hands-free miles.
About the Role
This role sits in the team's Platform pillar. We own the unified ML deployment platform that automates the path from a trained model to inference on the vehicle, along with the developer-experience and agentic-tooling layer that makes deployment self-serve for every ML model development team at GM.
What you'll be doing (Responsibilities)
  • Design, build, and operate the ML deployment platform that automates the path from trained model to on-vehicle inference.

  • Drive cross-organization model deployments to the autonomous vehicle stack, partnering with model development teams to take high-value models from training to production on-vehicle.

  • Build agentic tools that diagnose and fix deployment-blocking issues, automating workflows currently performed manually by engineers.

  • Build the developer experience that ML model development teams use day to day: tooling, dashboards, automation, and observability.

  • Drive shift-left validation that surfaces deployment risk (compile, runtime, parity, latency) early in the model development cycle.

  • Build platform tools that integrate the work of our sister teams (kernels, compiler, reduced precision and parity) so their optimization wins land directly in the deployment workflow.

  • Partner with the team's Performance pillar and model development teams across the AV organization.

Your Skills & Abilities (Required Qualifications)
  • BS, MS, or PhD in Computer Science or a related technical field.

  • 3+ years of relevant industry experience.

  • Strong fundamentals and excellent coding ability in Python.

  • Experience building or operating production platform or infrastructure systems where reliability, observability, and extensibility matter.

  • Experience with ML model deployment, inference integration, model optimization workflows, or model serving infrastructure, with at least one prior context where you owned the path from a trained model to a running inference workload.

  • Experience using coding agents (Cursor, Claude Code, GitHub Copilot, or equivalent) as part of your engineering workflow.

  • Experience designing clean, well-tested software with clear interfaces and good abstractions.

  • Strong cross-team collaboration skills.

What Will Give You A Competitive Edge (Preferred Qualifications)
  • Experience building agentic or LLM-powered developer tooling.

  • Experience with ML or workflow orchestration frameworks (Airflow, Temporal, Flyte, Ray, Kubeflow, or equivalent).

  • Familiarity with the NVIDIA GPU stack at the integration level (CUDA-aware Python, TensorRT, Triton inference server, torch.compile, ONNX).

  • Experience with inference-serving frameworks (Triton, TorchServe, Ray Serve, vLLM) or edge-deployment toolchains.

  • Experience with low-latency or real-time systems.

  • Experience in autonomous vehicles, robotics, or other safety-critical ML deployment domains.

  • Open-source contributions to PyTorch, Ray, Airflow, Temporal, vLLM, TensorRT, or related projects.

  • 3+ years of relevant industry experience.

Compensation: The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.
  • The salary range for this role: is $128,700 to $261,300. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.
  • Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.
  • Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.

#GM-AV-1
This role is based remotely, but if the selected candidate lives within a specific mile radius of a GM hub, they will be expected to report to the location three times a week {or other frequency dictated by your manager}.
The selected candidate will be required to travel <25% for this role.
This job may be eligible for relocation benefits.
About GM
Our vision is a world with Zero Crashes, Zero Emissions and Zero Congestion and we embrace the responsibility to lead the change that will make our world better, safer and more equitable for all.
Why Join Us
We believe we all must make a choice every day - individually and collectively - to drive meaningful change through our words, our deeds and our culture. Every day, we want every employee to feel they belong to one General Motors team.
Total Rewards | Benefits Overview
From day one, we're looking out for your well-being-at work and at home-so you can focus on realizing your ambitions. Learn how GM supports a rewarding career that rewards you personally by visiting Total Rewards resources.
Non-Discrimination and Equal Employment Opportunities (U.S.)
General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. We strongly believe that providing an inclusive workplace creates an environment in which our employees can thrive and develop better products for our customers.
All employment decisions are made on a non-discriminatory basis without regard to sex, race, color, national origin, citizenship status, religion, age, disability, pregnancy or maternity status, sexual orientation, gender identity, status as a veteran or protected veteran, or any other similarly protected status in accordance with federal, state and local laws.
We encourage interested candidates to review the key responsibilities and qualifications for each role and apply for any positions that match their skills and capabilities. Applicants in the recruitment process may be required, where applicable, to successfully complete a role-related assessment(s) and/or a pre-employment screening prior to beginning employment. To learn more, visit How we Hire.
Accommodations
General Motors offers opportunities to all job seekers including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email us [email protected] or call us at 1-800-865-7580. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

General Motors Los Angeles, California, USA Office

Los Angeles, CA, United States

General Motors Pasadena, California, USA Office

General Motors Advanced Design and Innovation Campus Office

The teams at the General Motors Advanced Design and Innovation campus in Pasadena, CA, are charged with exploring future transportation, technology and consumer trends and creating conceptual mobility solutions that inspire and inform program teams across the company.

Similar Jobs at General Motors

55 Minutes Ago
Remote or Hybrid
129K-261K Annually
Senior level
129K-261K Annually
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The role involves developing quantization and compression strategies for AV models, conducting numerical sensitivity analyses, and creating automated tooling for low-precision deployments, emphasizing safety and performance.
Top Skills: AwqGptqModeloptNvidiaOnnxPt2EPyTorchQuipSmoothquantSparsegptTorchao
Yesterday
Remote or Hybrid
United States
Mid level
Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Buick GMC District Sales Manager oversees dealer relationships, enhances customer satisfaction, and drives vehicle sales through various initiatives and market analysis.
Yesterday
Remote or Hybrid
United States
135K-208K Annually
Expert/Leader
135K-208K Annually
Expert/Leader
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Staff Designer will lead the design of GM's e-commerce platform, enhancing user experience across web and mobile by driving visual, interaction, and motion design. This role involves collaboration with cross-functional teams and elevating design standards through prototyping and critique.
Top Skills: Adobe Creative SuiteFigma

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account