General Motors Logo

General Motors

Senior ML Engineer - Model Compression

Posted 55 Minutes Ago
Be an Early Applicant
Remote or Hybrid
Hiring Remotely in Austin, TX
129K-261K Annually
Senior level
Remote or Hybrid
Hiring Remotely in Austin, TX
129K-261K Annually
Senior level
The role involves developing quantization and compression strategies for AV models, conducting numerical sensitivity analyses, and creating automated tooling for low-precision deployments, emphasizing safety and performance.
The summary above was generated by AI
Description
About the Team
The Compression and Parity team in GM's Autonomous Vehicle (AV) Organization enables repeatable, high-velocity model deployments through principled and automated model compression under strict safety guarantees. We partner closely with model developers and deployment and infra engineers to ship numerically robust, low-latency models to the car, blending rigorous analysis with state-of-the-art methods and our own innovations.
About the Role
Over time, you will help grow and evolve the Compression and Parity function through the following:
  • Developing and iterating on quantization and compression strategies for our AV models, considering model numerical properties, safety and latency constraints, and hardware performance, and partnering on deployment of quantized models to NVIDIA-based AV hardware with our deployment, compiler, and kernel teams

  • Advancing our numerical sensitivity analyses to recommend safe compression policies per op/layer/block, using AV-relevant metrics (perception, trajectory, etc.) to evaluate compressed models, and collaborating with Embodied AI to support compression-aware modeling

  • Evolving sensitivity analysis, compression, and parity tooling into a connected, automated flow that makes low-precision deployments repeatable, reliable, and low-touch, with an emphasis on robust execution and maintainability

  • Bridging the gap between state-of-the-art model compression research and safety-constrained deployment while making strong technical contributions in cross-functional projects and educating others on best practices

Your Skills & Abilities (Required Qualifications)
  • Bachelor's degree in Computer Science, Electrical Engineering, Physics, Mathematics, Data Science / ML, or a closely related quantitative field (or equivalent experience)

  • 3+ years of industry experience focused on model optimization and deployment, with significant hands-on work in neural network quantization / model compression / efficient inference or relevant experience

  • Strong proficiency in PyTorch and experience with graph-level representations (e.g., PyTorch FX, ONNX) for capture and manipulation

  • Background in numerical linear algebra and optimization (conditioning, spectral properties, Jacobians, Hessians) and how they relate to quantization robustness

What Will Give You A Competitive Edge (Preferred Qualifications)
  • Master's or PhD degree in related quantitative fields

  • Deep experience with PTQ and QAT, compression frameworks (e.g., PT2E, ModelOpt, torchao) and advanced quantization algorithms (e.g., GPTQ, AWQ, SmoothQuant, QuIP, SparseGPT), as well as with building or extending quantization toolchains

  • Hands-on experience designing numerics observability and sensitivity tooling integrated into training or evaluation pipelines (logging ranges, saturation, quant noise, etc.)

  • A track record of collaboration, including leading cross-functional initiatives and mentoring others

  • Experience with additional compression techniques such as structured/unstructured pruning, low-rank decomposition, or knowledge distillation

  • Experience with perception and/or transformer-based models (e.g., multi-view encoders, BEV backbones, detection/segmentation heads, trajectory or planning networks), ideally in AV / ADAS

  • General understanding of kernel performance and optimization for reduced precision formats

  • Direct experience with specialized hardware accelerators for edge deployment on tight latency and memory budgets (automotive SoCs, robotics platforms, or similar)

  • Published research, open-source contributions, or other notable, intellectually curious work in quantization, compression, or efficient inference
  • 3+ years of industry experience focused on model optimization and deployment, with significant hands-on work in neural network quantization / model compression / efficient inference or relevant experience

Compensation: The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.
  • The salary range for this role: is $128,700 to $261,300. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.
  • Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.
  • Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.

#GM-AV-1
This role is based remotely, but if the selected candidate lives within a specific mile radius of a GM hub, they will be expected to report to the location three times a week {or other frequency dictated by your manager}.
The selected candidate will be required to travel <25% for this role.
This job may be eligible for relocation benefits.
About GM
Our vision is a world with Zero Crashes, Zero Emissions and Zero Congestion and we embrace the responsibility to lead the change that will make our world better, safer and more equitable for all.
Why Join Us
We believe we all must make a choice every day - individually and collectively - to drive meaningful change through our words, our deeds and our culture. Every day, we want every employee to feel they belong to one General Motors team.
Total Rewards | Benefits Overview
From day one, we're looking out for your well-being-at work and at home-so you can focus on realizing your ambitions. Learn how GM supports a rewarding career that rewards you personally by visiting Total Rewards resources.
Non-Discrimination and Equal Employment Opportunities (U.S.)
General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. We strongly believe that providing an inclusive workplace creates an environment in which our employees can thrive and develop better products for our customers.
All employment decisions are made on a non-discriminatory basis without regard to sex, race, color, national origin, citizenship status, religion, age, disability, pregnancy or maternity status, sexual orientation, gender identity, status as a veteran or protected veteran, or any other similarly protected status in accordance with federal, state and local laws.
We encourage interested candidates to review the key responsibilities and qualifications for each role and apply for any positions that match their skills and capabilities. Applicants in the recruitment process may be required, where applicable, to successfully complete a role-related assessment(s) and/or a pre-employment screening prior to beginning employment. To learn more, visit How we Hire.
Accommodations
General Motors offers opportunities to all job seekers including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email us [email protected] or call us at 1-800-865-7580. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

General Motors Los Angeles, California, USA Office

Los Angeles, CA, United States

General Motors Pasadena, California, USA Office

General Motors Advanced Design and Innovation Campus Office

The teams at the General Motors Advanced Design and Innovation campus in Pasadena, CA, are charged with exploring future transportation, technology and consumer trends and creating conceptual mobility solutions that inspire and inform program teams across the company.

Similar Jobs at General Motors

55 Minutes Ago
Remote or Hybrid
129K-261K Annually
Senior level
129K-261K Annually
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Senior ML Inference Engineer will design and operate a deployment platform for ML models onto autonomous vehicle hardware, collaborating with teams to enhance tools and address deployment issues.
Top Skills: AirflowCudaFlyteKubeflowOnnxPythonPyTorchRayRay ServeTemporalTensorrtTorchserveTritonTriton Inference ServerVllm
Yesterday
Remote or Hybrid
United States
Mid level
Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Buick GMC District Sales Manager oversees dealer relationships, enhances customer satisfaction, and drives vehicle sales through various initiatives and market analysis.
Yesterday
Remote or Hybrid
United States
135K-208K Annually
Expert/Leader
135K-208K Annually
Expert/Leader
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Staff Designer will lead the design of GM's e-commerce platform, enhancing user experience across web and mobile by driving visual, interaction, and motion design. This role involves collaboration with cross-functional teams and elevating design standards through prototyping and critique.
Top Skills: Adobe Creative SuiteFigma

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account