Sonatus

Staff DevOps/MLOps Engineer

Reposted 7 Days Ago
Hybrid
Sunnyvale, CA
169K-232K Annually
Senior level
The Staff DevOps/MLOps Engineer will design and build end-to-end DevOps and MLOps platforms, managing cloud infrastructure, CI/CD pipelines, and machine learning lifecycles to ensure efficient model deployment and monitoring.
The summary above was generated by AI

Join a high-performing team at Sonatus that’s redefining what cars can do in the era of Software-Defined Vehicles (SDV).

At Sonatus, we’re driving the transformation to AI-enabled software-defined vehicles. Traditional automotive software methods can’t keep pace with consumer expectations shaped by the mobile industry—where features evolve rapidly, update seamlessly, and improve continuously. That’s why leading OEMs trust Sonatus to accelerate this shift. Our technology is already in production across more than 5 million vehicles on the road today and rapidly expanding.

Headquartered in Sunnyvale, CA, with 250+ employees worldwide, Sonatus combines the agility of a fast-growing company with the scale and impact of an established partner. Backed by strong funding and proven by global deployment, we’re solving some of the most interesting and complex challenges in the industry. Join us and help redefine what’s possible as we shape the future of mobility.

Role Summary:

We are seeking a highly experienced and strategic Staff DevOps/MLOps Engineer to architect, build, and scale our end-to-end DevOps and MLOps platform. In this role, you will be responsible for the full cloud CI/CD pipeline, cloud infrastructure management, and the machine learning model lifecycle, from implementing the MLOps framework that moves models from experimentation to production with velocity and reliability to managing the serving infrastructure. You'll leverage deep expertise in DevOps, MLOps, and Site Reliability Engineering (SRE) to make critical decisions that span model training, serving, and monitoring. This is a key leadership position for a hands-on engineer who will define our model versioning, production observability, and infrastructure-as-code best practices.

Roles and Responsibilities:
  • Design and build the foundational, end-to-end DevOps and MLOps platform for our Generative AI systems, making critical decisions that span the evaluation, monitoring, and deployment of large language model-based systems.
  • Implement the full DevOps and MLOps framework. You will build the CI/CD/CT (Continuous Integration/Delivery/Training) automation that takes models from experiment to production with velocity and reliability.
  • Deploy, scale, and optimize our model serving infrastructure. You will manage GPU/NPU resources, minimize inference latency, and build robust monitoring to ensure our AI is always fast, accurate, and cost-effective.
  • Create a single, cohesive set of best practices for the entire AI lifecycle. Your work will define how we handle model versioning, infrastructure as code, and production observability in one seamless system.
Requirements:
  • A seasoned engineer with 8+ years of experience building and scaling production-grade cloud services and systems, with a strong focus on DevOps, MLOps, and/or SRE.
  • A "systems thinker" with a demonstrated ability to architect end-to-end solutions and a deep understanding of the full CI/CD pipeline and machine learning lifecycle.
  • Deep proficiency in Python and Infrastructure as Code (e.g., Terraform, Pulumi).
  • Experience with MLOps tools (e.g., MLflow, Kubeflow, Vertex AI) and production monitoring frameworks.
  • Ability to enforce reproducibility, approvals, audit trails, PII handling, model cards, and policy/compliance (e.g., privacy, evals, guardrails).
  • Experience with robust ML deployment systems (e.g., Kubeflow, MLflow, model servers like BentoML or TensorFlow Serving).
  • Hands-on experience with public cloud platforms (GCP, AWS, and/or Azure) and containerization/orchestration (Docker, Kubernetes).
  • Ability to package, version, and deploy software modules and AI models (batch and online) with blue/green or canary rollouts, build feature and model registries, and automate retraining.
  • Experience with PyTorch, vLLM, and GPUs is a plus.
  • Experience with tracking model and agentic drift is a plus.
  • Experience with tuning serving stacks (GPU/CPU utilization, batching, quantization).
  • Direct experience building and operationalizing systems for LLMs, especially RAG pipelines, is a plus.
  • Experience with vector databases (e.g., Pinecone, Weaviate) and embedding management from a deployment and scaling perspective is a plus.
Benefits:

Sonatus is a tight-knit team aligned around a unified vision. You can expect a strong engineering-oriented culture that focuses on building the best products and solutions for our customers. We embrace equality and diversity in all regards because respect is ingrained in our every fiber. Other benefits Sonatus offers include:

  • Stock option plan
  • Health care plan (Medical, Dental & Vision)
  • Retirement plan (401k, IRA)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Unlimited paid time off (Vacation, Sick & Public Holidays)
  • Family leave (Maternity, Paternity)
  • Flexible work arrangements
  • Free food & snacks in the office

The posted salary range is a general guideline and represents a good faith estimate of what Sonatus ("Company") could reasonably expect to pay for a base salary for this position. The pay offered to a selected candidate will be determined based on factors such as (but not limited to) the scope and responsibilities of the position, the qualifications of the selected candidate, departmental budget availability, geographic location and external market pay for comparable jobs. The Company reserves the right to modify this range in the future, as needed, as market conditions change.

Pay range for this role
$168,500 - $232,000 USD
Sonatus is a fast-paced and innovative company, and we are seeking team members who are passionate about making a difference. If you are ready to take your career to the next level, we highly encourage you to apply.
 
To all recruitment agencies: Sonatus, Inc. ("Sonatus") does not accept unsolicited agency resumes. Please do not forward resumes to our careers alias or to other Sonatus employees. Sonatus is not responsible for any fees associated with unsolicited activities.

Top Skills

AWS
Azure
Docker
GCP
Kubeflow
Kubernetes
MLflow
Pulumi
Python
Terraform
Vertex AI
