Bonfy.AI

Senior Machine Learning Operations (MLOps) Engineer

Posted 25 Days Ago
In-Office
Mountain View, CA
70K-120K
Senior level
As a Senior MLOps Engineer, you'll optimize ML performance, build inference systems, and collaborate cross-functionally to ensure effective AI deployment.
The summary above was generated by AI

Bonfy.AI is building the trust layer for generative AI. Our Adaptive Content Security platform detects and mitigates subtle risks embedded in large language model (LLM) outputs before they reach users. From hallucinations to hidden data leaks, we enable enterprises to deploy GenAI confidently, without compromising truth, privacy, or reputation.

We are model-agnostic, outcome-driven, and unapologetically rigorous. Our customers include leading Fortune 500 teams working in high-stakes sectors where trust is not optional.

Why This Role Matters

We need an MLOps Engineer to optimize our GPU-accelerated ML infrastructure for performance and cost efficiency. Working with our existing Sr. DevOps and Sr. SRE teams, you'll focus on the specialized ML optimization challenges that require deep machine learning expertise.

  • GPU & Cost Optimization: Design optimal GPU configurations and ML deployment strategies to maximize performance while minimizing cloud costs.
  • ML Performance Tuning: Optimize model serving, memory management, and inference pipelines for production LLM workloads. You will also work directly with models: customize prompts, write production-grade pre- and post-processing code to improve accuracy and speed, and implement new model functionality in the system.
  • DevOps Collaboration: Work with our Sr. DevOps/SRE teams to implement ML-specific solutions and monitoring

What We're Looking For

  • ML Infrastructure Optimization: 5+ years optimizing production ML systems with focus on GPU utilization and cost management
  • GPU & LLM Expertise: Deep understanding of GPU architectures, memory management, and LLM inference optimization
  • Python + DevOps Integration: Expert Python programming with experience working alongside DevOps/SRE teams on ML-specific solutions
  • Bonus: Experience at GPU-focused ML companies (SambaNova, NVIDIA, etc.) or with high-performance ML serving frameworks

Why Join Us

  • Collaborative Impact: Work with our existing Sr. DevOps and Sr. SRE teams to solve ML-specific challenges that require specialized expertise
  • Technical Depth: Focus purely on cutting-edge ML optimization problems without getting pulled into general infrastructure work
  • High Autonomy: Direct collaboration with engineering leadership in a fast-paced, technically rigorous environment
  • Competitive Package: Strong salary, equity, comprehensive benefits, and flexible hybrid work model
    Bonfy.AI — Truth. Security. Intelligence.

Top Skills

GPU Optimization
GPU Profiling
LLMs
Modular ML Framework
NLP
Python
Vector Databases
