Bonfy.AI is building the trust layer for generative AI. Our Adaptive Content Security platform detects and mitigates subtle risks embedded in large language model (LLM) outputs before they reach users. From hallucinations to hidden data leaks, we enable enterprises to deploy GenAI confidently, without compromising truth, privacy, or reputation.
We are model-agnostic, outcome-driven, and unapologetically rigorous. Our customers include leading Fortune 500 teams working in high-stakes sectors where trust is not optional.
Why This Role Matters
We need an MLOps Engineer to optimize our GPU-accelerated ML infrastructure for performance and cost efficiency. Working with our existing Sr. DevOps and Sr. SRE teams, you'll focus on the specialized ML optimization challenges that require deep machine learning expertise.
- GPU & Cost Optimization: Design optimal GPU configurations and ML deployment strategies to maximize performance while minimizing cloud costs.
- ML Performance Tuning: Optimize model serving, memory management, and inference pipelines for production LLM workloads. You will also work with models and customize prompts, write pre- and post-processing methods to improve accuracy and speed (production coded), and implement new models functionality in the system.
- DevOps Collaboration: Work with our Sr. DevOps/SRE teams to implement ML-specific solutions and monitoring
What We're Looking For
- ML Infrastructure Optimization: 5+ years optimizing production ML systems with focus on GPU utilization and cost management
- GPU & LLM Expertise: Deep understanding of GPU architectures, memory management, and LLM inference optimization
- Python + DevOps Integration: Expert Python programming with experience working alongside DevOps/SRE teams on ML-specific solutions
- Bonus: Experience at GPU-focused ML companies (SambaNova, NVIDIA, etc.) or with high-performance ML serving frameworks
Why Join Us
- Collaborative Impact: Work with our existing Sr. DevOps and Sr. SRE teams to solve ML-specific challenges that require specialized expertise
- Technical Depth: Focus purely on cutting-edge ML optimization problems without getting pulled into general infrastructure work
- High Autonomy: Direct collaboration with engineering leadership in a fast-paced, technically rigorous environment
- Competitive Package: Strong salary, equity, comprehensive benefits, and flexible hybrid work model
Bonfy.AI — Truth. Security. Intelligence.
Top Skills
Similar Jobs
What you need to know about the Los Angeles Tech Scene
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering