Oumi Logo

Oumi

ML Performance Engineer

Posted Yesterday
Be an Early Applicant
In-Office
9 Locations
100K-220K Annually
Mid level
In-Office
9 Locations
100K-220K Annually
Mid level
The ML Performance Engineer will optimize and accelerate AI models' training and inference, focusing on kernel optimization, memory management, and performance profiling.
The summary above was generated by AI
About Oumi

Why we exist: Oumi is on a mission to make frontier AI truly open for all. We are founded on the belief that AI will have a transformative impact on humanity, and that developing it collectively, in the open, is the best path forward to ensure that it is done efficiently and safely.

What we do: Oumi provides an all-in-one platform to build state-of-the-art AI models, end to end, from data preparation to production deployment, empowering innovators to build cutting-edge models at any scale. Oumi also develops open foundation models in collaboration with academic collaborators and the open community.

Our Approach: Oumi is fundamentally an open-source first company, with open-collaboration across the community as a core principle. Our work is:

  • Open Source First: All our platform and core technology is open source

  • Research-driven: We conduct and publish original research in AI, collaborating with our community of academic research labs and collaborators

  • Community-powered: We believe in the power of open-collaboration and welcome contributions from researchers and developers worldwide

Role Overview

The ML Performance Engineer will be an integral part of Oumi's research team, focusing on optimizing and accelerating training and inference with AI models. This role involves developing efficient CUDA/Triton kernels,  contributing to open-source projects, and collaborating with researchers and engineers to improve model performance. Engineers at Oumi will work on various aspects of model acceleration including kernel optimization, memory management, and performance profiling.

What you’ll bring:

  • ML Performance: Demonstrated experience optimizing models, training & inference pipelines, and familiarity with profiling tools (NSight, nvprof)

  • Programming Skills: Strong programming skills in one of Python, C++ or Rust

  • Systems Knowledge: familiarity with low-level operating systems foundations, PyTorch internals, GPU architectures is highly desirable

  • ML Expertise: Deep understanding of machine learning and deep learning concepts, with specific knowledge of large language models (LLMs).

  • Open Source: Familiarity with open-source projects and a passion for contributing to the open-source community.

  • Values: Share Oumi's values: Beneficial for all, Customer-obsessed, Radical Ownership, Exceptional Teammates, Science-grounded.

Benefits
  • Competitive salary: $100,000 - $220,000

  • Equity in a high-growth startup

  • Comprehensive health, dental and vision insurance

  • 21 days PTO

  • Regular team offsites and events

Top Skills

C++
Cuda
Nsight
Nvprof
Python
PyTorch
Rust
Triton

Similar Jobs

17 Minutes Ago
Hybrid
2 Locations
Junior
Junior
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Develop thermal management control and diagnostics software for EV vehicles, collaborating with engineering teams and participating in Agile ceremonies.
Top Skills: CGitMatlab Simulink
17 Minutes Ago
Hybrid
Markham, ON, CAN
Expert/Leader
Expert/Leader
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Lead software strategy for Infotainment and Software Defined Vehicle solutions, mentor engineers, and ensure software quality across complex systems.
Top Skills: AaosAndroidAndroid AospC++Embedded SystemsJavaLinuxQnx/Rtos
32 Minutes Ago
Remote or Hybrid
2 Locations
Mid level
Mid level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
As a Software Engineer, you'll deliver software solutions within an Agile team, working on features and system enhancements while maintaining quality and efficiency.
Top Skills: .Net 5/6AngularC#ConfluenceCypressGitlabGoJIRAKubernetesReactSQL ServerTypescriptVisual Studio

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account