Luma AI Logo

Luma AI

Senior Machine Learning Engineer - Hardware Abstractions & Performance Optimization

Posted 18 Days Ago
Remote
Hiring Remotely in USA
220K-300K Annually
Senior level
Remote
Hiring Remotely in USA
220K-300K Annually
Senior level
Develop and optimize multimodal AI systems by improving efficiency across various hardware platforms, benchmarking, and collaborating with teams.
The summary above was generated by AI

Luma’s mission is to build multimodal AI to expand human imagination and capabilities. We believe that multimodality is critical for intelligence. To go beyond language models and build more aware, capable and useful systems, the next step function change will come from vision. So, we are working on training and scaling up multimodal foundation models for systems that can see and understand, show and explain, and eventually interact with our world to effect change.

We are looking for engineers with significant experience maintaining & designing highly efficient systems and code that can be optimized to run on multiple hardware platforms, bringing our state-of-the-art models to as many people at the best performance per dollar.

Responsibilities
  • Ensure efficient implementation of models & systems with a focus on designing, maintaining, and writing abstractions that scale beyond NVIDIA/CUDA hardware.

  • Identify and remedy efficiency bottlenecks (memory, speed, utilization, communication) by profiling and implementing high-performance PyTorch code, deferring to Triton or similar kernel-level languages as necessary.

  • Benchmarking our products across a variety of hardware & software to help the product team understand the optimal tradeoffs between latency, throughput and cost at various degrees of parallelism.

  • Work together with our partners to help them identify bottlenecks and push forward new iterations of hardware and software.

  • Work closely together with the rest of the research team to ensure systems are planned to be as efficient as possible from start to finish and raise potential issues for hardware integration.

Must have experience
  • Experience optimizing for memory, latency and throughput in Pytorch.

    • Bonus: experience with non-NVIDIA systems

  • Experience using torch.compile / torch.XLA.

  • Experience benchmarking and profiling GPU & CPU code in Pytorch for optimal device utilization (examples: torch profiler, memory profilers, trace viewers, custom tooling).

  • Experience building tools & abstractions to ensure models run optimally on different hardware and software stacks .

  • Experience working with transformer models and attention implementations.

  • Experience with parallel inference, particularly with tensor parallelism, pipeline parallelism.

Good to have experience
  • Experience with high-performance Triton/CUDA and writing custom PyTorch kernels and ops. Top candidates will be able to write fused kernels for common hot paths, understand when to make use of lower level features like tensor cores or warp intrinsics, and will understand where these tools can be most impactful.

  • Experience writing high-performance parallel C++. Bonus if done within an ML context with PyTorch, like for data loading, data processing, inference code

  • Experience building inference / demo prototype code (incl. Gradio, Docker etc.)

Top Skills

C++
Cuda
Docker
Gradio
PyTorch
Triton

Similar Jobs

11 Minutes Ago
Remote or Hybrid
2 Locations
76K-122K Annually
Mid level
76K-122K Annually
Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Media Solutions Technologist will manage and integrate media technologies, support media planning, audience targeting, and collaborate across teams to enhance GM's media solutions.
Top Skills: ComscoreGoogle Marketing PlatformInnovidMediaocean
12 Minutes Ago
Remote or Hybrid
2 Locations
76K-122K Annually
Mid level
76K-122K Annually
Mid level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
The Media Solutions Strategist designs media capabilities, manages relationships, develops strategies, and collaborates with various stakeholders to enhance media operations.
Top Skills: Audience ManagementExcelMedia Planning And Buying TechnologiesMedia TechnologyMicrosoft WordPowerPointProgrammatic Media
4 Hours Ago
Easy Apply
Remote or Hybrid
12 Locations
Easy Apply
170K-247K
Senior level
170K-247K
Senior level
Fintech • HR Tech
As a Senior Staff Software Engineer, you will design and develop RESTful APIs, mentor junior engineers, and lead technical projects for time management solutions.
Top Skills: AWSAzureGCPGoJavaPythonRuby On Rails

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account