NVIDIA Logo

NVIDIA

Product Manager, AI Platform Kernels and Communication Libraries

Reposted 16 Days Ago
Be an Early Applicant
In-Office
Santa Clara, CA
144K-259K Annually
Senior level
In-Office
Santa Clara, CA
144K-259K Annually
Senior level
The Product Manager will lead the development of GPU-focused AI inference products, collaborating with engineers and open-source communities to enhance kernel and communication libraries.
The summary above was generated by AI

NVIDIA's AI Software Platforms team seeks a technical product manager to accelerate next-generation inference deployments through innovative libraries, communication runtimes, and kernel optimization frameworks. This role bridges low-level GPU programming with ecosystem-wide developer enablement for products including CUTLASS, cuDNN, NCCL, NVSHMEM, and open-source contributions to Triton/FlashInfer.

As NVIDIA Product Managers, our goal is to enable developers to be successful on the NVIDIA Platform, and push the boundaries of what is possible with their AI deployments! For Inference, we are the champions inside NVIDIA for AI developers looking to accelerate their deployments on GPUs. We work directly with developers inside and outside of the company to identify key improvements, create roadmaps, and stay alert on the inference landscape. We also work with NVIDIA leaders to define clear product strategy, and marketing team teams to build go-to-market plans. The Product Management organization at NVIDIA is a small, strong, and impactful group. We focus on enabling deep learning across all GPU use cases and providing extraordinary solutions for developers. We are seeking a rare blend of product skills, technical depth, and drive to make NVIDIA great for developers. Does that sounds familiar? If so, we would love to hear from you!

What you'll be doing:

  • Architect developer-focused products that simplify high-performance inference and training deployment across diverse GPU architectures.

  • Define the multi-year strategy for kernel and communication libraries by analyzing performance bottlenecks in emerging AI workloads.

  • Collaborate with CUDA kernel engineers to design intuitive, high-level abstractions for memory and distributed execution.

  • Partner with open-source communities like Triton and FlashInfer to shape and drive ecosystem-wide roadmaps.

What we need to see:

  • 5+ years of technical PM experience shipping developer products for GPU acceleration, with expertise in HPC optimization stacks.

  • Expert-level understanding of CUDA execution models and multi-GPU protocols, with a proven track record to translate hardware capabilities into software roadmaps.

  • BS or MS or equivalent experience in Computer Engineering or demonstrated expertise in parallel computing architectures.

  • Strong technical interpersonal skills with experience communicating complex optimizations to developers and researchers.

Ways to stand out from the crowd:

  • PhD or equivalent experience in Computer Engineering or a related technical field.

  • Contributed to performance-critical open-source projects like Triton, FlashAttention, or TVM with measurable adoption impact

  • Crafted GitHub-first developer tools with >1k stars or similar community engagement metrics

  • Published research on GPU kernel optimization, collective communication algorithms, or ML model serving architectures

  • Experience building cost-per-inference models incorporating hardware utilization, energy efficiency, and cluster scaling factors

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 144,000 USD - 218,500 USD for Level 3, and 168,000 USD - 258,750 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 29, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Cuda
Cudnn
Cutlass
Flashinfer
Gpu
Hpc
Nccl
Nvshmem
Triton

Similar Jobs

5 Hours Ago
Hybrid
8 Locations
240K-359K Annually
Senior level
240K-359K Annually
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
The Director of Growth Strategy & Operations will lead the global partnerships strategy, driving revenue growth through strategic collaborations, enabling product integrations, and executing GTM initiatives while influencing senior leadership.
Top Skills: AIAutomationGtm Analytics Tools
9 Hours Ago
Hybrid
Beverly Hills, CA, USA
22-28 Hourly
Junior
22-28 Hourly
Junior
Fintech • Financial Services
As a Teller, you will handle customer transactions, provide financial services support, engage with customers, and minimize operational risks while meeting the financial needs of the community.
9 Hours Ago
Hybrid
Panorama Heights, CA, USA
23-31 Hourly
Entry level
23-31 Hourly
Entry level
Fintech • Financial Services
The Associate Personal Banker engages with customers to provide banking services, assist with account openings, and develop product knowledge to meet customer needs.

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account