NVIDIA

Product Manager, AI Platform Kernels and Communication Libraries

Reposted 16 Days Ago

Be an Early Applicant

In-Office

Santa Clara, CA

144K-259K Annually

Senior level

In-Office

Santa Clara, CA

144K-259K Annually

Senior level

The Product Manager will lead the development of GPU-focused AI inference products, collaborating with engineers and open-source communities to enhance kernel and communication libraries.

The summary above was generated by AI

NVIDIA's AI Software Platforms team seeks a technical product manager to accelerate next-generation inference deployments through innovative libraries, communication runtimes, and kernel optimization frameworks. This role bridges low-level GPU programming with ecosystem-wide developer enablement for products including CUTLASS, cuDNN, NCCL, NVSHMEM, and open-source contributions to Triton/FlashInfer.

As NVIDIA Product Managers, our goal is to enable developers to be successful on the NVIDIA Platform, and push the boundaries of what is possible with their AI deployments! For Inference, we are the champions inside NVIDIA for AI developers looking to accelerate their deployments on GPUs. We work directly with developers inside and outside of the company to identify key improvements, create roadmaps, and stay alert on the inference landscape. We also work with NVIDIA leaders to define clear product strategy, and marketing team teams to build go-to-market plans. The Product Management organization at NVIDIA is a small, strong, and impactful group. We focus on enabling deep learning across all GPU use cases and providing extraordinary solutions for developers. We are seeking a rare blend of product skills, technical depth, and drive to make NVIDIA great for developers. Does that sounds familiar? If so, we would love to hear from you!

What you'll be doing:

Architect developer-focused products that simplify high-performance inference and training deployment across diverse GPU architectures.
Define the multi-year strategy for kernel and communication libraries by analyzing performance bottlenecks in emerging AI workloads.
Collaborate with CUDA kernel engineers to design intuitive, high-level abstractions for memory and distributed execution.
Partner with open-source communities like Triton and FlashInfer to shape and drive ecosystem-wide roadmaps.

What we need to see:

5+ years of technical PM experience shipping developer products for GPU acceleration, with expertise in HPC optimization stacks.
Expert-level understanding of CUDA execution models and multi-GPU protocols, with a proven track record to translate hardware capabilities into software roadmaps.
BS or MS or equivalent experience in Computer Engineering or demonstrated expertise in parallel computing architectures.
Strong technical interpersonal skills with experience communicating complex optimizations to developers and researchers.

Ways to stand out from the crowd:

PhD or equivalent experience in Computer Engineering or a related technical field.
Contributed to performance-critical open-source projects like Triton, FlashAttention, or TVM with measurable adoption impact
Crafted GitHub-first developer tools with >1k stars or similar community engagement metrics
Published research on GPU kernel optimization, collective communication algorithms, or ML model serving architectures
Experience building cost-per-inference models incorporating hardware utilization, energy efficiency, and cluster scaling factors

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 144,000 USD - 218,500 USD for Level 3, and 168,000 USD - 258,750 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 29, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Cuda

Cudnn

Cutlass

Flashinfer

Gpu

Hpc

Nccl

Nvshmem

Triton

Similar Jobs

Square

Director of Growth Strategy & Operations - Global Partnerships

5 Hours Ago

Hybrid

240K-359K Annually

Senior level

240K-359K Annually

Senior level

eCommerce • Fintech • Hardware • Payments • Software • Financial Services

The Director of Growth Strategy & Operations will lead the global partnerships strategy, driving revenue growth through strategic collaborations, enabling product integrations, and executing GTM initiatives while influencing senior leadership.

Top Skills: AIAutomationGtm Analytics Tools

Wells Fargo

Teller Full Time Beverly Hills

9 Hours Ago

Hybrid

Beverly Hills, CA, USA

22-28 Hourly

Junior

22-28 Hourly

Junior

Fintech • Financial Services

As a Teller, you will handle customer transactions, provide financial services support, engage with customers, and minimize operational risks while meeting the financial needs of the community.

Wells Fargo

Personal Banker Bilingual Spanish

9 Hours Ago

Hybrid

Panorama Heights, CA, USA

23-31 Hourly

Entry level

23-31 Hourly

Entry level

Fintech • Financial Services

The Associate Personal Banker engages with customers to provide banking services, assist with account openings, and develop product knowledge to meet customer needs.

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
Key Industries: Artificial intelligence, adtech, media, software, game development
Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering