NVIDIA Logo

NVIDIA

Senior Product Manager - Inference Benchmarking

Reposted 4 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
208K-328K Annually
Senior level
In-Office or Remote
2 Locations
208K-328K Annually
Senior level
The Senior Product Manager will guide AI Inference product strategies, manage external partnerships, and drive product introduction and enhancement processes.
The summary above was generated by AI

NVIDIA has become the platform upon which every new AI-powered application is built. From healthcare research applications to autonomous vehicles, or voice-recognition systems, there is a need to simplify and deliver predictability for AI applications and workflows ... and NVIDIA is right in the center of this revolution. Businesses are often challenged with balancing the performance and costs of inference workloads. Token economics need to be matched with performance and user experience. NVIDIA technologies simplify model deployment while optimizing cost and performance for AI inference workloads. The role of this product manager is to work across NVIDIA to build the right kind of benchmarks, products, and tools that can help customers understand true performance & TCO of Inference.

What you will be doing:

  • Serve as a Subject Matter Expert on AI Inference: Maintain a deep understanding of the entire inference stack, including performance, scaling across workloads, and emerging technologies like disaggregated serving, to guide technical and product strategy.

  • Drive Product Strategy through Market and User Insights: Steer product evolution and partnership direction by conducting thorough research on market trends, competitor activities, and customer feedback, translating these insights into actionable development plans and innovative ideas.

  • Lead Partner Collaboration and Project Execution: Actively manage external partnerships by leading project planning, defining specific tasks and deliverables, serving as the primary liaison for communication, and educating partners on product value to ensure alignment and swift issue resolution.

  • Spearhead Cross-Functional Product Introduction: Drive the new product introduction and transition processes by collaborating seamlessly with engineering, design, operations, sales, and marketing teams, ensuring a unified approach from conception to launch.

  • Enhance Product and User Experience: Play a key role in the product development lifecycle by contributing to the ideation, design, and testing of user experiences, ensuring the final product meets and exceeds customer and partner expectations.

What we need to see:

  • BS or MS in Computer Science, Computer Engineering, or a related team (or equivalent experience)

  • 12+ years of product‑management experience in enterprise technology.

  • Be a subject matter expert on Inference. Understand different components of the inference stack, how they perform, scale, and correlate to workloads.

  • Ability to articulate trade‑offs among latency, throughput, cost, and reliability to both engineering and executive audiences.

  • Strong cross‑functional execution: writes clear specs and PRDs, produces GTM collateral, and leads agile processes.

Ways to stand out from the crowd:

  • Masters/PhD or Expertise in distributed systems or Demonstrated experience in Inference.

  • Experience with Inference (within the stack or ecosystem) and integrating with enterprise platforms; deployments at modern data‑center scale;

We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our elite engineering teams are growing fast. NVIDIA is widely considered to be one of the industry's most desirable employers. NVIDIA is at the center of Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're looking for a challenge, thrives in an ambiguous environment and shares our passion for technology, we want to hear from you. We are looking for great people to help us accelerate the next wave of artificial intelligence.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 208,000 USD - 327,750 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 9, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

AI
Distributed Systems
Inference

Similar Jobs

35 Minutes Ago
Remote or Hybrid
San Diego, CA, USA
122K-213K Annually
Senior level
122K-213K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead change management initiatives for the finance organization, focusing on training, communications, and user experience to drive successful digital transformation and enhance operational efficiency.
Top Skills: Adobe CaptivateAdobe Creative CloudAnaplanArticulate Storyline 360BlacklineCamtasiaKyribaExcelMicrosoft OutlookMicrosoft PowerpointMicrosoft WordSAP
4 Hours Ago
Remote
US
22-22 Hourly
Entry level
22-22 Hourly
Entry level
Consumer Web • eCommerce • Machine Learning • Professional Services • Software • Sports • Analytics
The PSA Hobby Concierge Representative will assist customers at submission centers by intake and secure processing of collectibles, resolve inquiries, and improve customer experience.
5 Hours Ago
Remote or Hybrid
2 Locations
60K-140K Annually
Senior level
60K-140K Annually
Senior level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Develop integrated plans for cross-functional teams, manage technical initiatives, identify risks, clarify dependencies, and improve processes for success.
Top Skills: Ai ToolsJIRAMs Copilot

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account