NVIDIA Logo

NVIDIA

GPU Kernel Compiler Engineer, AI Inference

Posted 12 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
148K-288K Annually
Mid level
In-Office or Remote
2 Locations
148K-288K Annually
Mid level
In this role, you will develop and optimize GPU kernels for AI inference workloads, analyze AI models for performance bottlenecks, and collaborate across teams to enhance productivity and efficiency in GPU programming.
The summary above was generated by AI

NVIDIA’s AI and GPU software is at the forefront of computing fueling breakthroughs across deep learning, LLMs, and intelligent applications. Our team is building solutions for rapid development and deployment of GPU kernels for AI systems. We take the latest AI models, rigorously analyze them, develop and deploy high-performance GPU kernels that define model performance and integrate the derived techniques and methodologies into the tools that automate this process.

This role is a unique opportunity to shape the next generation of AI performance and efficiency. You will work hands-on with emerging AI models, collaborating across compiler, AI inference, and model performance teams. The focus is on building programming solutions that can be applied to concrete AI inference use cases to deliver real-world performance and development efficiency wins.

What you will be doing:

  • Analyze state-of-the-art AI models, identifying key performance bottlenecks and opportunities at the kernel level.

  • Develop, optimize, and evaluate both hand-tuned and compiler-generated kernels for inference workloads, balancing speed and flexibility.

  • Design and build high-level DSLs and innovative compiler infrastructure to increase kernel developer productivity while achieving near peak performance.

  • Collaborate with model AI inference and compiler teams to iterate on kernel fusion, auto tuning, and sophisticated GPU programming techniques.

  • Benchmark performance across real workloads, diagnose root causes, and rapidly deploy optimizations that maximize hardware utilization on NVIDIA platforms.

What we need to see:

  • Bachelor’s, master’s or PhD degree in Computer Science, Computer Engineering or related field, or equivalent experience.

  • At least 3+ years Strong C++ and/or Python programming skills for system and performance engineering.

  • Understanding of GPU architecture and proficiency in CUDA programming.

  • Intellectual curiosity and interest to solve exciting problems and deliver practical results in production environments.

Ways to stand out from the crowd:

  • Experience designing, developing and optimizing high-efficiency GPU kernels for modern AI workloads.

  • Experience building compilers, domain-specific languages, or automatic optimization systems

  • Familiarity with popular compiler, GPU programming and AI frameworks such as MLIR, LLVM, PyTorch, XLA, Triton or Cutlass.

  • Experience with AI/ML inference workloads and model performance analysis.

  • Strong communication skills and ability to collaborate in a cross-team environment.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until October 27, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C++
Cuda
Cutlass
Llvm
Mlir
Python
PyTorch
Triton
Xla

Similar Jobs

13 Minutes Ago
In-Office or Remote
San Francisco, CA, USA
Senior level
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
The Lead Product Designer will lead the design of Jira, focusing on AI integration, user experience, and collaboration across teams, driving product value.
Top Skills: AIJIRAUx DesignVisual Design
18 Minutes Ago
Remote or Hybrid
United States
140K-170K Annually
Senior level
140K-170K Annually
Senior level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
Lead a team of application security engineers, oversee security practices and tooling, develop secure solutions, and collaborate cross-functionally across teams.
Top Skills: .Net CoreDastDockerGCPGoHelmIastKubernetesPythonRaspSastSca
19 Minutes Ago
Remote or Hybrid
United States
65K-100K Annually
Mid level
65K-100K Annually
Mid level
Cloud • Insurance • Payments • Software • Business Intelligence • App development • Big Data Analytics
The Inside Sales Consultant will manage the sales cycle, consult with independent agents, and exceed sales quotas while leveraging SaaS solutions.
Top Skills: ExcelMicrosoft OutlookMicrosoft PowerpointMicrosoft WordSaaSSalesforce

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account