NVIDIA Logo

NVIDIA

Senior Research Engineer - Enterprise Products

Reposted 15 Days Ago
Be an Early Applicant
In-Office or Remote
3 Locations
184K-357K
Senior level
In-Office or Remote
3 Locations
184K-357K
Senior level
The Senior Research Engineer will develop and optimize generative AI models, focusing on LLMs and their integration into products, while mentoring other engineers.
The summary above was generated by AI

We are now looking for a Senior Research Engineer passionate about Generative AI inference. Are you excited to change the way people infuse AI into products and services? NVIDIA is at the forefront of generative AI models, from language to images. NVIDIA provides building blocks to democratize AI and make generative AI easy to develop, integrate, and deploy. Our team is dedicated to developing optimized inferencing technologies to support our growing generative AI needs. We contribute to all steps of the machine learning lifecycle: from conceptualization, to applied research, engineering for optimized inference, and deployment. Collaborate with research teams, engineers, and open-source community. Implement optimized LLM algorithms.

What you will be doing:

  • Developing new models and algorithms focused on Large Language Models, Natural Language Processing, and Deep Learning.

  • Design and implement multi-node serving architectures disaggregated serving and distributed LLM inference

  • Optimize multi-LoRA (and other PEFT technique) inference serving systems

  • Apply sophisticated quantization techniques (FP4/INT4, FP8) to reduce model footprint while preserving quality

  • Implement speculative decoding (draft target, eagle, medusa etc) and other latency optimization strategies

  • Demonstrating good engineering practices and mentoring other team members to do the same.

  • Collaborating with engineering teams across all of NVIDIA to ensure our software integrates seamlessly up and down the NVIDIA accelerated serving stack.

What we need to see:

  • Understanding of modern techniques in Machine Learning, Deep Neural Networks, Natural Language Processing, or Speech Recognition.

  • 8+ years of industry experience in Deep Learning frameworks (PyTorch or TensorFlow).

  • Passion for software engineering, especially with excellent C++ and Python development skills, with meaningful contributions to major open-source projects.

  • Strong communication and interpersonal skills, along with the ability to work in a dynamic and distributed team. A history of mentoring junior engineers and interns is a huge plus.

  • Bachelor's degree or equivalent experience.

  • A desire to constantly grow and learn new things.

  • Strong computer science fundamentals - algorithms and data structures, computational complexity, parallel and distributed computing, system software.

Ways to stand out from a crowd:

  • Experience architecting or developing large-scale distributed systems for deep learning.

  • Knowledge of CPU and/or GPU architecture.

  • GPU programming (CUDA).

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 299,000 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 29, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C++
Cuda
Deep Learning
Generative Ai
Large Language Models
Natural Language Processing
Python
PyTorch
TensorFlow

Similar Jobs

2 Hours Ago
In-Office or Remote
Washington, DC, USA
25-50
Entry level
25-50
Entry level
Artificial Intelligence • Machine Learning • Software • Defense
The intern will support Vannevar's government relations efforts, engage with legislative stakeholders, and assist with strategic communications and research activities.
2 Hours Ago
In-Office or Remote
New Orleans, LA, USA
30-61
Mid level
30-61
Mid level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
The Global Seller Onboarding Specialist accelerates onboarding for merchants, manages sales funnel stages, trains sellers, and supports cross-functional initiatives to enhance product use and streamline processes.
Top Skills: Project ManagementSoftware Implementation
2 Hours Ago
In-Office or Remote
5 Locations
180K-200K
Senior level
180K-200K
Senior level
Cloud • eCommerce • Enterprise Web • Information Technology • Software
Lead a team of Enterprise Account Executives to drive revenue growth, manage sales strategy, and foster high performance in a fast-paced SaaS environment.
Top Skills: CRMSaaSSalesforce

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account