NVIDIA

Senior Research Scientist, Post-Training LLM and DLM

Posted 15 Hours Ago
In-Office or Remote
2 Locations
160K-299K
Senior level
The role involves designing and implementing post-training algorithms for LLMs and DLMs, improving training pipelines, collaborating with researchers, and demonstrating engineering practices.
The summary above was generated by AI

We are now looking for a Senior Research Scientist passionate about Large Language Model (LLM) and Diffusion Language Model (DLM) post-training and system optimization. Are you excited to shape the future of large-scale generative AI? NVIDIA is at the forefront of foundation models and generative AI systems, enabling cutting-edge research and real-world deployment at unprecedented scale. Our team is dedicated to advancing post-training algorithms, building efficient large-scale systems, and developing evaluation frameworks to ensure reliability and scalability. Join us to work with world-class researchers and engineers on building the next generation of AI.

What you will be doing:

  • Designing and implementing post-training algorithms for LLMs and DLMs.

  • Driving efficiency and scalability improvements across training pipelines and serving systems.

  • Collaborating with researchers to translate cutting-edge ideas into production-ready implementations.

  • Exploring new paradigms for evaluation.

  • Demonstrating strong engineering practices and contributing to open-source communities.

What we need to see:

  • PhD in Computer Science, Electrical Engineering, or related field, or equivalent research experience in LLMs, systems, or related areas.

  • 2+ years of experience in machine learning, systems, distributed computing, or large-scale model training.

  • Proficiency in Python with hands-on experience in frameworks such as PyTorch.

  • Solid background in computer science fundamentals: algorithms, data structures, parallel/distributed computing, and systems programming.

  • Proven ability to collaborate across research and engineering teams in multifaceted environments.

Ways to stand out from the crowd:

  • Expertise in post-training LLMs with novel algorithmic/data pipelines.

  • Experience developing and scaling large distributed systems for deep learning.

  • Contributions to open-source LLM systems or large-scale AI infrastructure.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and hard-working people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 160,000 USD - 258,750 USD for Level 3, and 184,000 USD - 299,000 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 20, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Python
PyTorch
