NVIDIA Logo

NVIDIA

Senior System Software Engineer - MLOps

Posted 21 Days Ago
Be an Early Applicant
Remote
2 Locations
148K-288K
Senior level
Remote
2 Locations
148K-288K
Senior level
Design and build infrastructure solutions for Triton Inference Server, implement continuous integration processes, and collaborate across teams for deep learning deployment.
The summary above was generated by AI

We are now looking for a Senior System Software Engineer to work on Triton Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to speech recognition to natural language processing. We are a fast-paced team building tools and software to make design and deployment of new deep learning models easier and accessible to more data scientists.

What you'll be doing:

In this role, you will build infrastructure solutions from first principles needed to deliver Triton Inference Server. You will apply software design skills to define the processes and best practices for performing continuous integration, testing, and releasing builds, while ensuring the cross-platform compatibility of Triton Inference Server across a wide range of operating systems and architecture systems. Using your expertise, you will influence how we design our customer facing technology and tools to enable an optimized pipeline for building and deploying our product. Extensive collaboration with cross-functional teams to integrate pipelines from deep learning frameworks and components is essential to ensuring seamless deployment and inference of deep learning models across Triton Inference Server.

What we need to see:

  • Masters degree or equivalent experience

  • 3+ years of experience in Computer Science, computer architecture, or related field

  • Ability to work in a fast-paced, agile team environment

  • Excellent Bash, CI/CD, Python programming and software design skills, including debugging, performance analysis, and test design. 

  • Experience in administering, monitoring, and deploying systems and services on GitHub and cloud platforms. Support other technical teams in monitoring operating efficiencies of the platform, and responding as needs arise.

  • Knowledge of distributed systems programming.

Ways to stand out from the crowd:

  • Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems experience.

  • Experience driving efficiencies in software architecture, creating metrics, implementing infrastructure as code and other automation improvements.

  • Background deploying cloud-native services using modern technologies such as Docker, and Kubernetes, optimizing software for scalable and efficient deployment in cloud environments.

  • Experience contributing to a large open-source deep learning community - use of GitHub, bug tracking, branching and merging code, OSS licensing issues handling patches, etc.

  • Excellent problem solving abilities spanning multiple software (storage systems, kernels and containers) as well as collaborating within an agile team environment to prioritize deep learning-specific features and capabilities within Triton Inference Server, employing advanced troubleshooting and debugging techniques to resolve complex technical issues.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and hard-working people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the dynamic and quickly growing field Deep Learning and Artificial Intelligence.

#LI-Hybrid 

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Bash
Ci/Cd
Cloud Platforms
Docker
Git
Kubernetes
Python

Similar Jobs

An Hour Ago
Remote
Hybrid
Santa Clara, CA, USA
42-48
Internship
42-48
Internship
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
As a Software Quality Engineer Intern, you will test and verify code, develop automated testing frameworks, and work within a team to ensure product quality and performance.
Top Skills: AIC++CSSJavaJavaScriptJunitPerlPHPPythonSeleniumTestngXhtml
An Hour Ago
Remote
Hybrid
Santa Clara, CA, USA
188K-328K Annually
Expert/Leader
188K-328K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Drive architecture and implementation of GenAI features, design scalable services leveraging LLMs, and guide teams in AI adoption best practices.
Top Skills: Cloud DevelopmentGenaiJavaJavaScriptLlmsReactRelational DatabasesVueWeb Engineering
An Hour Ago
Remote
Hybrid
Santa Clara, CA, USA
164K-286K Annually
Expert/Leader
164K-286K Annually
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
As a Staff Front-End Software Engineer, you'll build high-quality, scalable UI code, mentor colleagues, and integrate AI to enhance user functionality.
Top Skills: Ai ToolsAngularJavaJavaScriptReactRelational DatabasesVue

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account