NVIDIA

Senior System Software Engineer - Performance

Posted 13 Days Ago

Be an Early Applicant

In-Office or Remote

3 Locations

148K-288K

Senior level

In-Office or Remote

3 Locations

148K-288K

Senior level

The role involves enabling GPU computing products deployment, performance analysis, and improvements for large scale AI systems, collaborating with diverse teams.

The summary above was generated by AI

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are looking for an outstanding engineer for a System Performance Engineer role for at scale AI system performance and datacenter applications. Be a key player to the most exciting computing hardware and software to contribute to the latest breakthroughs in artificial intelligence and GPU computing! Provide insights on at-scale system design and tuning mechanisms for large-scale compute runs. You will work with the latest Accelerated Computing and Deep Learning software and hardware platforms, and with many researchers, developers, and customers to craft improved workflows and develop new, leading differentiated solutions. You will interact with HPC, OS, CPU and GPU compute, and systems specialist to architect, develop and bring up large scale performance platforms.

What you'll be doing:

Provide engineering solutions to enable deployment of world-class GPU computing products at scale, lead technical relationships with engineering teams, and assisting system administrators, software and hardware engineers, and machine learning/deep learning engineers in building creative solutions.
Lead aspects of performance analysis and scalable practices to support large scale infrastructure, deliver powerful tools, methodologies, and workflows to validate expectations.
Deliver engineering solutions to deliver continuous insights into performance of AI workloads over evolving environments, generating quick insights to improvements and regressions over time.
Decompose multi-faceted issues into minimal reproduction cases, working towards final root cause of underlying problems.
Participate and engage with multiple team members to develop best practices for understanding trends in test results and presenting data clearly to develop data driven actions.

What we need to see:

5+ years of experience running multinode workloads and identifying bottlenecks and implementing improvements.
Proven understanding of high-performance computing based architectures and GPU accelerated computing software stacks and DL Frameworks (CUDA, PyTorch).
Experience with CPU architectures.
Experience with C/C++/Python/Bash programming/scripting.
Strong teamwork and communication skills.
Ability to multitask in a dynamic environment.
Action driven with strong analytical and analytical skills.
BS in Engineering, Mathematics, Physics, or Computer Science, MS or PhD desirable (or equivalent experience).

Ways to Stand Out From the Crowd:

Experience tuning memory, storage, and networking settings for performance on Linux systems.
Knowledge of modern Cloud and container-based architectures.
Hands-on experience deploying and debugging systems with NVIDIA NVLink and Infiniband.
Experience with multiple monitoring stacks such as Prometheus+Grafana, Elasticsearch+Kibana, Splunk, Zabbix, etc.
Demonstrated work with Open-Source software: building, debugging, patching and contributing code.

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, with a genuine passion for technology, we want to hear from you.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Bash

C++

Cuda

Elasticsearch

Grafana

Infiniband

Kibana

Nvidia Nvlink

Prometheus

Python

PyTorch

Splunk

Zabbix

Similar Jobs

NVIDIA

Software Engineer

25 Days Ago

In-Office or Remote

184K-357K

Senior level

184K-357K

Senior level

Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse

Design, develop, and optimize software for next-gen SoCs, focusing on performance and architectural analysis, while working in complex environments.

Top Skills: Arm ArchitectureCuda

Groq

Staff Software Engineer

15 Days Ago

In-Office or Remote

Entry level

Artificial Intelligence • Machine Learning • Semiconductor

As a Software Engineer, develop real-time distributed compute frameworks for ultra-low latency AI inference. Collaborate on hardware-software optimization and ensure mission-critical reliability.

Top Skills: C++FpgasMlirPyTorchRust

ServiceNow

Staff Software Engineer

50 Minutes Ago

Remote or Hybrid

Santa Clara, CA, USA

164K-286K Annually

Senior level

164K-286K Annually

Senior level

Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation

Lead engineering teams through development cycles, architect database schemas, enforce best practices, and integrate AI into products while mentoring colleagues.

Top Skills: AIAngularJavaJavaScriptReactVue

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
Key Industries: Artificial intelligence, adtech, media, software, game development
Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering