NVIDIA

Senior Datacenter System Software Architect - DGX Cloud

Posted 20 Hours Ago
In-Office or Remote
2 Locations
184K-357K
Senior level
Lead the architecture, design, and implementation of DGX Cloud clusters. Oversee technical activities and infrastructure workflows, and ensure software integration for AI applications.
The summary above was generated by AI

NVIDIA is hiring engineers to scale up its AI infrastructure. We expect you to have a strong programming background, a deep understanding of distributed systems, familiarity with software testing and deployment, and excellent communication and planning abilities. We also welcome out-of-the-box thinkers who can provide new ideas with a strong bias toward execution. Expect to be constantly challenged, improving, and evolving for the better. You and other engineers on this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of AI-based applications that affect core data science. If you're creative, passionate about what you do, and love having fun, what are you waiting for? Apply today!

We’re looking for a highly motivated, creative engineer with strong experience in system software to join the DGX Cloud Software Team. You will lead the architecture, design and implementation of our next-generation DGX Cloud clusters using the latest technologies. On this team, you will do full-stack deployment, including hardware architecture, workload orchestration and application performance tuning. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.

What you’ll be doing:

  • Lead technical activities for data centers with a focus on hybrid deployments between cloud and on-prem

  • Provide expertise in infrastructure workflows, including hardware, software release, workload orchestration and application tuning

  • Provide fast and creative solutions for complex problems and write effective, clear and reliable architecture specifications

  • Translate requirements to vision, architecture and roadmap

  • Work with engineering teams across NVIDIA to ensure your software integrates seamlessly from the hardware all the way up to the AI training applications.

What we need to see:

  • Master's or PhD in Computer Science, Computer Engineering, Physics or equivalent experience.

  • 9+ years of experience in this field.

  • Data Sciences, Deep Learning, or Machine Learning coursework

  • Ability to shift seamlessly between Linux system environments and Python programming

  • Programming skills in one or more high-level languages (C, C++, Go, Rust, etc.)

  • System-level experience with both hardware and software

  • Motivated self-starter with an equal balance of strong problem-solving skills and customer-facing communication skills

  • Strong design, coding, analytical, debugging and problem-solving skills

  • Passion for continuous learning and knowledge transfer. Ability to work concurrently with multiple groups locally and abroad in the organization

Ways to stand out from the crowd:

  • Experience with GPU deep learning and data sciences. Experience using TensorFlow, PyTorch or another DL framework. Experience working with Docker containers, Slurm, Terraform and Kubernetes

  • CUDA programming and NCCL experience. HPC programming experience including MPI, OpenACC, or other parallel programming tools. Hands-on experience with DGX Cloud, NVIDIA AI Enterprise software, Base Command Manager, NeMo and NVIDIA Inference Microservices.

  • Interest in crafting, analyzing and fixing large-scale distributed systems.

  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until August 19, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

C
C++
CUDA
Docker
Go
Kubernetes
MPI
NCCL
OpenACC
Python
PyTorch
Rust
Slurm
TensorFlow
Terraform
