NVIDIA Logo

NVIDIA

Software Engineering Manager, AI Infrastructure Services - DGX Cloud

Reposted Yesterday
In-Office or Remote
2 Locations
200K-322K
Senior level
In-Office or Remote
2 Locations
200K-322K
Senior level
Manage teams of software engineers developing automation for AI infrastructure services, ensuring reliable cloud services and talent development.
The summary above was generated by AI

We are seeking an experienced manager of Software Engineers to develop automation running reliable AI infrastructure services at scale; both close to the bare metal and over VMaaS. In this organization, you will develop one or more teams to ensure that our internal and external facing cloud services atop of our hardware for accelerated computing are running as reliably as needed.

What you’ll be doing:

  • Recruit and retain talent managing career development for your organization.

  • Accountable for deliverables of team(s) in scope.

  • Be accountable for cross team and cross company communications.

  • Participate in KPI-driven strategic planning.

  • Foster a collaborative environment.

What we need to see:

  • 7+ overall years of experience

  • BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics) or equivalent experience.

  • 3+ years of management experience with prior hands-on experience as an individual contributor.

  • A proven track record of impactful project deliveries while managing Software Engineers focused on cloud infrastructure or cloud application services.

  • Experience with DevOps and/or SRE practices and/or Platform Engineering.

  • Systematic problem-solving approach, coupled with strong communications skills and a sense of ownership and drive.

Ways to stand out from the crowd:

  • Developing ML/AI infrastructure. Developing bare metal as a service (BMaaS) associated systems. Developing multi-cloud infrastructure services.

  • Teaching reliability (e.g. SRE) or more general cloud systems good practices to peers or to other companies (e.g. CRE). Running private or public cloud systems based on one or more of Kubernetes, OpenStack, NVIDIA BCM, Docker or Slurm

  • No prior experience having worked in a team of any particular name or having worked in a ML/AI focused team are required but also a nice to have. Experience managing team(s)

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you are creative and autonomous, we want to hear from you! NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning - the next era of computing - with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “The AI Computing Company.” We're looking to grow our company and establish teams with the most thoughtful people in the world.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 200,000 USD - 322,000 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until September 22, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

DevOps
Docker
Kubernetes
Nvidia Bcm
Openstack
Platform Engineering
Slurm
Sre

Similar Jobs

13 Minutes Ago
Remote or Hybrid
Pennsylvania, USA
32-32
Internship
32-32
Internship
AdTech • Digital Media • Marketing Tech
The intern will assist in managing cloud infrastructure and observability solutions for the Beeswax platform, learning real-world applications of scripting and infrastructure best practices.
Top Skills: AWSC++JavaKubernetesLinuxPython
24 Minutes Ago
Remote or Hybrid
New York, NY, USA
550-700
Mid level
550-700
Mid level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
The Broadcast Facilities Supervisor will provide technical and operational support during the Milan Winter Games, overseeing facility maintenance, installations, and coordinating with various teams.
Top Skills: Construction ToolsPower Construction EquipmentSecurity Systems
24 Minutes Ago
Remote or Hybrid
New York, NY, USA
150K-185K Annually
Senior level
150K-185K Annually
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Design, build, and oversee technology solutions for data management, utilizing various technologies to support business intelligence and decision-making processes.
Top Skills: AthenaAWSAws CloudformationEmrGithub ActionsHiveKafkaMicrostrategyPandasPostgresPythonScalaSnsSparkSqsTableauTerraform

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account