NVIDIA Logo

NVIDIA

Principal Architect, Site Reliability Engineering - GeForce Now

Posted 18 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
248K-391K
Expert/Leader
In-Office or Remote
2 Locations
248K-391K
Expert/Leader
Seeking a Principal Architect for SRE to design scalable infrastructure, implement best practices, mentor engineers, and enhance system reliability.
The summary above was generated by AI

NVIDIA is the world leader in accelerated computing—from gaming to data centers to AI and robotics. We are a team of trailblazers reinventing computing at the intersection of graphics, high-performance computing, and AI. If you’re driven to tackle sophisticated challenges, push boundaries, and build technology that powers the future, NVIDIA is the place for you. 

We are looking for an expert and transformative Principal Architect for Site Reliability Engineering (SRE) to join our GeForce Now Engineering team. In this role, you will define the architecture and strategic direction for NVIDIA’s highly available, scalable, and secure systems that power critically important services and platforms. You’ll collaborate with product, platform, and infrastructure teams to establish best practices, improve reliability, and drive the evolution of our SRE function. This is a highly specialized subject area which demands knowledge across different systems, networking, coding, database, capacity management, continuous delivery and deployment and open source cloud enabling technologies such as Kubernetes. 

What you will be doing: 

  • Design and architect scalable, resilient infrastructure for cloud-native and hybrid services. 

  • Define and implement SRE principles, SLAs, SLOs, and error budgets across teams and services. 

  • Collaborate with multi-functional teams to ensure reliability, observability, performance, and security. 

  • Lead architecture reviews, disaster recovery planning, incident response strategies, and postmortems. 

  • Develop automation frameworks for deployment, monitoring, and remediation of systems. 

  • Champion a culture of reliability, continuous improvement, and operational excellence. 

  • Mentor SREs and DevOps engineers, sharing knowledge and standard methodologies across the organization. 

What we need to see: 

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field (or equivalent experience).

  • 15+ years of experience in infrastructure, cloud, or SRE roles, including at least 5+ years in an architectural or technical leadership position. 

  • Expertise in cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (Kubernetes). 

  • Deep understanding of distributed systems, microservices architecture, and CI/CD pipelines. 

  • Proficient with observability tools (Prometheus, Grafana, ELK/EFK, Datadog) and infrastructure as code (Terraform, Ansible). 

  • Strong programming/scripting skills (Python, Go, Bash, etc.). 

  • Ability to communicate your ideas/code clearly through documents, presentation, etc. 

Ways to stand out from the crowd:

  • AWS, GCP, or Azure Professional Solution Architect Certification. 

  • Familiarity with parallel programming and distributed computing platforms 

  • Experience in developing large-scale and complex applications. 

  • Cross-platform development experience. 

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and dedicated people in the world working for us. If you're creative and autonomous, we want to hear from you! 

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 248,000 USD - 391,000 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until August 2, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Top Skills

Ansible
AWS
Azure
Bash
Datadog
Efk
Elk
GCP
Go
Grafana
Kubernetes
Prometheus
Python
Terraform

Similar Jobs

13 Hours Ago
Remote
USA
167K-197K Annually
Senior level
167K-197K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Partner with finance stakeholders to implement and configure Kyriba and other financial applications, perform analysis, lead testing, and support business process transformation.
Top Skills: BoomiJIRAKyribaMulesoftNetSuiteOracleRest ApiSAPSftpSoap WebservicesWorkato
13 Hours Ago
Remote
USA
152K-179K Annually
Junior
152K-179K Annually
Junior
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The role involves building and maintaining deployment systems in Golang, designing scalable solutions, and providing support to internal engineers.
Top Skills: BlockchainCloud TechnologyGo
13 Hours Ago
Remote
USA
194K-228K Annually
Senior level
194K-228K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Manage strategic credit and debit card partnerships at Coinbase, optimizing execution, aligning KPIs, and collaborating with marketing and product teams for growth.

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account