NVIDIA Logo

NVIDIA

Senior Software SDET Test Development Engineer

Reposted 2 Days Ago
Be an Early Applicant
In-Office
Santa Clara, CA
140K-270K Annually
Senior level
In-Office
Santa Clara, CA
140K-270K Annually
Senior level
Develop and execute platform test plans for NVIDIA HGX/DGX/MGX servers across OS, firmware, and CUDA stack. Build and maintain server/OS-level automation frameworks, run reliability/validation tests, perform root-cause analysis, manage bug lifecycles, and collaborate with partners and cross-functional teams to ensure test-driven quality at scale.
The summary above was generated by AI

NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. NVIDIA is also well positioned as the ‘AI Computing Company’, and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some of the most experienced and dedicated people in the world working for us. If you are dedicated, forward-thinking, and hard-working technical people across countries sounds exciting, this job is for you. NVIDIA is looking for an outstanding individual who thrives in a diverse work environment, has outstanding interpersonal skills and possesses a strong sense of engagement and continuous process improvement. This candidate must have enterprise server integration, strong Linux experience, reliability testing with various telemetries, scale out cluster, test plan development, track record in developing AI tools and NLP, DevOps, CI/CD experience to join our platform SWQA team.

What you’ll be doing:

  • Responsible for the development and execution of NVIDIA HGX/DGX/MGX platform test plan on servers, OS, FW and CUDA SW stack from design doc.

  • Installing and testing various systems OS, server firmware and SW stack.

  • Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.

  • Build, develop/debug server and OS level automation front-end and back-end framework and tests

  • Review partner and supplier test results and prescribe additional reliability testing on components, servers, and packaging as needed.

  • Work in an agile software development team with very high production quality standards.

  • Manage bug lifecycle and collaborate with inter-groups to drive for solutions.

What we need to see:

  • Bachelor’s Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math or Physics) field

  • 5+ years proven experience; or master’s degree.

  • Proven years of OS and server level automation, CI/CD process and DevOps experience using Python, SHELL, Ansible, Jenkins, C/C++, Java, JavaScript

  • Strong server and Linux(Ubuntu, RedHat, CentOS, SuSE, Fedora and etc…) troubleshooting and debugging experience in a bare-metal and KVM/VMWare/Hyper-V environment.

  • Good knowledge and hands-on experience in model testing, AI tools/frameworks (TensorFlow, Pytorch, Cursor and etc…), NLP and LLM benchmarking

  • Experience in using AI development tools for test plans creation, test cases development and test cases automation

  • Strong experience in FW, BMC/OpenBMC, Network protocol, internal/external enterprise storage devices, PCIe buses and devices, IO sub-devices, CPU and memory, ACPI, UEFI spec, Redfish - huge plus

  • Proven years of experience in GitHub/Gitlab/Gerrit, PXE, SLURM, Stack/Kubernetes/Docker) – huge plus

Ways to stand out from the crowd:

  • AI related tools, LLM and NLP.

  • Experience working with NVIDIA GPU hardware is a strong plus.

  • Good to have solid understanding of virtualization in Linux (KVM, Docker orchestrated with Kubernetes)

  • Background in parallel programming ideally CUDA/OpenCL is a plus

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

    Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 140,000 USD - 224,250 USD for Level 3, and 168,000 USD - 270,250 USD for Level 4.

    You will also be eligible for equity and benefits.

    Applications for this job will be accepted at least until February 28, 2026.

    This posting is for an existing vacancy. 

    NVIDIA uses AI tools in its recruiting processes.

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    Top Skills

    Python,Shell,Ansible,Jenkins,C,C++,Java,Javascript,Ubuntu,Redhat,Centos,Suse,Fedora,Kvm,Vmware,Hyper-V,Tensorflow,Pytorch,Cursor,Cuda,Nlp,Llm Benchmarking

    Similar Jobs

    Yesterday
    In-Office
    Santa Clara, CA, USA
    168K-270K Annually
    Senior level
    168K-270K Annually
    Senior level
    Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
    Lead development of large-scale test infrastructure and automation for confidential computing on NVIDIA GPU platforms. Create test plans, orchestrate distributed test execution, improve code coverage, integrate AI tools into automation, collaborate across teams to productize features, and run regression and validation on CUDA/driver features in agile environments.
    Top Skills: Python,C,C++,Linux,Cuda,Openacc,Docker,Ansible,Xen,Kvm,Hyper-V,Automake,Autoconf,Cmake,Meson,Nvidia Gpus,Gpu,Cloud
    Yesterday
    In-Office
    Santa Clara, CA, USA
    140K-270K Annually
    Senior level
    140K-270K Annually
    Senior level
    Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
    Design, implement, and operate large-scale automated test infrastructure for CUDA/driver validation on distributed heterogeneous servers. Develop test plans, automation frameworks, and orchestration for silicon validation, improve code coverage, integrate AI tools for test generation and analysis, collaborate cross-functionally, and run regression and hardware/software tuning for compute releases.
    Top Skills: Linux,Python,C,C++,Cuda,Openacc,Ai Tools,Cloud Infrastructure,Ansible,Docker,Xen,Kvm,Rpm,Deb,Centos,Ubuntu,Sles,Redhat,Fedora,Automake,Autoconf,Cmake,Meson,Containers,Cross-Compilation,Cluster Management,Orchestration Systems
    16 Days Ago
    In-Office
    Santa Clara, CA, USA
    146K-183K Annually
    Senior level
    146K-183K Annually
    Senior level
    Database
    The role involves leading testing efforts on storage and file system components, developing test methodologies, executing test plans, automating tests, and mentoring team members in a hybrid work setting.
    Top Skills: BashDdFioFsctGoIscsiNas Storage ProtocolsNfsPytestPythonS3SmbSpecfsVdbenchVirtana

    What you need to know about the Los Angeles Tech Scene

    Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

    Key Facts About Los Angeles Tech

    • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
    • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
    • Key Industries: Artificial intelligence, adtech, media, software, game development
    • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
    • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
    • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

    Sign up now Access later

    Create Free Account

    Please log in or sign up to report this job.

    Create Free Account