Nominal Logo

Nominal

Site Reliability Engineer (SRE)

Reposted 24 Days Ago
In-Office
3 Locations
120K-200K Annually
Senior level
In-Office
3 Locations
120K-200K Annually
Senior level
The Site Reliability Engineer will enhance reliability and observability of distributed systems, lead incident responses, and improve release processes.
The summary above was generated by AI
About Nominal
Nominal is a venture-backed company with offices in Los Angeles, Austin, and New York City. We build software and data solutions for organizations, testing and validating complex systems—think drones, robots, rocket engines, and satellites. Backed by top investors like General Catalyst, Founders Fund, and Lux Capital, we’re gaining momentum across the commercial and government aerospace and defense ecosystem, including direct work with the Department of Defense.

Our team includes alumni from SpaceX, Meta, Palantir, Anduril, Lockheed Martin, and NASA, united by a mission: accelerate hardware innovation by making testing faster, smarter, and easier. Our platform helps engineering teams scale their test infrastructure and gain insight from high-throughput, high-complexity test data.

We're looking for a senior engineer to take on a high-leverage role focused on strengthening the foundations of our distributed systems and improving how the entire team builds, ships, and maintains software. This role is ideal for someone who thrives in complex environments, has deep experience with incident response and production systems, and is driven to create safer, faster systems through smart infrastructure and process design.

🚀 What You’ll Do

  • Drive reliability and observability improvements across large-scale distributed systems.
  • Serve as a force multiplier across all engineering teams by reducing downtime, improving tooling, and freeing up senior engineers from firefighting.
  • Own and evolve our incident review process, leading postmortems and embedding learnings into tools, practices, and culture across the company.
  • Collaborate with teams to improve release hygiene, including: Automating release gating (e.g., ensuring code bakes in staging for appropriate windows), preventing code from stagnating in staging environments, and implementing pre-prod automated test pipelines to catch issues early.
  • Build and maintain Nominal’s gRPC middleware to ensure safe, observable, and performant service communication.
  • Improve alerting, debugging, and monitoring to ensure production health and rapid root cause analysis.

🔍 Who You Are

  • You have 7+ years of experience in software engineering with a strong focus on production systems and distributed architectures.
  • You thrive in high-leverage roles that improve how everyone else builds, ships, and fixes software.
  • You’ve led or played a significant role in incident response, building systems, and culture around continuous improvement.
  • You’re excited by complexity, not afraid of it, and you’re deeply motivated to make systems safer and teams faster.

⚡Skills that supercharge us

  • Experience working on distributed systems at scale.
  • Hands-on experience with Kafka/Redpanda, PostgreSQL or other SQL databases, MongoDB/NoSQL databases, Clickhouse or other OLAP databases.
  • Deep understanding of release automation, CI/CD, and code lifecycle management.
  • Familiarity with gRPC and experience building shared infrastructure components like middleware.
  • A systems mindset—you understand the ripple effects of a single bug and know how to design to prevent them

✨ Benefits & Perks

  • 🏥 100% coverage of medical, dental, and vision insurance
  • 🏖️ Unlimited PTO and sick leave
  • 🍽️ Free lunch, snacks, and coffee
  • 🚀 Professional Development Stipend
  • ✈️ Annual company retreats

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, or national origin.

Top Skills

Ci/Cd
Clickhouse
Grpc
Kafka
MongoDB
Postgres
Redpanda

Nominal Los Angeles, California, USA Office

Los Angeles, CA, United States

Similar Jobs

3 Days Ago
Hybrid
Houston, TX, USA
Junior
Junior
Financial Services
As a Site Reliability Engineer II, you will ensure system reliability, oversee small to medium projects, and leverage technology for business solutions.
Top Skills: CloudDatadogDynatraceGitlabGrafanaJenkinsLinuxPrometheusSoftware EngineeringSplunkTerraformWindows
10 Days Ago
Hybrid
Fort Worth, TX, USA
Senior level
Senior level
Financial Services
Lead reliability initiatives for applications and platforms, utilize data analytics, collaborate on service metrics, and manage major incidents.
Top Skills: .NetCloudwatchDatadogDockerDynatraceEcsGitlabGrafanaJava Spring BootJenkinsKubernetesPrometheusPythonSplunkTerraform
10 Days Ago
In-Office
Plano, TX, USA
110K-145K Annually
Senior level
110K-145K Annually
Senior level
Aerospace • Cloud • Digital Media • Information Technology • Mobile • News + Entertainment • Retail
Lead incident response and reliability initiatives by implementing monitoring, automation, and mentoring junior engineers to enhance system health.
Top Skills: AppdynamicsAWSAzureBashCassandraCircleCIDatadogDockerDynatraceElk StackGitlab CiGoGCPGrafanaJavaJenkinsKubernetesMySQLOraclePostgresPrometheusPythonRubySplunkSQL

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account