Varda Space Industries Logo

Varda Space Industries

Senior Site Reliability Engineer

Posted Yesterday
Be an Early Applicant
In-Office
El Segundo, CA, USA
153K-185K Annually
Senior level
In-Office
El Segundo, CA, USA
153K-185K Annually
Senior level
Lead design, build, and operate mission-critical infrastructure across cloud, on-prem, and spacecraft contexts. Implement IaC, CI/CD, observability, and scalable Kubernetes-based systems; respond to incidents, perform root cause analysis, optimize performance, and collaborate with software and hardware teams. Participate in on-call rotations and occasional travel.
The summary above was generated by AI
About Varda

Low Earth orbit is open for business. Varda is accelerating the development of commercial space infrastructure, from in-orbit pharmaceutical processing to reliable and economical reentry capsules. Varda’s W-Series vehicles are built, designed, and operated by Varda in-house, including the pharmaceutical processing payloads, the capsules, the C-PICA heatshields, and the satellite buses.

From life-saving pharmaceuticals to more powerful fiber optics, there is a world of products used on Earth today that can only be manufactured in space. Varda is accelerating innovation in the orbital economy by creating both the products and infrastructure needed so space can directly benefit life on Earth. Our mission is to expand the economic bounds of humankind.  

Our team is uniquely suited to accomplishing this goal, with leadership and staff comprised of veterans from SpaceX, Blue Origin, major pharmaceutical companies and Silicon Valley. Varda was founded in January 2021 by Will Bruey and Delian Asparouhov with significant backing from world class investors including Khosla Ventures, Lux Capital, Founders Fund, Caffeinated Capital, General Catalyst, and Also Capital.  

Varda is headquartered in El Segundo, California, where we have offices and a production facility where our vehicles, equipment, and materials are built, integrated, and tested. Varda also has offices in Washington, DC and Huntsville, AL. 

Join Varda, and work to create a bustling in-space ecosystem.

About This Role 

At Varda Space Industries, we're pushing the boundaries of what's possible in space and materials science — and we’re looking for bold engineers to help us get there. As a Senior Site Reliability Engineer, you'll be critical in building, scaling, and maintaining the infrastructure that powers our systems on Earth, in orbit, and everything in between. 

We are looking for an experienced engineer with deep working knowledge of Kubernetes and containerized technologies. You are a hands-on operator and builder who applies first-principles thinking to both software delivery (DevOps) and production reliability (SRE), and thrives in complex, mission-critical environments. 

In this role, you will: 

  • Solve challenging technical problems across a wide range of modern technologies. 
  • Apply a software engineering mindset to automate operations and improve system reliability, scalability, and resilience 
  • Design and build infrastructure that enables rapid development — from cloud-based services to embedded software running on spacecraft. 
  • Shape Varda’s infrastructure strategy and drive operational excellence across containerized and modernized environments. 

Responsibilities 

  • Deploy, maintain, and operate mission-critical applications and infrastructure supporting spacecraft and company-wide systems. 
  • Build and evolve Infrastructure as Code (IaC) frameworks using tools such as Terraform 
  • Implement and operate observability systems (metrics, logging, tracing) and actionable alerting. 
  • Build and maintain CI/CD pipelines to enable safe, repeatable, and rapid deployments. 
  • Partner with software and hardware engineers to deliver highly operable, reliable, and scalable systems and pipelines, ensuring they have the tools and infrastructure needed for rapid iteration. 
  • Identify, analyze, and resolve system bottlenecks and reliability risks; perform performance tuning and implement long-term stability improvements. 
  • Respond to and resolve production incidents; perform root cause analysis and drive corrective actions through blameless postmortems. 
  • Rotate through the team’s on-call schedule to keep critical systems healthy and responsive. 
  • Must be willing to work extended hours and weekends as needed 
  • Occasionally travel to customer sites and other Varda locations to troubleshoot, deploy, or test critical infrastructure. 

Basic Qualifications 

  • Bachelor’s degree in computer science, engineering, or related STEM field with 5+ years of Site Reliability Engineering experience, or 7+ years of progressive experience in DevOps, SRE, or Systems Engineering in lieu of a degree.  
  • Experience with Infrastructure as Code (IaC) using tools like Terraform to automate server provisioning and configuration management 
  • Experience operating Kubernetes or similar container orchestration platforms in production environments. 
  • Experience with Prometheus, Grafana, InfluxDB, or similar technologies. 
  • Knowledge of software-defined networking (VPC, Subnets, Firewalls, VPNs, etc.) 
  • Python, Bash, PowerShell (or similar) scripting experience  
  • Positive and strong communication skills, both written and oral 

Preferred Skills and Experience 

  • Experience in provisioning and managing scalable Azure cloud infrastructure using native tools and best practices 
  • Experience implementing configuration management, provisioning, and workflow automation solutions via Infrastructure as Code, CI/CD, and GitOps (e.g., Ansible, Salt, ArgoCD, etc). 
  • Strong understanding of Linux systems and container runtimes (e.g., containerd, Docker) 
  • Experience with GPU workloads or high-throughput computing. 
  • Hands-on experience operating and optimizing High Performance Computing (HPC) environments, including workload schedulers such as Slurm (e.g., queue/partition design, fair-share scheduling, and cluster resource management). 
  • Experience with hybrid environments (cloud + on-prem or edge systems) 
  • Experience debugging distributed systems at scale (network, storage, latency) 
  • Experience with databases and data modeling 
Pay Range
  • Site Reliability Engineer: $153,000.00 - $185,000.00/per year
  • This role is on-site in El Segundo, CA
  • Leveling and base salary are determined by job-related skills, education level, experience level, and job performance
  • You will be eligible for long-term incentives in the form of stock options and/or long-term cash awards

Benefits

Varda offers a comprehensive benefits package designed to support health, financial well‑being, and a high‑quality workplace experience. Below is an overview of what full‑time employees receive (at this time, interns receive a subset of benefits): 

Health & Wellness 

  • Flexible PTO policy + 12 paid holidays 
  • 100% company-paid Medical, Dental, and Vision insurance plans for employees and dependents with FSA and employer-matched HSA options
  • Voluntary accident, hospital, critical illness, and pet insurance 
  • $120/month wellness reimbursement for gym and fitness expenses 
  • 12 weeks of parental leave (with supplemental disability leave for CA mothers) 
  • Family building, pregnancy, parenting and menopause benefits via Maven Clinic 
  • Sponsored One Medical memberships for employees and their dependents 

Financial & Retirement 

  • Substantial incentive equity in a fully funded space start-up 
  • 401(k) retirement plan with 6% employer match (immediately vested)
  • $20/pay period cell phone reimbursement 
  • Relocation support for new hires, if needed 

Workplace Experience & Perks 

  • Fully stocked kitchen with lunch provided daily and dinner provided twice weekly 
  • Company and team-bonding events, happy hours and mission-success celebrations
  • Complimentary EV charging
  • Dog-friendly office space 🐕 

ITAR Requirements

Varda, like all employers, must ensure that its employees working in the United States are lawfully authorized to work in the U.S. Additionally, our employees are exposed to and have access to certain export-controlled items. Because our employees are provided access to export-controlled items, our policy is to only hire “U.S. persons” who are permitted to have access to our technology without an export license.
“US person” means: U.S. citizen, U.S. lawful permanent resident, or protected individual as defined by 8 U.S.C. 1324b(a)(3) (i.e., individual admitted to the U.S. as a refugee or granted asylum).


E-Verify Statement

Varda Space Industries, Inc. participates in the U.S. Department of Homeland Security E-Verify program. The E-Verify program is an Internet-based employment eligibility verification system operated by the U.S. Citizenship and Immigration Services. Learn more about the E-Verify program.

E-Verify Notice                                                               Right To Work Notice

Read more                                                                             Read more

Varda Space Industries is an Equal Opportunity Employer.  We celebrate diversity and are committed to creating an inclusive environment for all employees. Candidates and employees are always evaluated based on merit, qualifications, and performance. We will never discriminate on the basis of race, color, gender, national origin, ethnicity, veteran status, disability status, age, sexual orientation, gender identity, martial status, mental or physical disability, or any other legally protected status.

HQ

Varda Space Industries El Segundo, California, USA Office

225 S Aviation Blvd., El Segundo, CA, United States, 90405

HQ

Varda Space Industries El Segundo, California, USA Office

225 S Aviation blvd, El Segundo, California, United States, 90045

Similar Jobs

9 Days Ago
Hybrid
160K-250K Annually
Senior level
160K-250K Annually
Senior level
Artificial Intelligence • Fintech • Payments • Business Intelligence • Financial Services • Generative AI
Lead design and delivery of scalable cloud infrastructure for the Spend product. Embed with development teams to drive reliability, performance, observability, incident response, and automation. Own SLOs, runbooks, DevOps metrics, and collaborate with central DevOps and security teams to ensure compliance and resilience. Lead infrastructure projects including new service launches, data centre migrations, and modernising data pipelines.
Top Skills: Analytics PipelinesAWSData StreamingDevOpsGCPIncident ResponseKubernetesObservabilitySlosSre
18 Days Ago
Easy Apply
Hybrid
Easy Apply
210K-270K Annually
Senior level
210K-270K Annually
Senior level
Healthtech • Information Technology • Software • Telehealth
Lead reliability efforts for Zocdoc's cloud-based, consumer-facing services: monitor and maintain production systems, automate tooling and infrastructure, support scaling and performance, debug production incidents, and work with product teams to improve uptime and reliability.
Top Skills: AWSDistributed SystemsDnsDockerGCPGenaiHTTPHttpsKubernetesLoad BalancerMicroservicesNtpReverse ProxyTcp/IpTlsWeb Application Firewall
Yesterday
In-Office or Remote
United States
Senior level
Senior level
Blockchain • Fintech • Software • Cryptocurrency • Metaverse
Design, build, and maintain internal monitoring and alerting for high-load real-time systems; automate production testing; troubleshoot and resolve performance issues; coordinate cross-team incident resolution; recommend architectural and process improvements; research vendor solutions and enforce security best practices.
Top Skills: AWSGCPJavaScriptLinuxNode.jsRest ApiWebsockets

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account