Job Title, Company or Keyword

Maximum of 25 job preferences reached.

Top Remote Site Reliability Engineer Jobs in Los Angeles, CA

Aalyria

Site Reliability Engineer

Reposted 14 Days AgoSaved

Remote

United States

115K-135K Annually

Mid level

115K-135K Annually

Mid level

Aerospace • Manufacturing

As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.

Top Skills: ArgocdAWSElkGCPGoGrafanaIstioJaegerKubernetesLinkerdLokiOpentelemetryPrometheusPythonTempoTerraform

Offchain Labs

Site Reliability Engineer

Reposted 15 Days AgoSaved

Remote

United States

Mid level

Blockchain • Software

Build, operate, and scale production Kubernetes infrastructure using GitOps and declarative IaC. Design CI/CD workflows, observability, and secure-by-default systems. Troubleshoot networking/storage, participate in on-call rotations, automate operational workflows, and drive postmortems and reliability improvements.

Top Skills: ArbitrumArgocdArgocd ApplicationsetsAWSAzureBashCloudwatchCodebuildGCPGithub ActionsGitopsGoGrafanaK9SKubernetesLinuxLokiMimirPrometheusPrysmPythonTerraformYamlZerodev

Tekmetric

Site Reliability Engineer

Reposted 15 Days AgoSaved

Remote

United States

Senior level

Automotive

Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.

Top Skills: AWSBashCi/CdDockerElk StackGCPGrafanaKubernetesPrometheusPythonTerraform

StackBlitz

Staff Site Reliability Engineer

16 Days AgoSaved

Remote

USA

Senior level

Software • Web3

Lead reliability practices across teams: embed early in projects, define SLIs/SLOs, build multi-cloud paved roads with Terraform, run on-call, drive org-wide incident maturity and tooling.

Top Skills: AWSAzureGCPRuby On RailsTerraformTypescriptWebcontainers

Canonical

Site Reliability Engineer

Reposted 16 Days AgoSaved

In-Office or Remote

United States

200K-200K Annually

Mid level

200K-200K Annually

Mid level

Cloud • Software

The Site Reliability Engineer will ensure reliable cloud operations by applying Python for infrastructure automation, managing OpenStack and Kubernetes, and practicing devsecops in a fast-paced environment.

Top Skills: KubernetesLinuxOpenstackPython

Akamai Technologies

Site Reliability Engineer II

Reposted 16 Days AgoSaved

In-Office or Remote

United States

95K-171K Annually

Junior

95K-171K Annually

Junior

Cloud • Security • Software • Cybersecurity

As a Site Reliability Engineer II, you'll automate tasks, monitor AI workloads, enhance dashboards, support CI/CD processes, and collaborate with engineering teams on complex issues while participating in on-call rotations.

Top Skills: GoGrafanaKubernetesLinuxPrometheusPythonSaltstackTerraform

Yugabyte

Staff Site Reliability Engineer

Reposted 16 Days AgoSaved

Remote

United States

220K-250K Annually

Expert/Leader

220K-250K Annually

Expert/Leader

Cloud • Software • Database

Lead design, build, and operate the YugabyteDB DBaaS infrastructure. Drive architecture, automate lifecycle and maintenance, manage incidents and on-call rotations, implement security/encryption processes, and optimize reliability using SRE principles and observability.

Top Skills: AksAnsibleAWSAzureBashDockerEksGCPGitGithub ActionsGkeJavaKubernetesLinuxPostgresPrometheusPythonShellTerraform

Stellar Cyber

Senior DevOps Engineer/Site Reliability Engineer-East Coast

Reposted 16 Days AgoSaved

In-Office or Remote

United States

165K-215K Annually

Senior level

165K-215K Annually

Senior level

Software • Cybersecurity

This role involves managing Kubernetes clusters, cloud infrastructure, and CI/CD pipelines. The engineer will enhance system reliability and efficiency while troubleshooting production issues.

Top Skills: AlertmanagerAWSAzureBashCi/CdDockerElastic StackElasticsearchGCPGoGrafanaHelmKafkaKubernetesLokiMongoDBOciPrometheusPythonRedisSparkTerraform

Socure

Senior Software Engineer - SRE

Reposted 16 Days AgoSaved

Remote or Hybrid

160K-180K Annually

Senior level

160K-180K Annually

Senior level

Artificial Intelligence • Machine Learning • Software • Analytics

The role involves end-to-end ownership of AWS infrastructure, managing Kubernetes platforms, and ensuring system reliability through observability and automation. Responsibilities include incident response and maintaining CI/CD systems.

Top Skills: ArgocdAWSDatadogGitGoKubernetesPythonTerraform

Encora

Senior Application Support Engineer (SRE)

Reposted 16 Days AgoSaved

Remote

United States

Mid level

Software • Consulting

The Senior Application Support Engineer leads efforts to ensure application reliability, manages incidents, collaborates with teams, and monitors performance, providing 24/7 support.

Top Skills: AppdynamicsAWSDatadogLinuxMulesoftOpentelemetryPythonServicenowSplunk

Andromeda (andromeda.ai)

Site Reliability Engineer - AI Infrastructure

Reposted 17 Days AgoSaved

In-Office or Remote

United States

Senior level

Artificial Intelligence • Cloud • Information Technology • Software

The Site Reliability Engineer will provision and manage Kubernetes clusters, build automation tools, debug customer issues, and improve infrastructure reliability.

Top Skills: AnsibleBashDatadogGoGrafanaHelmKubernetesLokiPrometheusPythonTerraform

Nebius

Staff Network Site Reliability Engineer

18 Days AgoSaved

Remote

United States

180K-224K Annually

Senior level

180K-224K Annually

Senior level

Artificial Intelligence • Information Technology • Consulting

Build and operate Nebius's network infrastructure: define SLIs/SLOs, improve site and inter-site reliability, lead incident response and postmortems, develop observability and alerting, automate change workflows, and collaborate with network and platform teams to embed operability.

Top Skills: Ci/CdContainer PlatformsGoInfrastructure As CodeLinuxPython

New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free

Akamai Technologies

Site Reliability Engineer

18 Days AgoSaved

In-Office or Remote

United States

76K-136K Annually

Mid level

76K-136K Annually

Mid level

Cloud • Security • Software • Cybersecurity

Design, develop, test, and operate scalable infrastructure and services for Akamai Cloud. Implement and manage Infrastructure-as-Code (Terraform and similar tools), CI/CD, and observability. Automate reliability improvements, mentor engineers, collaborate on incident response and root-cause remediation, and participate in on-call rotations.

Top Skills: Alerting)AnsibleChefCi/CdInfrastructure As CodeLinuxLoggingObservability (MonitoringPuppetSaltstackTerraform

Berkeley Research Group

Site Reliability Engineer

18 Days AgoSaved

Remote

USA

130K-160K Annually

Senior level

130K-160K Annually

Senior level

Other

Design, build, and maintain highly available cloud-native systems. Improve reliability through automation, CI/CD, Kubernetes, observability, and incident management. Collaborate with developers, security, and product teams to define SLOs, implement self-healing, debug production issues, and ensure secure deployments.

Top Skills: AWSAzure Cloud ServicesDatadogGCPGithub ActionsGitlab CiGoInfrastructure As CodeKubernetesOpsgeniePagerdutyPythonRubySite Reliability Engineering Foundation

Onebrief

Senior Site Reliability Engineer (Arlington, VA) - Relocation Provided

24 Days AgoSaved

Remote

United States

180K-220K Annually

Senior level

180K-220K Annually

Senior level

Software • Defense

Own reliability, scalability, and security for on-prem and AWS deployments. Build observability (Prometheus/Loki/Grafana/ELK), define SLOs/SLIs, lead incident response and postmortems, automate infrastructure (Terraform/Ansible), operate Kubernetes clusters, embed security/compliance controls, eliminate operational toil, and mentor teams.

Top Skills: AlloyAnsibleAWSAws GovcloudBashCloudFormationDatadogElkGithub ActionsGitlab Ci/CdGoGrafanaJenkinsKubernetesLokiPrometheusPythonRmfStigsTerraform

Order.co

Senior Site Reliability Engineer

Reposted 25 Days AgoSaved

Remote or Hybrid

United States

175K-200K Annually

Senior level

175K-200K Annually

Senior level

eCommerce • Fintech • Payments • Software

The role involves ensuring software reliability and performance, managing incidents, developing infrastructure automation, and mentoring junior engineers within a platform team.

Top Skills: AWSCloudFormationDatadogKubernetesOpentelemetryRubyRuby On RailsTerraform

xLabs

Senior / Staff Site Reliability Engineer (Blockchain Infra)

Reposted 19 Days AgoSaved

In-Office or Remote

United States

Senior level

Software

The role involves managing compute infrastructure for decentralized applications, requiring critical thinking, documentation skills, and experience in Kubernetes and blockchain management.

Top Skills: BlockchainGitopsInfrastructure-As-CodeKubernetesProgramming Languages

Mattermost

Lead Site Reliability Engineer

Reposted 21 Days AgoSaved

Remote

United States

170K-200K Annually

Senior level

170K-200K Annually

Senior level

Software

Lead SRE to define SRE strategy, architecture, and roadmap; design and operate containerized, compliant cloud environments; build observability, incident management, automation, and developer platform capabilities; mentor SRE team and collaborate with security, compliance, and product teams to ensure reliability at scale.

Top Skills: AWSAws MarketplaceAzureAzure MarketplaceGCPGoogle Cloud MarketplaceGrafanaKubernetesPrometheusTerraform

Assured

Staff Site Reliability Engineer

Reposted 21 Days AgoSaved

Remote

USA

180K-210K Annually

Senior level

180K-210K Annually

Senior level

Artificial Intelligence • Insurance • Software • Automation

The Staff Site Reliability Engineer will build and scale infrastructure for Assured's platform, automate delivery, enhance observability, and lead mentoring initiatives.

Top Skills: AWSKubernetesPostgresTerraform

Cresta

Senior Infrastructure Engineer/SRE

Reposted 21 Days AgoSaved

Remote

United States

205K-270K Annually

Senior level

205K-270K Annually

Senior level

Artificial Intelligence • Other • Sales • Software

The role involves designing and advancing infrastructure for the engineering team, ensuring the reliability of Kubernetes clusters, automating operations, and building machine learning infrastructure.

Top Skills: ArgoAWSAzureCloudFormationFluxGithub ActionsGoGCPKubernetesPostgresPythonTerraform

WEX Inc.

Senior Staff Site Reliability Engineer

Reposted 21 Days AgoSaved

In-Office or Remote

CA, USA

160K-179K Annually

Senior level

160K-179K Annually

Senior level

Fintech • Payments

The Senior Staff SRE leads reliability engineering initiatives, drives operational excellence, mentors staff, and influences architecture to enhance system reliability and performance.

Top Skills: Ai/MlAWSAzureDockerElk StackGCPGrafanaKubernetesMySQLNoSQLPostgresSplunk

Photon

SRE Architect | Onsite

22 Days AgoSaved

Remote

United States

Senior level

Agency • Information Technology

Lead SRE role designing and maintaining CI/CD pipelines (GitHub Actions), containerized deployments (Docker, Kubernetes, AKS, Helm), web/mobile app releases, observability, automated testing, and DevOps best practices across cloud environments with cross-functional collaboration and regulatory compliance.

Top Skills: AksAndroidAzure Application InsightsAzure Log AnalyticsAzure MonitorBashBranchingDockerDocker ComposeGitGit HooksGithub ActionsGoogle PlayHelmHerokuiOSIos App StoreJavaKubernetesNpmPowershellPull RequestsPythonSonarqubeVeracodeVercel

Veeam

Senior Site Reliability Engineer- FedRamp

4 Hours AgoSaved

Remote

United States

173K-321K Annually

Senior level

173K-321K Annually

Senior level

Cloud • Security • Software • Cybersecurity

Senior SRE to build and run Veeam's Government/Sovereign-cloud reliability practice. Responsibilities include mapping platform workloads, writing runbooks, defining SLIs/SLOs, designing HA on Azure Government, incident response and postmortems, closing observability gaps, automation and IaC in compliance-restricted environments, CI/CD/GitOps pipelines, on-call rotations, and cross-team collaboration and mentoring.

Top Skills: Api ManagementApplication InsightsArgocdArm TemplatesAWSAws CloudformationAws GovcloudAzureAzure DevopsAzure FunctionsAzure GovernmentAzure MonitorAzure StorageBitbucketC#Ci/CdCosmos DbDaggerElastic Stack (Elk)Entra IdFluxcdGitGithub ActionsGitlab CiGitopsGoGrafanaJavaJavaScriptKubernetesMicrosoft TfsOpentelemetryPrometheusPulumiServerless FrameworkTerraformTerragruntTypescript

Juul Labs

Senior Site Reliability Engineer

Reposted 20 Hours AgoSaved

Remote

United States of America

185K-227K Annually

Senior level

185K-227K Annually

Senior level

Other

The Senior Site Reliability Engineer at Juul Labs ensures operational stability and performance of hybrid cloud infrastructure, leads automation, and handles critical incidents.

Top Skills: AWSBashCloudFormationGCPNutanixPowershellPythonTerraform

Epic for kids

Senior Site Reliability Engineer

YesterdaySaved

Remote

160K-200K Annually

Senior level

160K-200K Annually

Senior level

Digital Media • Edtech

Drive reliability and observability of Epic's GCP-based platform. Own cloud infrastructure, container platform (Kubernetes/GKE), CI/CD, observability, and IaC (Terraform). Define SLOs/SLIs, reduce toil, manage security and compliance practices, participate in on-call rotations, lead incident response and post-mortems, and partner with product and data teams to troubleshoot and improve platform reliability.