Top Reliability Engineer Jobs in Los Angeles, CA

Reposted 7 Days AgoSaved
In-Office
Los Angeles, CA
130K-145K Annually
Mid level
130K-145K Annually
Mid level
Events
The Site Reliability Engineer II designs and maintains scalable systems, focusing on automation, monitoring, incident response, and collaboration with developers to enhance operational practices and efficiency.
Top Skills: BashCloud Service OperationsContainersContinuous DeliveryContinuous IntegrationGoInfrastructure As CodeOrchestration PlatformsPython
Reposted An Hour AgoSaved
Remote
Los Angeles, CA
113K-176K Annually
Senior level
113K-176K Annually
Senior level
Other • Social Impact
As a Senior Site Reliability Engineer, you will manage and improve Wikimedia's infrastructure, handle operational tasks, automate processes, and provide mentorship while participating in a 24/7 on-call rotation.
Top Skills: AnsibleBashDebianGoGrafanaHhvmKubernetesMemcachedPHPPrometheusPuppetPythonRedisRuby
Reposted An Hour AgoSaved
Remote
Los Angeles, CA
Senior level
Senior level
Artificial Intelligence • Fintech • Software • Financial Services
The SRE will own reliability for a cloud-native platform, optimizing performance, availability, and observability, while mentoring engineering teams.
Top Skills: AWSClickhouseGoKafkaKubernetesPulumiPythonTerraform
Reposted An Hour AgoSaved
Remote
Los Angeles, CA
82K-229K Annually
Senior level
82K-229K Annually
Senior level
Cloud • Software
Design, implement, and support Kubernetes and compute platforms in a private cloud. Oversee architecture and standardization across hardware, OS, and cloud orchestration.
Top Skills: AnsibleBashCi/CdHelmKubernetesLinuxOpenstackPythonTerraformUbuntu
Reposted An Hour AgoSaved
Remote
Los Angeles, CA
150K-200K Annually
Senior level
150K-200K Annually
Senior level
Cloud • Information Technology
As a Sr. Site Reliability Engineer, you'll ensure service reliability, build automation, and collaborate on infrastructure improvements while mentoring others.
Top Skills: AnsibleCatchpointDockerElkGoGrafanaHashicorp VaultJenkinsKubernetesLinuxPrometheusPythonTerraform
Reposted An Hour AgoSaved
Remote
Los Angeles, CA
173K-321K Annually
Senior level
173K-321K Annually
Senior level
Cloud • Security • Software • Cybersecurity
Design and maintain reliable infrastructure solutions for a cloud data protection platform. Ensure application scalability and support through CI/CD and monitoring tools while collaborating in a global team.
Top Skills: AppinsightsAws CloudformationAzure Api ManagementAzure Arm TemplatesAzure Cosmos DbAzure DevopsAzure Entra IdAzure FunctionsAzure MonitorAzure Storage ServicesBashBitbucketElastic StackGitGoMicrosoft TfsPowershellPythonServerless FrameworkTerraform
Reposted An Hour AgoSaved
Remote
Los Angeles, CA
150K-200K Annually
Mid level
150K-200K Annually
Mid level
Software
As a Senior Site Reliability Engineer at Regrello, you'll shape the developer platform, collaborate with customers, and ensure the reliability and security of infrastructure and applications.
Top Skills: AWSAzureCircleCIGCPGithub ActionsGitlab CiGoKubernetesTerraform
Reposted 10 Days AgoSaved
In-Office
Los Angeles, CA
160K-220K Annually
Senior level
160K-220K Annually
Senior level
Aerospace • Other
The Sr. Site Reliability Engineer at SpaceX is responsible for enhancing distributed systems, managing large data clusters, and ensuring software reliability on the Starlink project, focusing on customer experience and operational efficiency.
Top Skills: Apache KafkaC#FlinkGoHbaseHdfsIstioJavaKubernetesLinuxPythonScalaSpark
11 Days AgoSaved
In-Office
Los Angeles, CA
183K-235K Annually
Senior level
183K-235K Annually
Senior level
Artificial Intelligence • Machine Learning • Security • Software
The Senior Staff Site Reliability Engineer will be responsible for ensuring system reliability, debugging issues, mentoring the engineering team, and maintaining infrastructure and CI/CD pipelines.
Top Skills: AWSDatadogDockerGithub ActionsGrafanaHelmKotlinKubernetesPostgresPrometheusPythonRustTerraformTerragruntTypescript
Reposted YesterdaySaved
Remote
Los Angeles, CA
115K-135K Annually
Mid level
115K-135K Annually
Mid level
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills: ArgocdAWSElkGCPGoGrafanaIstioJaegerKubernetesLinkerdLokiOpentelemetryPrometheusPythonTempoTerraform
Reposted YesterdaySaved
Remote
Los Angeles, CA
Senior level
Senior level
Automotive
Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.
Top Skills: AWSBashCi/CdDockerElk StackGCPGrafanaKubernetesPrometheusPythonTerraform
Reposted YesterdaySaved
Remote
Los Angeles, CA
208K-330K Annually
Senior level
208K-330K Annually
Senior level
Fintech
The Staff Site Reliability Engineer role involves leading architecture, automating GCP environment, defining SLIs and SLOs, mentoring teammates, and enhancing system reliability and performance.
Top Skills: ArgocdDatadogGCPGoHelmJavaScriptKubernetesPythonTerraformTypescript
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 2 Days AgoSaved
Remote
Los Angeles, CA
Senior level
Senior level
Digital Media • Software • Sports
Seeking a Senior Site Reliability Engineer to enhance system reliability, performance, and scalability. Focus on automation, observability, and improving CI/CD practices while collaborating with engineering teams for better incident response and metrics improvement.
Top Skills: AWSAzureC++Ci/CdDatadogDockerElkGCPGoGrafanaJavaKubernetesLinuxPrometheusPythonTerraform
3 Days AgoSaved
Remote
Los Angeles, CA
114K-148K Annually
Senior level
114K-148K Annually
Senior level
Software • Financial Services
Ensure platform reliability, performance, and availability by implementing observability, automating infrastructure, participating in on-call rotations and post-mortems, partnering with Product and Engineering, designing scalable architectures, mentoring teammates, and integrating Dynatrace with Azure DevOps and Jira while supporting compliance (SOC/FedRAMP).
Top Skills: .NetAksAlpineAnsibleAppinsightsArm TemplatesAWSAzure DevopsBashBicepC#ChefCloudFormationDatadogDebianDynatraceEksGCPGitGitGksGrafanaHelmJIRAKubernetesLog AnalyticsAzureNew RelicOnestream SoftwareOpenshiftPowershellPowershell DscPrometheusPuppetPythonRest ApisSQLTerraformUbuntu
12 Days AgoSaved
In-Office
Los Angeles, CA
164K-270K Annually
Mid level
164K-270K Annually
Mid level
Aerospace • Hardware • Software • Defense • Manufacturing
As a Site Reliability Engineer, you'll ensure robotics system reliability, build telemetry integration, and develop tools for diagnostics and automation, collaborating with engineering teams for enhanced production reliability.
Top Skills: C++DatadogGoKubernetesOpentelemetryPrometheusPythonRos2TelegrafTypescript
3 Days AgoSaved
Remote
Los Angeles, CA
150K-200K Annually
Senior level
150K-200K Annually
Senior level
Artificial Intelligence • Blockchain • Information Technology • Consulting
Lead design and build of production-grade Azure infrastructure using Terraform, ensuring scalable, secure, and repeatable deployments. Provide technical leadership, platform enhancements, observability and incident response improvements, and Tier 2 infrastructure support while collaborating with engineering, security, and product teams to meet enterprise readiness and feature parity goals.
Top Skills: ArgoAzureGoGrafanaKubernetesPrometheusPythonSpaceliftTerraform
3 Days AgoSaved
Remote
Los Angeles, CA
143K-175K Annually
Mid level
143K-175K Annually
Mid level
Cloud • Security • Software • Generative AI
Design, build, and automate large-scale multi-cloud infrastructure and internal SRE tools. Improve host lifecycle, observability, alerting, and reliability; operate containerized workloads; participate in on-call rotations, incident response, runbooks, postmortems, code reviews, and mentoring.
Top Skills: AnsibleArgo CdArgo WorkflowsCueDockerElastic StackGoGraphiteInfluxKubernetesLinuxPrometheusPuppetTerraformUbuntuUbuntu Live Patch
Reposted 3 Days AgoSaved
Remote
Los Angeles, CA
160K-190K Annually
Senior level
160K-190K Annually
Senior level
Legal Tech • Software
As a Senior Site Reliability Engineer, you will lead reliability initiatives, design and maintain systems, enhance CI/CD pipelines, and mentor junior engineers while ensuring system availability and performance.
Top Skills: AWSBashCloudwatchEc2EksIamKubernetesLambdaPowershellPythonS3
3 Days AgoSaved
Remote
Los Angeles, CA
238K-288K Annually
Senior level
238K-288K Annually
Senior level
HR Tech • Software
Design, build, maintain, and operate Calendly's infrastructure platform with IaC and observability. Evaluate and deploy cloud-native tools, enable application teams on reliability practices, participate in on-call rotation, and mentor engineers while defining standards for incidents, capacity, and platform usage.
Top Skills: APIsCloud NetworkingControllers And OperatorsDatadogDistributed SystemsGCPGoInfrastructure As CodeKubernetesLinuxPython
3 Days AgoSaved
Remote
Los Angeles, CA
110K-140K Annually
Senior level
110K-140K Annually
Senior level
Real Estate • Financial Services • PropTech
Support and optimize products migrated to AWS, implement cloud best practices, maintain operational coverage, enhance automation, observability, CI/CD/GitOps, and security. Collaborate with development and platform teams to scale, troubleshoot, and ensure reliable SaaS operations.
Top Skills: AmisArgocdAWSAws Elastic BeanstalkAws Transfer FamilyAzure DevopsBashCloudwatchCurlDockerEc2EksFluxcdGitGitopsHTTPIstioKubernetesLinkerdLoad BalancerPowershellPythonRdsSQLTerraformWget
Reposted 3 Days AgoSaved
Remote
Los Angeles, CA
220K-250K Annually
Expert/Leader
220K-250K Annually
Expert/Leader
Cloud • Software • Database
Lead design, build, and operate the YugabyteDB DBaaS infrastructure. Drive architecture, automate lifecycle and maintenance, manage incidents and on-call rotations, implement security/encryption processes, and optimize reliability using SRE principles and observability.
Top Skills: AksAnsibleAWSAzureBashDockerEksGCPGitGithub ActionsGkeJavaKubernetesLinuxPostgresPrometheusPythonShellTerraform
5 Days AgoSaved
In-Office or Remote
Los Angeles, CA
95K-171K Annually
Junior
95K-171K Annually
Junior
Cloud • Security • Software • Cybersecurity
The Site Reliability Engineer II - Database ensures the integrity, security, and performance of MySQL databases while collaborating with development and operations teams to address database issues and improve reliability.
Top Skills: MySQLSQL
5 Days AgoSaved
Remote
Los Angeles, CA
Senior level
Senior level
Artificial Intelligence • Information Technology • Software • Database
As a Site Reliability Engineer, you will design, implement, and maintain scalable infrastructure, ensure system reliability, automate processes, and collaborate with engineering teams.
Top Skills: DockerElk StackGoGrafanaJavaKubernetesNode.jsPrometheusPulumiPythonRubyTerraform
5 Days AgoSaved
Remote
Los Angeles, CA
180K-233K Annually
Expert/Leader
180K-233K Annually
Expert/Leader
Cloud • Security • Software • Generative AI
The role involves designing, building, and automating network infrastructure for Elastic's global services, focusing on reliability and operational excellence while enhancing customer experience through proactive problem management.
Top Skills: AnsibleBgpDnsDockerElastic StackGoKubernetesTerraform
5 Days AgoSaved
Remote or Hybrid
Los Angeles, CA
160K-180K Annually
Senior level
160K-180K Annually
Senior level
Artificial Intelligence • Machine Learning • Software • Analytics
The role involves end-to-end ownership of AWS infrastructure, managing Kubernetes platforms, and ensuring system reliability through observability and automation. Responsibilities include incident response and maintaining CI/CD systems.
Top Skills: ArgocdAWSDatadogGitGoKubernetesPythonTerraform
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account