Get the job you really want.

Top Reliability Engineer Jobs in Los Angeles, CA

Reposted 22 Hours AgoSaved
Remote
Los Angeles, CA
109K-169K
Senior level
109K-169K
Senior level
Other • Social Impact
The Senior Site Reliability Engineer will design and maintain infrastructure, ensure system reliability, participate in on-call rotations, and mentor peers in a collaborative remote environment.
Top Skills: AnsibleDockerGerritGitlabKubernetesMediawikiPuppetPythonSpicerackTerraform
Reposted 22 Hours AgoSaved
Remote
Los Angeles, CA
157K-185K Annually
Senior level
157K-185K Annually
Senior level
Blockchain • Fintech • Internet of Things • Cryptocurrency • Web3
As a Senior Site Reliability Engineer, you'll design and operate cloud infrastructure, manage Kubernetes environments, implement Infrastructure as Code, and automate processes to ensure reliability and performance.
Top Skills: Amazon RdsAuroraAWSBashCdkCrossplaneGoKubernetesPythonTerraform
Reposted 24 Days AgoSaved
Remote
Los Angeles, CA
130K-144K
Mid level
130K-144K
Mid level
Information Technology
As a Database Reliability Engineer, you will maintain and improve PostgreSQL infrastructure, resolve production incidents, collaborate with developers, and implement infrastructure as code.
Top Skills: Ci/CdGithub ActionsGrafanaMySQLPostgresPrometheusSaltSQLTerraform
2 Days AgoSaved
Remote
Los Angeles, CA
176K-207K Annually
Senior level
176K-207K Annually
Senior level
Security • Cybersecurity
Lead initiatives for reliability and operational excellence, mentor engineers, and define goals to improve system reliability and productivity.
Top Skills: AWSAzureDatadogGCPGoGrafanaKubernetesPrometheusPythonTerraform
Reposted 2 Days AgoSaved
Remote
Los Angeles, CA
Mid level
Mid level
Blockchain • Software
As a Site Reliability Engineer at Offchain Labs, you will manage infrastructure in cloud environments, design CI/CD workflows, and enhance system reliability with a focus on blockchain technology.
Top Skills: ArgocdAWSAzureCodebuildGCPGithub ActionsGoGrafanaKubernetesLokiPrometheusPythonTerraform
Reposted 2 Days AgoSaved
Remote or Hybrid
Los Angeles, CA
166K-201K Annually
Senior level
166K-201K Annually
Senior level
Cloud • Greentech • Other • Energy
As a Senior Site Reliability Engineer, you'll optimize virtualization and kernel-level performance for AI workloads, develop automation tools, and support compute infrastructure, ensuring scalability and reliability.
Top Skills: CCi/CdGoInfrastructure As CodeKvmLinuxQemuRust
3 Days AgoSaved
Remote
Los Angeles, CA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Design, develop, and maintain machine learning infrastructure while enhancing reliability and scalability, mentoring team members and collaborating across teams.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
4 Days AgoSaved
Remote
Los Angeles, CA
Senior level
Senior level
Generative AI
The Senior Site Reliability Engineer will enhance system reliability, manage cloud infrastructure, and enforce best SRE practices while mentoring juniors.
Top Skills: AWSElk StackGrafanaKubernetesTerraform
Reposted 13 Days AgoSaved
In-Office
Los Angeles, CA
120K-200K Annually
Senior level
120K-200K Annually
Senior level
Software
The Site Reliability Engineer will enhance system reliability, improve tooling, oversee incident processes, and collaborate on software maintenance across distributed systems.
Top Skills: ClickhouseGrpcKafkaMongoDBNoSQLPostgresRedpanda
Reposted 6 Days AgoSaved
Remote
Los Angeles, CA
Senior level
Senior level
Automotive • Software
The Senior Site Reliability Engineer will optimize platform reliability, manage Kubernetes production clusters, deploy monitoring solutions, collaborate on resource optimization, and participate in on-call rotations.
Top Skills: AndroidArgocdAWSCircleCIDockerGCPGitGoGrafanaKafkaKubernetesLokiNew RelicObjective-COpentelemetryPostgresPrometheusPythonReact/ReduxRedisRedshiftRuby On RailsSentrySwiftTerraformThanos
7 Days AgoSaved
In-Office or Remote
Los Angeles, CA
145K-145K
Senior level
145K-145K
Senior level
Information Technology • Software
As a Senior Site Reliability Engineer, you'll ensure the reliability, performance, and scalability of Ditto's cloud infrastructure, lead incident management, and improve system resilience.
Top Skills: AWSAzureCDatadogGCPGoGrafanaHelmJavaKubernetesPrometheusRustTerraform
Reposted 7 Days AgoSaved
In-Office or Remote
Los Angeles, CA
224K-426K
Expert/Leader
224K-426K
Expert/Leader
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design, build, and maintain large-scale production systems for machine learning applications, ensuring reliability, scalability, and efficiency.
Top Skills: Ci/CdElkGithub ActionsGoJenkinsKafkaKubernetesOpenstackPerlPrometheusPythonRubySpark
New

Track Smarter, Apply Better.

Ditch the spreadsheets. Organize your job search with our freeApplication Tracker.

Use For Free
Application Tracker Preview
Reposted 7 Days AgoSaved
In-Office or Remote
Los Angeles, CA
Senior level
Senior level
Artificial Intelligence • Information Technology • Consulting
As a Senior Site Reliability Engineer, you will enhance the reliability and performance of our inference platform, leveraging Kubernetes and Terraform while ensuring smooth scalability of systems under load.
Top Skills: BashGrafanaKubernetesMlopsPrometheusPythonRayTerraformTritonVllm
Reposted 8 Days AgoSaved
Remote or Hybrid
Los Angeles, CA
204K-247K Annually
Senior level
204K-247K Annually
Senior level
Cloud • Greentech • Other • Energy
As a Staff Site Reliability Engineer focused on storage, you'll ensure the reliability and performance of cloud storage systems while optimizing distributed, fault-tolerant architectures for AI workloads.
Top Skills: AnsibleCCephDockerGlusterfsGoIscsiJavaKubernetesNfsNvme-OfOpenebsPuppetPythonSmbTerraform
Reposted 8 Days AgoSaved
Remote
Los Angeles, CA
Mid level
Mid level
Blockchain • Information Technology • Internet of Things
The Site Reliability Engineer will ensure system reliability, security, and performance by implementing infrastructure as code, CI/CD, and monitoring solutions.
Top Skills: AWSAzureBashGCPGoKubernetesPythonRustTerraform
9 Days AgoSaved
Remote
Los Angeles, CA
100K-720K
Mid level
100K-720K
Mid level
News + Entertainment
As a Site Reliability Engineer at Netflix, you'll enhance gaming platform reliability, manage incidents, build detection tools, and improve operational excellence.
Top Skills: AWSGCPGoJavaJavaScriptLinuxNode.jsPrestoPythonSpark SqlTrinoUnix
Reposted 11 Days AgoSaved
Remote
Los Angeles, CA
200K-250K
Senior level
200K-250K
Senior level
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills: Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform
Reposted 11 Days AgoSaved
Remote or Hybrid
Los Angeles, CA
93K-112K
Mid level
93K-112K
Mid level
Cloud • Security • Software
As a Site Reliability Engineer II, you will architect, deploy, and maintain resilient infrastructure on AWS, develop deployment pipelines, and manage performance issues across distributed systems.
Top Skills: AWSCloudwatchDockerGitGrafanaJenkinsKubernetesNew RelicPuppetTerraform
Reposted 11 Days AgoSaved
In-Office or Remote
Los Angeles, CA
119K-161K Annually
Mid level
119K-161K Annually
Mid level
Big Data • Cloud • Marketing Tech • Social Impact • Software
The Site Reliability Engineer will support global product deployments, provide 24/7 engineering support, enhance CI/CD tooling, and ensure security compliance.
Top Skills: AWSCircleCIGCPGoJenkinsKubernetesPythonTerraform
Reposted 12 Days AgoSaved
Remote
Los Angeles, CA
156K-240K
Senior level
156K-240K
Senior level
Artificial Intelligence • Marketing Tech • Mobile • Software
Design and implement solutions for platform reliability and scalability, lead cross-team projects, and mentor team members while ensuring operational excellence.
Top Skills: AirflowAWSCloudflareDatadogDynamoDBEsbuildGradleGraphQLHelmHuggingfaceIstioJavaKinesisKubernetesMetaflowPandasPlanetscalePlaywrightPostgresPythonPyTorchRadix UiReactRedisSpring BootStorybookTensorFlowTerraformTypescriptVite
23 Days AgoSaved
In-Office
Los Angeles, CA
160K-220K
Mid level
160K-220K
Mid level
Aerospace • Other
The Lead Software Engineer will manage software development processes, enhance application performance, mentor engineers, and ensure reliable software solutions for SpaceX's build operations.
Top Skills: .NetAngularC#GoJavaPostgresPythonReactSQL Server
13 Days AgoSaved
Remote
Los Angeles, CA
129K-201K
Senior level
129K-201K
Senior level
Other • Social Impact
Design, develop, maintain, and scale machine learning infrastructure. Collaborate with teams to improve the reliability and performance of ML systems while supporting engineers and researchers.
Top Skills: AnsibleArgo CdDockerElk StackGpu AccelerationGrafanaHelmKubernetesMachine LearningPrometheusPythonPyTorchScikit-LearnTensorFlowTerraform
Reposted 13 Days AgoSaved
Remote
Los Angeles, CA
Senior level
Senior level
Healthtech
This role ensures the reliability and performance of cloud-native platforms, focusing on system design, incident response, and collaboration with various teams.
Top Skills: AWSBashCi/CdCloudwatchDatadogGoHelmKubernetesPrometheusPythonTerraform
14 Days AgoSaved
Remote
Los Angeles, CA
103K-136K Annually
Entry level
103K-136K Annually
Entry level
Cloud • Security • Software • Generative AI
The Site Reliability Engineer I will automate engineering efforts, improve platform reliability, and ensure customer satisfaction while managing cloud infrastructure and responding to incidents.
Top Skills: DockerElastic StackGoGraphiteInfluxKubernetesLinuxPrometheusTerraform
Reposted 23 Days AgoSaved
Remote
Los Angeles, CA
244K-304K Annually
Senior level
244K-304K Annually
Senior level
Real Estate • Travel • PropTech
The Senior Staff Software Engineer will drive the development of a reliability strategy, enhance infrastructure performance, and mentor SRE teams.
Top Skills: Cloud PlatformsHigh-Availability SystemsIncident Management ProcessesSoftware Engineering Practices
All Filters
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account