Top Reliability Engineer Jobs in Los Angeles, CA
Software
Own reliability, performance, and scalability of PostgreSQL infrastructure. Implement HA, replication, observability, capacity planning, automation, and DR. Support engineering teams with migrations, query optimization, on-call incident response, runbooks, and tooling to enable safe DB operations.
Top Skills:
Ansible, Aurora, AWS RDS, Chef, Datadog, DynamoDB, ElastiCache, Go, Grafana, Indexing, MVCC, Patroni, PgBouncer, Postgres, Prometheus, Python, Query Planner, Replication, Ruby, SQL, Terraform, Vacuum Tuning, WAL
Aerospace • Other
The Sr. IT Linux Site Reliability Engineer will manage and optimize Kubernetes clusters, automate systems, and collaborate with teams to ensure system resilience and performance.
Top Skills:
Ansible, Docker, Go, Grafana, Kubernetes, Linux, Prometheus, Python, Terraform
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills:
AWS, Azure, DNS, GCP, Go, HTTP, Linux, Python, Ruby, TLS
Fintech • Financial Services
The Systems Reliability Engineer supports MEMX exchange platforms by responding to incidents, debugging issues, improving processes, and working with cross-functional teams to ensure platform availability.
Top Skills:
Ansible, Bash, Chef, Linux, Linux Shell, Monitoring Tools, Puppet, Python
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Principal Software Engineer on the SRE team, lead best practices adoption, mentor engineers, and improve system reliability and user experience through automation and collaboration.
Top Skills:
CDK, CloudFormation, Datadog, Go, JavaScript, Prometheus, Python, Terraform, TypeScript
eCommerce • Healthtech • Kids + Family • Retail • Social Media
Seeking a Senior Software Engineer, Site Reliability to ensure system stability, scalability, and reliability, while optimizing AWS infrastructure using modern DevOps practices and tools like Terraform, Docker, and Kubernetes.
Top Skills:
AWS, CircleCI, Cronitor, Datadog, Docker, GitHub Actions, Jenkins, Kubernetes, MySQL, PagerDuty, React, Redis, Ruby on Rails, Sentry, Sidekiq, Terraform
eCommerce • Retail • Software
The Senior Database Reliability Engineer ensures database availability, reliability, and efficiency, driving initiatives for upgrades, automation, and security while mentoring team members.
Top Skills:
AWS, DynamoDB, Elasticsearch, MongoDB, MySQL, Postgres, PowerShell, Python, Redis, SQL Server
Fitness • Healthtech • Retail • Pharmaceutical
The role focuses on metrics and observability in Site Reliability Engineering, enhancing monitoring practices, managing error budgets, and automating quality gates in release processes.
Top Skills:
AppDynamics, AWS, Azure, Datadog, Docker, ELK Stack, GCP, Grafana, Kubernetes, OTel, Power BI, Prometheus, SQL Server, UiPath
Food
The Reliability Engineer will manage maintenance of fixed assets, focusing on equipment reliability, predictive maintenance, and collaboration to reduce downtime and improve performance metrics of packaging operations.
Top Skills:
Automation Equipment, Thermoforming Packaging Machines, TPM
Legal Tech • Software
As a Senior Site Reliability Engineer, you will lead reliability initiatives, design and maintain systems, enhance CI/CD pipelines, and mentor junior engineers while ensuring system availability and performance.
Top Skills:
AWS, Bash, CloudWatch, EC2, EKS, IAM, Kubernetes, Lambda, PowerShell, Python, S3
Cloud • Information Technology
As a Staff Platform Engineer, you'll develop and maintain infrastructure components using Go and Node.js, improve service reliability, mentor juniors, and manage data ecosystems.
Top Skills:
Envoy, Express, Go, Jenkins, Kafka, MySQL, Node.js, Postgres, Puppet, Python, React, Redis
Sports
Manage and improve the AWS infrastructure, deploy into new regions, monitor releases, and implement new technologies in a fast-paced environment.
Top Skills:
AWS, Docker, Grafana, Kubernetes, Prometheus, Python
Cloud • Security • Software
As a Site Reliability Engineer, you'll design and optimize cloud infrastructure, automate compliance, manage Kubernetes, and maintain reliability in regulated environments.
Top Skills:
CI/CD Pipelines, Docker, Google Cloud Platform (GCP), Kubernetes, NIST 800-53
Software • Analytics
The role involves automating and managing AWS infrastructure, ensuring reliability and scalability of stateful systems, and optimizing deployment processes. You'll also handle incident responses and improve operational tooling.
Top Skills:
AWS, Kubernetes, Terraform, Terragrunt
Cloud • Security • Software • Cybersecurity
As a Site Reliability Engineer, you will troubleshoot production issues, automate systems, define database requirements, and collaborate with Dev and QA teams for stability.
Top Skills:
Ansible, Cassandra, Chef, NoSQL, Python, Redis
Cloud • Security • Cybersecurity
This role involves leading complex initiatives in site reliability, developing automation, maintaining IaC, overseeing IAM, and mentoring junior engineers.
Top Skills:
Ansible, AWS, Azure, CI/CD, Docker, GCP, Kubernetes, Terraform
AdTech • Marketing Tech • Analytics
Manage and support customer applications, improve system reliability, collaborate with teams on infrastructure needs, and help drive architectural decisions.
Top Skills:
Auto Scaling, AWS, CDNs, Datadog, DNS, Docker, Kafka, Kibana, Kubernetes, Linux, Load Balancers, Postgres, Proxy Servers, Python, RDS, Redshift, Shell/Bash, Spark, Terraform, WAFs
Blockchain • Financial Services • Cryptocurrency • Web3
As a Senior Site Reliability Engineer at Kraken, you will manage and support infrastructure, improve CI/CD pipelines, and ensure the reliability and performance of systems supporting growth initiatives.
Top Skills:
Bash, Consul, Docker, GitLab, Go, Grafana, HashiStack, Kubernetes, Nomad, NoSQL, Prometheus, Python, SQL, Terraform, Vault
Marketing Tech • Mobile • Software
The Site Reliability Engineer will scale infrastructure, automate deployments, improve reliability, and enhance operational tools to support high-scale messaging.
Top Skills:
Bash, Ember, GCP, Go, MySQL, React, Terraform
Blockchain • Software
As a Senior Site Reliability Engineer, you will support monitoring services, improve infrastructure reliability, manage deployments, debug issues, and foster a strong communication culture within a distributed team, all while focusing on blockchain technology.
Top Skills:
AWS, CI/CD, Kafka, Kubernetes, Postgres
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills:
Ansible, AWS, Bash, CloudTrail, CloudWatch, Cosmos, Docker, ELK Stack, Ethereum, GCP, K8s, Kubernetes, Opsgenie, Pingdom, Python, Terraform
Security • Cybersecurity
Design and maintain infrastructure using Terraform on AWS, develop CI/CD pipelines, collaborate on service reliability, and mentor junior engineers.
Top Skills:
AWS, Bash, CI/CD, CloudWatch, Datadog, Docker, ECS, Git, Go, Grafana, IAM, JavaScript, Kotlin, Lambda, Prometheus, Python, RDS, Ruby, Terraform, VPC
Security • Cybersecurity
The Staff Site Reliability Engineer will lead reliability strategy, architecture, and incident response while mentoring engineers and improving operational excellence.
Top Skills:
AWS, CI/CD, GitHub Actions, JavaScript, Python, Ruby, Terraform
Big Data
You will manage AWS infrastructure, automate deployments, debug application issues, and improve the operational health of Metabase Cloud.
Top Skills:
AWS, Datadog, Go, Grafana, Kubernetes, Prometheus, Python, Terraform
Information Technology • Security • Cybersecurity
The Staff/Principal Site Reliability Engineer leads infrastructure initiatives, architects solutions for cloud and SaaS, and collaborates cross-functionally to enhance reliability and innovation.
Top Skills:
AWS, Bash, Bazel, Cuelang, Datadog, GitOps, Go, Grafana, Helm, Kubernetes, Linux, Prometheus, Python, Terraform