Get the job you really want.
Top Reliability Engineer Jobs in Los Angeles, CA
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
The Senior Site Reliability Engineer collaborates on building cloud-native infrastructure, improves monitoring, and ensures high uptime, with substantial AWS and Kubernetes expertise.
Top Skills:
AWSBashGoHelmKubernetesPythonRubyTerraform
Virtual Reality
As a Staff Network Reliability Engineer, you will design automated solutions, maintain hybrid infrastructure, and ensure service reliability through collaboration and mentoring.
Top Skills:
AnsibleAWSBashCi/CdDockerGCPGit ActionsGoJenkinsKubernetesPanoramaPythonTerraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will build and maintain secure system architectures for various client platforms and manage DevOps tooling.
Top Skills:
AnsibleAutopkgAWSAzureGCPGoMicromdmMunkiNanomdmPuppetPythonRubyTerraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
As a Senior Site Reliability Engineer, you will deploy, manage, and optimize AI tools, ensure system reliability, and collaborate on AI solutions while maintaining compliance and security standards.
Top Skills:
Aws,Gcp,Python,Java,Go,Ansible,Terrraform,Bash
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Staff Site Reliability Engineer at Coinbase will improve system reliability, mentor engineers, automate processes, and oversee software integrity, focusing on high-quality coding and performance tuning.
Top Skills:
AWSAzureDatadogDockerEc2GCPGoKibanaKubernetesRubyTerraform
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will maintain system stability, automate processes, handle incidents, and enhance reliability while collaborating with dev teams.
Top Skills:
AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
As a Principal SRE, you will drive reliability practices for the Identity Security Cloud platform, coach teams, manage architecture and capacity, and provide technical leadership while improving operational excellence.
Top Skills:
AWSGoGrafanaHoneycombJavaKibanaKubernetesPrometheusPythonTerraform
AdTech • Cloud • Information Technology • Marketing Tech • Software
Lead the design and implementation of reliability strategies for SMS infrastructure, focusing on automation, performance tuning, and collaboration with cross-functional teams.
Top Skills:
AnsibleAsteriskAWSAzureDockerElasticsearchGCPGitGitlabHaproxyInterconnectsJenkinsK8SKannelLinuxMySQLNatNginxOpensipsRestRtpSipSmsSngrepSnmpSpring BootTerraformTomcatVpnWafsWireshark
Featured Jobs
Aerospace • Artificial Intelligence • Hardware • Machine Learning • Software • Defense
As a Senior DevOps Engineer at True Anomaly, you will design and manage cloud-based solutions, establish continuous integration and deployment processes, and ensure optimal application deployment using Kubernetes. You will collaborate closely with security teams, troubleshoot software performance, and manage cloud resources with Terraform, all while enhancing our spacecraft simulation technology.
Big Data • Software
The Performance & Reliability Engineer will enhance the performance and reliability of Aerospike's graph database through collaboration, workload design, performance tuning, and documentation.
Top Skills:
AnsibleC++EbpfGoGrafanaJavaKubernetesLinuxNoSQLPrometheusTerraform
Cloud • Information Technology • Software
The Database Reliability Engineer will optimize MongoDB clusters, ensure database security, implement monitoring, and automate database operations while collaborating with DevOps teams.
Top Skills:
AnsibleElasticMongoDBPython
Big Data • Software
The Performance & Reliability Engineer will enhance Aerospike's graph database performance, troubleshoot issues, collaborate with R&D, and document techniques while interacting with customers.
Top Skills:
AnsibleC++CentosCoreosEbpfGoGrafanaJavaKubernetesPrometheusRedhatTerraformUnix/Linux
Artificial Intelligence • Enterprise Web • Machine Learning • Natural Language Processing • Software • Conversational AI • Automation
As a Site Reliability Engineer, you'll enhance infrastructure security, automate deployments, optimize CI/CD processes, and drive engineering best practices while ensuring compliance and observability.
Top Skills:
Aws CloudElasticsearchGoJavaScriptMongoDBNode.jsReactRedisTerraform
Big Data • Cloud • Software • Database
Seeking a Senior Site Reliability Engineer to support and maintain the MongoDB Atlas platform, focusing on automation, system design, and operational excellence.
Top Skills:
AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
Marketing Tech • Sales • Software
Lead the evolution of Demandbase's database infrastructure, ensuring reliability and scalability while automating processes, mentoring a global team, and collaborating across departments for optimal performance.
Top Skills:
Aws RdsBigQueryDatadogDockerDynamoDBGitlab Ci/CdGrafanaKubernetesMySQLPostgresPrometheusPythonShell ScriptingTerraform
Real Estate • Travel • PropTech
The Sr. Staff Engineer will drive the Reliability strategy at Airbnb, ensuring infrastructure performance, creating incident management processes, and mentoring SRE teams.
Top Skills:
Cloud SystemsCodingDesign PatternsIncident ManagementSoftware EngineeringSystem Architecture
Reposted 22 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
As a Staff Software Engineer in SRE, you will design and enhance backend systems, ensuring reliability and operational excellence while developing a culture of quality and mentorship within the team.
Top Skills:
AWSKotlinKubernetesMySQLPythonSpark
Aerospace • Other
The Lead Software Engineer will design and improve software solutions for reliable vehicle manufacturing, manage a team of engineers, and ensure software quality across SpaceX's build processes.
Top Skills:
.NetAngularC#GoJavaPostgresPythonReactSQL Server
Artificial Intelligence • Software
The Platform Reliability Engineer will manage and optimize the technical infrastructure, ensuring reliability, performance, and scalability, while supporting DevOps initiatives.
Top Skills:
AWSBugsnagCloudflareCloudFormationDatadogElasticsearchMongoDBNew RelicRedisTerraform
eCommerce • Gaming
As a Staff Software Engineer in Infrastructure Reliability, you will enhance system performance and scalability, focus on automation, and lead engineering projects.
Top Skills:
AWSCloudFormationDockerElk StackGCPGitGoGrafanaHarnessJavaJenkinsJSONKubernetesPrometheusPythonTerraformYaml
Information Technology • Software
As a Site Reliability Engineer, you will handle technical escalations, ensure system reliability, collaborate with engineering teams, and participate in on-call rotations to support global operations.
Top Skills:
AnsibleAzureBashC#ChefElkGitGithub ActionsGitlabGrafanaJenkinsLinuxPrometheusPulumiPythonSplunkSvnTerraform
Information Technology • Software
As a Site Reliability Engineer at Redis, you'll ensure system reliability, troubleshoot technical escalations, collaborate with engineering teams, and participate in on-call rotations. You'll work on large-scale systems and develop automation tools to enhance the Redis database's stability.
Top Skills:
AnsibleAzureBashC#ChefElkGitGithub ActionsGitlabGrafanaJenkinsLinuxPrometheusPulumiPythonRedisSplunkSvnTerraform
Music
As a Senior Site Reliability Engineer, you will maintain infrastructure, automate deployments, mentor teams, handle incidents, and enhance system reliability.
Top Skills:
ArgocdAWSDatadogElasticsearchGithub ActionsGraphQLHashicorp VaultKafkaKubernetesMemcachedMySQLPostgresPythonRedisRest ApiSentryShellTerraform
Cloud • Information Technology
Lead and expand a Production SRE team, enhance infrastructure reliability, implement network automation, and shape SRE practices within the organization.
Top Skills:
AnsibleEnvoyExpressGitGoHaproxyJavaScriptJenkinsKafkaMySQLNapalmNode.jsPostgresPythonReactRedisSaltstack
Software
As a Senior Site Reliability Engineer, you will enhance the developer platform, improve reliability, and collaborate with internal and external teams on complex infrastructure solutions.
Top Skills:
AWSAzureCeleryCircleCIGCPGithub ActionsGitlab CiGoGrafanaHelmKubernetesLaunchdarklyMachineryPrometheusSplunkTemporalTerraform
Popular Job Searches
All Filters
Total selected ()
No Results
No Results