Top Reliability Engineer Jobs in Los Angeles, CA
Cloud • Fintech • Cryptocurrency • NFT • Web3
The Staff Site Reliability Engineer at Coinbase will improve system reliability, mentor engineers, automate processes, and oversee software integrity, focusing on high-quality coding and performance tuning.
Top Skills:
AWS, Azure, Datadog, Docker, Ec2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
Artificial Intelligence • Machine Learning • Security • Software
As a Site Reliability Engineer, you'll manage deployments, ensure system reliability, handle on-call incidents, and collaborate with teams to optimize performance.
Top Skills:
AWS, Bash, Docker, Grafana, Kubernetes, Prometheus, Python, Terraform
Cloud • Greentech • Other • Energy
As a Site Reliability Engineer II on the Observability team, you'll manage and improve observability stacks, support engineering teams with monitoring, develop new tools, and analyze system performance for enhanced reliability.
Top Skills:
Ansible, CircleCI, Cloud Formation, Docker, Github Actions, Gitlab Ci/Cd, Go, Kubernetes, Python, Terraform
Cloud • Information Technology • Software
The Database Reliability Engineer will optimize MongoDB clusters, ensure database security, implement monitoring, and automate database operations while collaborating with DevOps teams.
Top Skills:
Ansible, Elastic, MongoDB, Python
Artificial Intelligence • Enterprise Web • Machine Learning • Natural Language Processing • Software • Conversational AI • Automation
As a Site Reliability Engineer, you'll enhance infrastructure security, automate deployments, optimize CI/CD processes, and drive engineering best practices while ensuring compliance and observability.
Top Skills:
Aws Cloud, Elasticsearch, Go, JavaScript, MongoDB, Node.js, React, Redis, Terraform
Marketing Tech • Sales • Software
Lead the evolution of Demandbase's database infrastructure, ensuring reliability and scalability while automating processes, mentoring a global team, and collaborating across departments for optimal performance.
Top Skills:
Aws Rds, BigQuery, Datadog, Docker, DynamoDB, Gitlab Ci/Cd, Grafana, Kubernetes, MySQL, Postgres, Prometheus, Python, Shell Scripting, Terraform
Real Estate • Travel • PropTech
The Sr. Staff Engineer will drive the Reliability strategy at Airbnb, ensuring infrastructure performance, creating incident management processes, and mentoring SRE teams.
Top Skills:
Cloud Systems, Coding, Design Patterns, Incident Management, Software Engineering, System Architecture
Fintech • Software
The Senior Database Reliability Engineer ensures reliable database deployments, mentors junior staff, and manages performance and security of database systems, particularly with MongoDB and PostgreSQL.
Top Skills:
Ansible, Bash, Docker, Jenkins, Kubernetes, MongoDB, Postgres, Python, SQL Server, Terraform
Featured Jobs
Generative AI
The Senior Site Reliability Engineer at Stability AI will enhance and manage cloud infrastructure, enforce SRE best practices, architect scalable systems, and drive incident management. Responsibilities include collaborating with development teams, implementing infrastructure as code, and mentoring junior team members.
Information Technology • Cryptocurrency
The Head of SRE will lead the SRE team, defining strategy, ensuring system reliability, and driving operational excellence while mentoring staff.
Top Skills:
Argocd, Bash, Elk Stack, GCP, Go, Grafana, Helm, Kubernetes, Prometheus, Python, Terraform
Healthtech
The Sr. Site Reliability Engineer at Synapse Health focuses on building scalable systems in an Azure environment, automating processes, and ensuring infrastructure reliability while collaborating with software development teams.
Top Skills:
Ansible, Azure, Azure Devops, Bash, Datadog, Go, Grafana, Kubernetes, New Relic, Prometheus, Python, Terraform
Legal Tech • Software
As a Site Reliability Engineer, you will develop autonomous systems, improve CI/CD processes, mentor junior engineers, and ensure reliable software operations.
Top Skills:
Artificial Intelligence, Ci/Cd, Cloud-Based Workflow Tools, Internet Scale Applications, Machine Learning
Big Data
This role involves managing AWS infrastructure, debugging applications, developing internal tools, and automating deployment processes for Metabase Cloud.
Top Skills:
AWS, Datadog, Go, Grafana, Kubernetes, Prometheus, Python, Terraform
Blockchain • Software
The Site Reliability Engineer Lead will oversee build and deployment cycles, improve automation tools, and ensure highly available systems, while leading a diverse team in a remote setting.
Top Skills:
Ansible, AWS, Azure, Bash, Buildkite, Chef, Datadog, Docker, GCP, Git, Github Actions, Gitlab Ci, Go, Grafana, Helm, Jenkins, Kubernetes, Linux, Loki, Opentelemetry, Prometheus, Pulumi, Python, Rust, Saltstack, Terraform
Artificial Intelligence • Software
As a Senior Staff Site Reliability Engineer, you will enhance system reliability and performance, lead incident management, analyze capacity planning, and mentor junior engineers.
Top Skills:
Ansible, AWS, Azure, Bash, Chef, CircleCI, CloudFormation, Datadog, Docker, Elk, GCP, Gitlab, Go, Grafana, Jenkins, Kubernetes, Prometheus, Puppet, Python, Terraform
Edtech • Information Technology • Other
As a Senior SRE/DevOps Engineer, you will design and maintain infrastructure for a learning platform, migrate services, and foster a DevOps culture.
Top Skills:
Microsoft Stack, Saas, Web Products, Cloud Platform Resources, Azure, Gcp, Aws, Kubernetes
Aerospace • Other
The Site Reliability Engineer will manage Kubernetes clusters, ensure reliability and security, and collaborate with teams for automation and deployment in a fast-paced environment.
Top Skills:
Cloud-Based Technologies, Gitops, Infrastructure As Code, Kubernetes, Linux
Aerospace • Other
The Full Stack Software Engineer will design, develop, and improve software solutions for build reliability at SpaceX, engaging with engineers to ensure high-quality software delivery.
Top Skills:
.Net, Angularjs, C#, CSS, HTML, Postgres, Python, SQL, Typescript
Aerospace • Other
The Lead Software Engineer will design and improve software solutions for reliable vehicle manufacturing, manage a team of engineers, and ensure software quality across SpaceX's build processes.
Top Skills:
.Net, Angular, C#, Go, Java, Postgres, Python, React, SQL Server
Aerospace • Other
The Site Reliability Engineer will operate and scale mission-critical software products for engineering and launch, collaborating with software engineers and managing infrastructure as code to ensure software delivery meets high standards.
Top Skills:
Ansible, Docker, Git, Kubernetes, Kvm, Linux, Puppet, Python, Terraform, Virtualbox
Aerospace • Other
The Site Reliability Engineer will upgrade distributed systems, manage compute clusters, and enhance deployment and monitoring infrastructure while collaborating with engineers across programs at SpaceX.
Top Skills:
Apache Kafka, C#, Flink, Go, Hbase, Hdfs, Istio, Java, Kubernetes, Linux, Python, Scala, Spark
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design and maintain large scale production systems focusing on reliability and automation, while supporting services through monitoring and incident management.
Top Skills:
Docker, Go, Kubernetes, Linux, Openstack, Perl, Python, Ruby
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
Design, build, and maintain large scale production systems focusing on reliability and performance using software and systems engineering practices.
Top Skills:
Docker, Go, Kubernetes, Linux, Networking, Openstack, Perl, Python, Ruby
Artificial Intelligence • Software • Generative AI
The site reliability engineer will enhance and maintain Writer’s cloud infrastructure, ensuring reliability, scalability, and security while mentoring junior engineers.
Top Skills:
AWS, Azure, Docker, Elk Stack, GCP, Go, Grafana, Java, Kubernetes, Prometheus, Python, Terraform
Cloud • Information Technology • Productivity • Software • Automation
As a Senior Site Reliability Engineer, you will enhance system scalability and reliability, automate infrastructure, mentor engineers, and collaborate on product features development.
Top Skills:
Ansible, AWS, Cloud Formation, New Relic, Python, Splunk, Terraform