Get the job you really want.
Maximum of 25 job preferences reached.
Top Reliability Engineer Jobs in Los Angeles, CA
Aerospace • Hardware • Robotics • Software • Manufacturing
Responsible for improving designs, ensuring hardware meets requirements, engaging with teams for design optimization, and leading reviews and test approvals.
Top Skills:
Aerospace StructuresEngineering ToolsLiquid Rocket EnginesMechanical Engineering DesignVerification Testing
Aerospace • Hardware • Robotics • Software • Manufacturing
As a Mission Reliability Engineer, you'll mitigate risks and ensure flight readiness for the Terran R rocket, collaborating across engineering disciplines and managing anomalies during testing and launch.
Top Skills:
Engineering Risk ManagementSystem Safety Tools
Cloud • Mobile • Software
The Director of Engineering, DevOps will lead DevOps & SRE functions, ensuring infrastructure reliability and driving innovation across the organization.
Top Skills:
AnsibleAWSAzureGCPGrafanaJenkinsKubernetesPrometheusTerraform
Healthtech • Other • Social Impact • Software • Telehealth
The Staff SRE & DevOps Engineer ensures system reliability and efficiency, collaborates with teams, and applies SRE best practices in a remote work environment.
Top Skills:
AWSKubernetes
Fintech
The Senior Systems Reliability Engineer will optimize and automate processes, monitor systems, troubleshoot issues, and mentor teams, improving system reliability and scalability.
Top Skills:
AnsibleBashC++JavaLinuxPythonRust
Healthtech • Information Technology • Software • Telehealth
As a Staff Database Reliability Engineer at Zocdoc, you'll manage and modernize database systems, ensure data reliability, mentor teams, and handle incidents effectively.
Top Skills:
AuroraAws RdsCdkDynamoDBObservability PlatformsPostgresSQL ServerTerraform
Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
The Principal Site Reliability Engineer ensures that SaaS products are fast and stable, focusing on automating processes, monitoring systems, and collaborating across teams to enhance product performance and reliability.
Top Skills:
AnsibleAppdynamicsAzureAzure DevopsBashC# .NetCosmosDatadogDynatraceHarnessIdera Sql Diagnostic ManagerJavaJenkinsKubernetesNew RelicPowershellPythonRedgate Sql MonitorSolarwinds Database Performance AnalyzerSQLTerraform
Artificial Intelligence • Computer Vision • HR Tech • Machine Learning • Software
The Site Reliability Engineer II will manage and ensure the reliability and efficiency of SaaS application platforms, leveraging tools for automation, monitoring, and incident response while collaborating with various teams.
Top Skills:
AnsibleArgocdAWSAzureCisDockerElasticsearchFips 140-2Fips 140-3GCPGoGrafanaHelmIptablesJavaJenkinsKubernetesLinuxMongoDBMssqlMySQLPostgresPrometheusPythonSelinuxSolrStigTerraform
2 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
This role involves setting technical strategies, collaborating across teams, managing operations and availability, and fostering a culture of quality and ownership within the Site Reliability Engineering team.
Top Skills:
AWSKotlinKubernetesMySQLPythonSpark
Consumer Web • Digital Media • Information Technology • News + Entertainment • Social Media
The Site Reliability Engineer will enhance infrastructure resilience, optimize system performance, and improve automation within the cloud-based infrastructure while collaborating across engineering teams.
Top Skills:
AnsibleArgocdBashCC#C++DockerGoHelmJavaKubernetesLinuxPythonRustTerraform
Mobile • Software
Site Reliability Engineers will work on production infrastructure, focusing on AWS and Kubernetes while ensuring high availability and customer satisfaction.
Top Skills:
AirflowAWSCircleCICloudwatchEksGrafanaMongoDBPagerdutyPingdomRustScala SparkTerraformTypescript
Artificial Intelligence • Fintech • Hardware • Information Technology • Sales • Software • Transportation
Design, scale, and manage AWS services for IoT devices. Collaborate on infrastructure, optimize performance, and ensure high availability of services.
Top Skills:
AWSBashGoHelmKubernetesPythonRubyTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
Design and optimize AWS cloud infrastructure as an AWS Cloud Architect with SRE skills, focusing on scalability, cost-efficiency, and security.
Top Skills:
AnsibleAWSAws CloudtrailAws CloudwatchBashConsulDockerDocumentdbEc2EksGrafanaLambdaPrometheusPythonRdsS3TerraformVault
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
As a Principal SRE, drive reliability practices for the Identity Security Cloud platform, mentor teams, and improve service reliability and performance.
Top Skills:
AWSGoGrafanaHoneycombJavaKibanaKubernetesPrometheusPythonTerraform
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Lead and develop high-performing Site Reliability Engineering teams, drive cross-functional collaboration, and oversee platform reliability and engineering excellence initiatives.
Top Skills:
AWSAzureGCPOci
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Principal Site Reliability Engineer, lead SRE practices, collaborate cross-functionally, ensure system resiliency, and drive operational improvements.
Top Skills:
CdkCloudFormationDatadogGoJavaScriptPrometheusPythonTerraformTypescript
Reposted 4 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
The role involves maintaining and improving CI/CD infrastructure using Argo Workflows and Kubernetes, ensuring effective deployment for engineering teams.
Top Skills:
AWSAzureGoGCPKubernetesPython
Reposted 4 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills:
AnsibleAWSAzureCloudFormationGCPGoTerraform
Reposted 5 Days AgoSaved
Easy Apply
Easy Apply
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Senior Software Engineer in Site Reliability Tooling at Upstart, you will enhance system reliability and automation, implement monitoring standards, and improve incident response practices.
Top Skills:
CdkCloudFormationDatadogGoJavaScriptPrometheusPythonTerraformTypescript
Cloud • Information Technology • Software
This role involves evolving database infrastructure with a focus on MongoDB by automating processes, optimizing performance, and ensuring system reliability in collaboration with SREs and DevOps teams.
Top Skills:
AnsibleCi/CdInfrastructure-As-CodeMongoDBPython
Mobile • Real Estate • Software • Database • Analytics
As a Senior Site Reliability Engineer, you will enhance Perchwell's infrastructure, ensuring reliability, scalability, and effective deployments while collaborating with engineering and product teams.
Top Skills:
AWSCloudfrontCodebuildEc2EcrEksGoIamKubernetesLambdaPythonRdsRoute53RustS3Terraform
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As an SRE Manager, you will oversee a team ensuring operational excellence in a production environment, manage incident responses, and lead infrastructure planning efforts.
Top Skills:
FreebsdLinuxStorage Area NetworksVMware
HR Tech • Software
As a Senior Data Reliability Engineer, you'll enhance data system reliability, collaborate with teams, and optimize database infrastructure using GCP and PostgreSQL.
Top Skills:
DatadogGoogle Cloud PlatformHelmKubernetesPgbouncerPostgresRedis
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills:
AWSGCPAzureMongoDB
Cloud • Fintech • Food • Information Technology • Software • Hospitality
The Sr. Site Reliability Engineer will automate incident and change management processes, optimize efficiency, and collaborate with stakeholders to maintain reliability at Toast.
Top Skills:
AWSAzureFirehydrantGCPGoJIRAPythonTerraform
Top Los Angeles Companies Hiring Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results