Top Reliability Engineer Jobs in Los Angeles, CA
Healthtech • Software
The Database Reliability Engineer manages and maintains cloud-based database infrastructures for SaaS applications, focusing on automation, process improvement, and collaboration with engineering teams.
Top Skills:
Ansible, AWS, Azure, Azure Data Factory, C#, Databricks, GCP, Git, Grafana, InfluxDB, MySQL, Postgres, PowerShell, Python, SQL, SQL Server, Terraform
2 Days Ago
Manufacturing • Renewable Energy
The Senior Grid Reliability Engineer will ensure compliance and optimize performance of Battery Energy Storage Systems, addressing operational issues through collaboration and analysis while monitoring regulatory standards.
Top Skills:
Power BI, PowerShell, Python, Seeq, SQL
Marketing Tech
The Cloud Reliability Engineer develops and deploys cloud tools, maintains systems performance, participates in incident response, and collaborates with teams. Requires DevOps experience, cloud expertise, and programming skills.
Top Skills:
AWS, Docker, Go, Google BigQuery, GCP, Kubernetes, Python, SQL, Terraform
Reposted 12 Days Ago
Aerospace • Other
As an RF Hardware Reliability Engineer, you will investigate failure causes, drive improvements, and perform hands-on troubleshooting for reliability in satellite systems.
Top Skills:
C, C++, Python, RF Systems, SQL
Reposted 12 Days Ago
Aerospace • Other
Responsible for ensuring hardware reliability by conducting root cause analysis, troubleshooting RF systems, and improving satellite product quality.
Top Skills:
C, C++, Python, RF Systems, SQL
Healthtech • Information Technology • Software • Telehealth
The Senior Site Reliability Engineer will develop, monitor, and maintain distributed production systems, ensuring uptime for patients and providers while automating processes and supporting a large engineering team.
Top Skills:
AWS, Docker, GCP, Kubernetes
Artificial Intelligence • Big Data • Cloud • Software • Analytics • Infrastructure as a Service (IaaS) • Big Data Analytics
As an Airflow Reliability Engineer, you'll provide expertise in Apache Airflow, solve challenges for customers, and contribute to open-source projects, while enhancing your technical and customer-facing skills.
Top Skills:
Apache Airflow, AWS, Azure, Docker, GCP, Kubernetes, Postgres, Python, SQL
Reposted 9 Days Ago
Artificial Intelligence • Machine Learning • Natural Language Processing • Software • Conversational AI
The engineer will build and operate AI/ML infrastructure, managing services on AWS and bare metal, using tools like Kubernetes and Terraform.
Top Skills:
AWS, Bash, Go, Kubernetes, Python, Slurm, Terraform
HR Tech • Information Technology • Professional Services • Sales • Software
Own and operate production-grade Kubernetes infrastructure on AWS, build GitOps CI/CD with GitHub Actions and ArgoCD, develop AI agents and internal DevOps tooling, maintain Datadog-based observability, and manage on-call incident response while collaborating with engineering teams to improve reliability and delivery speed.
Top Skills:
AI/LLM, ArgoCD, AWS, CI/CD, Datadog, GitHub Actions, GitOps, Go, Kubernetes, Python
Big Data • Healthtech • HR Tech • Machine Learning • Software • Telehealth • Big Data Analytics
The Staff Site Reliability Engineer will architect, operate, and improve the platform while ensuring security compliance and enhancing development processes.
Top Skills:
AWS, Elasticsearch, Istio, Kubernetes, NATS, Node.js, Postgres, Python, React, Terraform, TypeScript
Reposted 9 Days Ago
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will build and scale identity management tools, automate operations, ensure security, and support AWS, GCP, and Azure environments.
Top Skills:
Ansible, AWS, Azure, C#, Cloud Identity Providers, Docker, GCP, Go, Infrastructure as Code, Java, Kubernetes, Python, Ruby, Terraform
AdTech • Artificial Intelligence • Marketing Tech • Software • Analytics
The Senior Site Reliability Engineer will enhance system reliability, develop production-grade code, implement observability tools, conduct root cause analyses, and collaborate on system design for scalability.
Top Skills:
ArgoCD, CI/CD, Docker, GitOps, Go, Grafana, Honeycomb, Jenkins, Kubernetes, OpenTelemetry, Prometheus, Python, Terraform
Reposted 11 Days Ago
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills:
AWS, Fluent Bit, GCP, Jaeger, Kubernetes, Azure, Quickwit, Splunk, Vector, VictoriaMetrics
3D Printing • Aerospace • Hardware • Software • Manufacturing
The Senior Design Reliability Engineer will lead technical reviews, oversee flight systems testing, ensure design compliance, and foster cross-functional collaboration to enhance engineering reliability in space station development.
Top Skills:
Excel, MATLAB, Python, Tableau
3D Printing • Aerospace • Hardware • Software • Manufacturing
The Staff Design Reliability Engineer leads design and test processes for artificial-gravity space habitats, ensuring system reliability and engineering standards while fostering cross-functional collaboration.
Top Skills:
Excel, MATLAB, Python, Tableau
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
Lead technical direction for software architecture and cross-team initiatives focusing on scaling consumer-facing systems and maximizing loan originations while maintaining compliance and system integrity.
Top Skills:
AWS, CI/CD, Docker, GitHub Actions, Infrastructure as Code, React, Ruby on Rails
Big Data • Cloud • Software • Database
Develop and maintain Kubernetes runtime environments, support developers, resolve critical issues, and participate in on-call rotations for production systems.
Top Skills:
AWS, Azure, cert-manager, CoreDNS, CRDs, CRI, CSI, Gatekeeper, GCP, Go, Helm, Kubernetes, Kustomize, Operators, Python, Terraform
Fintech • Software
The Senior Site Reliability Engineer ensures fast, stable SaaS products through automation, collaboration, monitoring, and implementing AI tools to enhance performance and reliability.
Top Skills:
AI Tools, Ansible, AppDynamics, AWS, Azure, Azure DevOps, Bash, C# .NET, Cosmos, Datadog, Dynatrace, Harness, Java, Jenkins, Kubernetes, New Relic, PowerShell, Python, SaaS, SQL, Terraform
Angel or VC Firm • Artificial Intelligence • Fintech • Software • Financial Services
As a Site Reliability Engineer, you'll maintain infrastructure for ML workloads, implement observability tools, manage CI/CD pipelines, and troubleshoot incidents in a collaborative environment.
Top Skills:
Airflow, AWS, Azure, Bash, Datadog, Docker, ELK, GCP, GitHub Actions, GitLab, Grafana, Kubeflow, Kubernetes, MLflow, Palantir Foundry, Prometheus, Python, SageMaker Pipelines, Terraform
Aerospace • Other
As a Build Reliability Engineer, you'll enhance manufacturing reliability for the Raptor engine, develop quality plans, and investigate production failures.
Top Skills:
Advanced Product Quality Planning, Control Plans, Corrective Action, Design of Experiments, Lean Principles, Measurement Systems Analysis, Process Failure Mode and Effects Analysis, Root Cause Analysis, Statistical Process Control
Database
Manage and optimize Postgres databases at scale on AWS RDS, own reliability/monitoring, execute low-downtime upgrades and migrations, troubleshoot production issues, participate in on-call rotation, and collaborate with platform and product teams.
Top Skills:
AWS RDS, Barman, Go, pgBackRest, Postgres, TypeScript, WAL-G
Legal Tech • Software
Lead automation and optimization of Filevine's data platform: performance-tune MSSQL/Postgres, optimize Snowflake, provision infrastructure with Terraform/AWS, run stateful containers on Kubernetes, integrate AI/LLM and MCP for operational automation, manage CI/CD, capacity planning, and documentation, and serve in a 24/7 on-call rotation.
Top Skills:
AWS, C#, Dapper, Docker, DynamoDB, Entity Framework, GitLab, Kubernetes, LLMs, MCP (Model Context Protocol), Microsoft SQL Server (MSSQL), Octopus Deploy, OpenSearch, Postgres, PowerShell, Python, Redis, Snowflake, Terraform
Reposted 22 Days Ago
Aerospace • Other
As a Sr. Hardware Reliability Engineer, you will ensure hardware reliability for Starshield satellites, troubleshoot failures, improve designs, and collaborate across teams.
Top Skills:
C, C++, Digital Multimeters, Oscilloscopes, Power Supplies, Python, RF Test Equipment, Soldering Equipment, Spectrum Analyzer, SQL, Vector Network Analyzer
Reposted 18 Days Ago
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance CI/CD frameworks, automate cloud infrastructure, manage Kubernetes and AWS services, and ensure operational excellence.
Top Skills:
Ansible, AWS, Bash, Chef, CI/CD, Docker, Git, Kubernetes, Puppet, Python, Ruby, Salt, Terraform
Artificial Intelligence • Other • Security • Software • Analytics • Big Data Analytics
The Lead Site Reliability Engineer will oversee the Infrastructure SRE team, focusing on system reliability, automation, and mentoring while collaborating with product engineering.
Top Skills:
CI/CD, Datadog, Docker, ELK Stack, GitOps, Go, Kubernetes, Linux/Unix, New Relic, NoSQL, Prometheus, Python, SQL, Stackdriver, Terraform