Top Reliability Engineer Jobs in Los Angeles, CA
Fintech • Software
The Senior Site Reliability Engineer ensures fast, stable SaaS products through automation, collaboration, monitoring, and the implementation of AI tools to enhance performance and reliability.
Top Skills:
AI Tools, Ansible, AppDynamics, AWS, Azure, Azure DevOps, Bash, C# .NET, Cosmos, Datadog, Dynatrace, Harness, Java, Jenkins, Kubernetes, New Relic, PowerShell, Python, SaaS, SQL, Terraform
Reposted 10 Days Ago
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will develop and support distributed storage services, ensuring reliability and operational safety, with a focus on automation and efficiency.
Top Skills:
AWS, Azure, DNS, Go, Google Cloud Platform, Kubernetes, Linux, Python, TCP/IP, TLS
Big Data • Cloud • Software • Database
Seeking a Site Reliability Engineer with expertise in networking and distributed systems for building secure multi-cloud infrastructure. Responsibilities include maintaining network architecture and ensuring reliable service-to-service communication, involving a 24/7 on-call rotation.
Top Skills:
AWS, Azure, BGP, DNS, GCP, IPv6, Kubernetes, Load Balancing, mTLS, Service Mesh, TCP/IP, TLS, VPCs, VPNs
Defense • Manufacturing
The Senior Design Reliability Engineer will develop design review processes, ensure compliance with reliability standards, and manage project documentation for spacecraft design.
Top Skills:
Aerospace Systems, Launch Vehicles, Project Management, Spacecraft
Defense • Manufacturing
The Principal Design Reliability Engineer ensures spacecraft designs meet rigorous reliability standards, overseeing design reviews, configuration management, and documentation for aerospace engineering. They lead cross-disciplinary teams, drive process improvements, and ensure smooth transitions to production while capturing lessons learned for continuous improvement.
Top Skills:
Aerospace Engineering, Computer Engineering, Electrical Engineering, Mechanical Engineering
eCommerce • Fintech • Payments • Software
The role involves ensuring software reliability and performance, managing incidents, developing infrastructure automation, and mentoring junior engineers within a platform team.
Top Skills:
AWS, CloudFormation, Datadog, Kubernetes, OpenTelemetry, Ruby, Ruby on Rails, Terraform
Aerospace • Other
As a Hardware Reliability Engineer, ensure satellite hardware reliability through failure analysis, collaboration with teams, and environmental testing.
Top Skills:
C, C++, Python
Aerospace • Other
The Sr. Reliability Engineer will support the design community in producing reliable EEE components, evaluate component quality, lead reliability testing, and interface with various teams to address component failure issues.
Top Skills:
Accelerated Testing Methods, ALT, ESS, HALT, HASS, Monte Carlo Simulations, PCB Design Analysis, Python, Reliability Modeling Methods, Reliability Physics, RGT, Statistical Techniques, Weibull Analysis
Aerospace • Other
Investigate, analyze, and track on-orbit hardware reliability for the Starshield constellation. Drive root-cause investigations, collaborate across design/production/operations, build monitoring and analysis tools, and provide data-driven reliability improvements across design, test, and operations.
Top Skills:
C, C++, Python, SQL
Information Technology • Software • Database • App development
The Enterprise Reliability Engineer will lead technical support for enterprise clients, troubleshoot production bugs, and improve platform reliability while collaborating across teams and contributing to new features.
Top Skills:
AWS, Azure, CI/CD, GCP, PHP
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Principal Software Engineer on the SRE team, lead best practices adoption, mentor engineers, and improve system reliability and user experience through automation and collaboration.
Top Skills:
CDK, CloudFormation, Datadog, Go, JavaScript, Prometheus, Python, Terraform, TypeScript
Artificial Intelligence • Cloud • Software • Infrastructure as a Service (IaaS)
As a Site Reliability Engineer, you will ensure system stability and resilience, define reliability standards, and automate operational processes while collaborating cross-functionally to improve performance and reduce incidents.
Top Skills:
Bash, CI/CD, Docker, Go, Grafana, Kubernetes, Linux, Prometheus, Python
Aerospace • Other
Oversee PCB design and development, manage reliability testing, optimize production, and enhance quality while ensuring DFM for satellite hardware.
Top Skills:
Chemical Engineering, Electrical Engineering, Failure Analysis Tools, IPC Manufacturing Standards, Materials Science, PCB Design, Statistical Techniques
Aerospace
The role involves supporting the Space Resources Program by managing system safety and reliability, ensuring compliance with safety regulations, identifying hazards, and improving safety practices throughout hardware development and operational processes.
Top Skills:
Environmental Health and Safety (EHS), Hazard Analysis, Industrial Hygiene, Process Safety Management, Reliability Engineering, Risk Management, Safety Engineering
AdTech • Marketing Tech • Analytics
Manage and support customer applications, improve system reliability, collaborate with teams on infrastructure needs, and help drive architectural decisions.
Top Skills:
Auto Scaling, AWS, CDNs, Datadog, DNS, Docker, Kafka, Kibana, Kubernetes, Linux, Load Balancers, Postgres, Proxy Servers, Python, RDS, Redshift, Shell/Bash, Spark, Terraform, WAFs
Reposted 19 Days Ago
Cloud • Information Technology • Security • Software • Cybersecurity
This internship role focuses on SRE skills, requiring collaboration and problem-solving in dynamic environments for Zscaler's Zero Trust Exchange team.
Top Skills:
Ansible, AWS ECS, Kubernetes, Linux, Python, Terraform
Software
Drive reliability testing and qualification of cellular base stations, collaborating with R&D for long-term reliability and product lifecycle support.
Top Skills:
Excel, MS Office, MS Word, PTC Windchill, Python, Telcordia
Gaming • Software • Metaverse
The Site Reliability Engineer Intern will automate tasks, collaborate with engineers, and design cloud solutions to improve game services globally.
Top Skills:
C++, Go, Java, Kubernetes, Linux, Python
Database • Analytics
As a Database Reliability Engineer at ClickHouse, you'll improve reliability, manage escalation processes, support incident response, and enhance database performance while collaborating across teams.
Top Skills:
AWS, Azure, C++, ClickHouse, Google Cloud Platform, Python, Shell, SQL
Healthtech • Software
The Database Reliability Engineer manages and maintains cloud-based database infrastructures for SaaS applications, focusing on automation, process improvement, and collaboration with engineering teams.
Top Skills:
Ansible, AWS, Azure, Azure Data Factory, C#, Databricks, GCP, Git, Grafana, InfluxDB, MySQL, Postgres, PowerShell, Python, SQL, SQL Server, Terraform
Aerospace • Other
The Site Reliability Engineer will manage and maintain mission-critical applications, improve software development processes, and provide end-user support, emphasizing safety and performance optimization.
Top Skills:
Ansible, Bazel, Buck, C#, C++, ClickHouse, Docker, JavaScript, Kubernetes, Linux, Make, MySQL, Postgres, Puppet, Python, Terraform
Reposted 23 Days Ago
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will lead security design and implementation for cloud infrastructures, mentor teams, and automate security solutions.
Top Skills:
Ansible, AWS, Azure, Cloud Security Tools, CloudFormation, GCP, Go, Terraform
Aerospace • Other
The Site Reliability Engineer will manage mission-critical platforms, enhance software lifecycle processes, and collaborate closely with software engineers to improve application health and response systems while supporting vehicle software teams.
Top Skills:
Ansible, C#, ClickHouse, Docker, JavaScript, Kubernetes, Linux, MySQL, Postgres, Puppet, Python, Terraform
Marketing Tech
The Cloud Reliability Engineer develops and deploys cloud tools, maintains systems performance, participates in incident response, and collaborates with teams. Requires DevOps experience, cloud expertise, and programming skills.
Top Skills:
AWS, Docker, Go, Google BigQuery, GCP, Kubernetes, Python, SQL, Terraform
Information Technology • Legal Tech
The Senior Technology Site Reliability Engineer is responsible for maintaining and optimizing infrastructure and applications, ensuring reliability and performance while automating processes and collaborating with teams.
Top Skills:
AWS, Chef, Datadog, Go, Grafana, Java, Prometheus, Puppet, Python, Salt, Terraform