Get the job you really want.
Top Reliability Engineer Jobs in Los Angeles, CA
Cloud • Security • Software • Cybersecurity • Automation
The Intermediate Site Reliability Engineer at GitLab will design scalable networking infrastructure, collaborate on projects, respond to incidents, and automate operational tasks.
Top Skills:
AnsibleBashChefGitlab CiGoGoogle Cloud PlatformKubernetesRubyTerraform
Marketing Tech
The Cloud Reliability Engineer develops and deploys cloud tools, containerizes applications, collaborates with teams, ensures system performance, and participates in on-call incidents.
Top Skills:
AWSDockerGithub ActionsGoGoogle BigqueryGCPGoogle Cloud FunctionsKubernetesLambdaPythonSQLTerraform
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will design, implement, and enhance systems for infrastructure development, focusing on automation, reliability, and developer experience.
Top Skills:
AWSAzureBazelCrossplaneGCPGithub ActionsKubernetesTerraform
Cloud • Information Technology • Security • Software
The Cloud Reliability Engineer will enhance the reliability of HashiCorp Boundary Cloud by developing tools for monitoring, managing incidents, and improving developer productivity, while also participating in on-call rotations.
Top Skills:
AWSAws AuroraGoNomadPostgresTraefik
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The SRE Cloud Architect will design and optimize AWS cloud infrastructure focusing on scalability, reliability, and cost efficiency, while mentoring teams and ensuring best practices in security and operational excellence.
Top Skills:
AnsibleApi GatewayAWSAws CdkAws CloudwatchAws GuarddutyBashCloudFormationCloudfrontCloudtrailDocumentdbEc2EksGitlabGrafanaLambdaLokiMimirPrometheusPythonRdsS3Secrets ManagerSecurity HubSsmTempoTerraform
Mobile
The Telephony Reliability Engineer II leads telecommunications support initiatives, manages network systems, addresses voice-related issues, and oversees SIP integrations while providing 24/7 on-call support.
Top Skills:
AsteriskAWSFreeswitchKamailioLinuxMssqlMySQLVoip
Reposted 11 Days Ago
Cloud • Fintech • Cryptocurrency • NFT • Web3
As a Senior Software Engineer on the Core Reliability Team at Coinbase, you'll enhance system reliability, scale services significantly, and communicate effectively with all engineering levels while working on critical infrastructure projects.
Top Skills:
AWSAzureGCPGoRubyTerraform
Big Data • Cloud • Software • Database
Seeking a Site Reliability Engineer with strong networking skills to build and maintain secure infrastructure for service communication. Involves collaboration, support, and 24/7 on-call participation.
Top Skills:
AWSAzureBgpCloud ComputingDnsGCPKubernetesLoad BalancingSdnService MeshTcp/IpTls
Featured Jobs
Big Data • Cloud • Software • Database
Lead the Fabric team as a Site Reliability Engineer, focusing on building resilient infrastructure for secure service communication, while overseeing team direction and addressing technical issues.
Top Skills:
AWSAzureBgpDnsGCPKubernetesTcp/IpTls/MtlsVpcs
Big Data • Cloud • Software • Database
Design and build infrastructure for cloud services; improve resilience, automation, and monitoring; participate in on-call rotation.
Top Skills:
Amazon Web ServicesCi/CdGCPKubernetesLinuxAzureMongoDB
Big Data • Cloud • Software • Database
The Staff Site Reliability Engineer will manage secure communication infrastructure, focusing on deep networking, distributed systems, and ensuring system resilience in a multi-cloud environment.
Top Skills:
AWSAzureBgpDnsGCPKubernetesLoad-BalancingSdnService MeshTcp/IpTls/MtlsVpns
Big Data • Cloud • Software • Database
The Lead Site Reliability Engineer will manage the Fabric team, ensuring secure communication infrastructure, guiding engineering practices, and participating in on-call support.
Top Skills:
AWSAzureBgpDnsGCPKubernetesSdnTcp/IpTls/Mtls
Aerospace • Logistics • Security • Software • Cybersecurity
The Sr. Principal Reliability Engineer leads reliability assessments, conducts failure analyses, and supports spacecraft systems design and testing within an Integrated Product Team.
Top Skills:
Aerospace Reliability StandardsDod Military StandardsEffects And Criticality AnalysisElectronic Systems ReliabilityFailure ModesNasa StandardsReliability EngineeringRoot Cause Analysis
Aerospace • Logistics • Security • Software • Cybersecurity
The Principal Reliability Engineer will lead reliability analysis, conduct failure investigations, and develop reliability assessments for spacecraft systems.
Top Skills:
Electronic Parts Stress AnalysisFmecaReliability AnalysisSingle-Events Effects AnalysisWorst Case Analysis
Aerospace • Logistics • Security • Software • Cybersecurity
Responsible for developing and implementing machine monitoring strategies, ensuring equipment reliability, and preparing reports. Supports operational improvements and maintenance procedures.
Top Skills:
Allen BradleyCnc EquipmentExcelFanucHeidenheinIndustrial Iot TechnologyMs ProjectPlcPowerPointSiemensWord
12 Days Ago
Easy Apply
Easy Apply
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
The Lead Site Reliability Engineer will design, develop, and operate observability systems, ensuring service reliability in large distributed environments. Responsibilities include scaling observability systems, writing monitoring libraries, and collaborating with engineering teams.
Top Skills:
AnsibleBashElasticsearchGoKafkaPrometheusPythonRubyScalaTerraform
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The role involves developing and maintaining cloud services for reliability and scalability, optimizing architecture, and mentoring other developers while focusing on innovative software practices.
Top Skills:
AWSCassandraElasticsearchGoJavaKafkaKotlinNode.jsPythonScala
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As a Senior Software Engineer II at CrowdStrike, you will lead initiatives in platform reliability and scalability, develop core libraries, and drive architectural decisions while mentoring others.
Top Skills:
AWSCassandraElasticsearchGoJavaK8SKafkaKotlinNode.jsPythonScala
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
The Senior Software Engineer will lead efforts in site reliability engineering, improving monitoring, incident response, and tooling to enhance system reliability and performance.
Top Skills:
CdkDatadogGoJavaScriptPrometheusPulumiPythonTerraformTypescript
Software • Consulting
As a MuleSoft Reliability Engineer, you will investigate integration issues, ensure platform performance, contribute to incident management, and support internal applications.
Top Skills:
Integration PlatformsMulesoft
14 Days Ago
Easy Apply
Easy Apply
Cloud • Security • Software • Cybersecurity • Automation
The Senior Site Reliability Engineer is responsible for maintaining user-facing services, managing database operations, and optimizing cloud infrastructure at GitLab. Key responsibilities include designing and maintaining ClickHouse and PostgreSQL clusters, implementing monitoring systems, and ensuring security compliance. The role requires strong technical skills in database management and cloud automation, along with leadership and communication abilities.
Top Skills:
AnsibleChefClickhouseGoGrafanaHelmKubernetesLinuxPostgresPrometheusPythonRubyTerraform
Reposted 15 Days Ago
Easy Apply
Easy Apply
Cloud • Security • Software • Cybersecurity • Automation
As a Site Reliability Engineer at GitLab, you will automate operational tasks, maintain system reliability, monitor capacity, and enhance security for customer environments.
Top Skills:
AnsibleAWSElkGCPGitlabGoJsonnetKubernetesPrometheusRubyTerraform
Artificial Intelligence • Fintech • Information Technology • Software • Data Privacy
The Principal Site Reliability Engineer ensures SaaS products are fast and stable, focuses on automation, system monitoring, and collaborates with teams to improve product performance.
Top Skills:
C#,.Net,Java,Harness,Azure Devops,Ansible,Jenkins,New Relic,Dynatrace,Datadog,Appdynamics,Powershell,Python,Bash,Terrraform,Sql,Cosmos,Solarwinds Database Performance Analyzer,Idera Sql Diagnostic Manager,Redgate Sql Monitor,Kubernetes,Aks,Eks
Artificial Intelligence • Cloud • Fintech • Professional Services • Software • Analytics • Financial Services
As a Staff Software Engineer in Reliability, you'll lead reliability solutions, mentor teams, and enhance system performance using modern architectures and best practices.
Top Skills:
Apache KafkaAWSDartDockerGitGoJavaKubernetesMySQLNginxOpentelemetryPostgresPythonReactRedshiftSnowflakeTypescript
Computer Vision • Healthtech • Information Technology • Logistics • Machine Learning • Software • Manufacturing
As a Senior Software Engineer II, you'll design scalable infrastructure to support Dandy's products, ensuring quality and performance in a collaborative environment.
Top Skills:
ChronosphereGCPGraphQLKubernetesNestjsNode.jsPostgresPulumiReactReduxTemporalTypescript
Top Los Angeles Companies Hiring Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results