Get the job you really want.
Top Reliability Engineer Jobs in Los Angeles, CA
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The role involves ensuring the reliability of data systems through building ETL processes, automated quality checks, and managing data integrity and reporting.
Top Skills:
Apache Airflow, Spark, Apache Superset, ETL, JIRA, MySQL, Numpy, Pandas, Postgres, Python, SQL, Sqlalchemy, Tableau
Aerospace • Artificial Intelligence • Cloud • Machine Learning • Software • Cybersecurity • Defense
As a Senior Risk and Reliability Engineer, you will guide risk and reliability activities, provide critical assessments, and mentor junior engineers while ensuring systems meet safety and reliability standards in a rapidly evolving space industry.
Top Skills:
Failure Mode Effects Analysis, Fault Tree Analysis, Probabilistic Risk Assessment, Reliability Modeling, Reliability Testing, Statistical Methods
3D Printing • Aerospace • Hardware • Robotics • Software • Manufacturing
The Mission Reliability Engineer ensures high reliability in rocket design and operations, performing risk assessments and driving architectural decisions in the aerospace sector.
Top Skills:
Additive Manufacturing, Aerodynamics, Reliability Analysis, Trajectory Design
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The role involves maintaining and automating database systems, working with large-scale data and cloud infrastructures, and supporting engineering teams.
Top Skills:
AWS, Cassandra, Chef, Elasticsearch, Kafka, MySQL, Postgres, Ruby, Salt, Zookeeper
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The role involves maintaining data components, developing infrastructure services for the engineering team, and ensuring data security and availability.
Top Skills:
AWS, Cassandra, Chef, Elasticsearch, Kafka, MySQL, Postgres, Ruby, Salt, Zookeeper
Aerospace • Artificial Intelligence • Cloud • Machine Learning • Software • Cybersecurity • Defense
The position involves applying statistical techniques for risk analysis, conducting reliability modeling for satellite systems, and providing technical advice to management and customers. The candidate will also manage projects and deliver work products through teamwork and collaboration with various engineering disciplines.
Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
As a Senior Site Reliability Engineer, you'll develop cloud-based data platforms using GCP, support data pipeline construction, and improve data management practices while collaborating with various teams.
Top Skills:
Apache Hadoop, Dataproc, Git, Google Cloud Platform, Kafka, Py-Spark, Python, Rest Api, Spark, SQL
Healthtech • Pharmaceutical • Telehealth
The Database Reliability Engineer will ensure database performance, scalability, and reliability, applying SRE principles and driving automation while collaborating with engineering teams.
Top Skills:
AWS, Bash, Datadog, Go, Infrastructure-As-Code, Postgres, Pulumi, Python, Rds, Splunk, Terraform
Featured Jobs
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
As a Senior Software Engineer on the Core Reliability Team at Coinbase, you'll enhance system reliability, scale services significantly, and communicate effectively with all engineering levels while working on critical infrastructure projects.
Top Skills:
AWS, Azure, GCP, Go, Ruby, Terraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will enhance system reliability, improve observability, build automation, and optimize cloud deployments while mentoring engineers and ensuring process improvements.
Top Skills:
AWS, Azure, Datadog, Docker, Ec2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
As a Senior Site Reliability Engineer, you will manage IAM systems, implement cloud-native applications, and enhance automation and security in operations, ensuring peak uptime and performance.
Top Skills:
Ansible, AWS, Azure, Azure Ad, C#, Docker, Duo, GCP, Go, Google Workspace, Java, Kubernetes, Okta, Ping, Python, Ruby, Terraform
Big Data • Cloud • Software • Database
Lead the Fabric team as a Site Reliability Engineer, focusing on building resilient infrastructure for secure service communication, while overseeing team direction and addressing technical issues.
Top Skills:
AWS, Azure, Bgp, Dns, GCP, Kubernetes, Tcp/Ip, Tls/Mtls, Vpcs
Big Data • Cloud • Software • Database
Design and build infrastructure for cloud services; improve resilience, automation, and monitoring; participate in on-call rotation.
Top Skills:
Amazon Web Services, Ci/Cd, GCP, Kubernetes, Linux, Azure, MongoDB
Big Data • Cloud • Software • Database
Seeking a Site Reliability Engineer with strong networking skills to build and maintain secure infrastructure for service communication. Involves collaboration, support, and 24/7 on-call participation.
Top Skills:
AWS, Azure, Bgp, Cloud Computing, Dns, GCP, Kubernetes, Load Balancing, Sdn, Service Mesh, Tcp/Ip, Tls
Cloud • Greentech • Other • Energy
In this role, you'll support virtualization and kernel performance, develop automation tools, optimize compute platforms for AI, and collaborate with hardware teams.
Top Skills:
C, Go, Kvm, Linux, Qemu, Rust, Smartnics
Cloud • Greentech • Other • Energy
You'll optimize Crusoe's compute infrastructure, focusing on virtualization, performance tuning, and kernel optimizations for AI workloads.
Top Skills:
C, Ci/Cd, Go, Hypervisors, Infrastructure As Code, Kvm, Linux, Qemu, Rust
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
The Lead Site Reliability Engineer will design, develop, and operate observability systems, ensuring service reliability in large distributed environments. Responsibilities include scaling observability systems, writing monitoring libraries, and collaborating with engineering teams.
Top Skills:
Ansible, Bash, Elasticsearch, Go, Kafka, Prometheus, Python, Ruby, Scala, Terraform
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
The Senior Software Engineer will lead efforts in site reliability engineering, improving monitoring, incident response, and tooling to enhance system reliability and performance.
Top Skills:
Cdk, Datadog, Go, JavaScript, Prometheus, Pulumi, Python, Terraform, Typescript
Insurance • Sales • Software
As a Cloud & Site Reliability Engineer, you will ensure the reliability and availability of software systems, participate in agile ceremonies, build infrastructure with IaC, and manage on-call duties.
Top Skills:
Cloud, Delivery Pipelines, DevOps, Infrastructure As Code
Digital Media • Kids + Family • Mobile • Software • Sports
The Senior DevOps Engineer will enhance observability and performance monitoring systems, implement SRE best practices, develop custom dashboards, and collaborate across teams to ensure reliability and scalability of cloud infrastructure.
Top Skills:
Datadog, Github Actions, Gitlab Ci, Jenkins, Prometheus, Python, Terraform, Typescript
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The role involves developing and maintaining cloud services for reliability and scalability, optimizing architecture, and mentoring other developers while focusing on innovative software practices.
Top Skills:
AWS, Cassandra, Elasticsearch, Go, Java, Kafka, Kotlin, Node.js, Python, Scala
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
As a Staff Site Reliability Engineer, you will enhance the reliability of SailPoint's identity security services, coach engineers on best practices, and influence architectural designs for scalability.
Top Skills:
AWS, Go, Grafana, Honeycomb, Java, Kibana, Kubernetes, Prometheus, Python, Terraform
Cloud • Security • Software • Cybersecurity • Automation
The Senior Site Reliability Engineer is responsible for maintaining user-facing services, managing database operations, and optimizing cloud infrastructure at GitLab. Key responsibilities include designing and maintaining ClickHouse and PostgreSQL clusters, implementing monitoring systems, and ensuring security compliance. The role requires strong technical skills in database management and cloud automation, along with leadership and communication abilities.
Top Skills:
Ansible, Chef, Clickhouse, Go, Grafana, Helm, Kubernetes, Linux, Postgres, Prometheus, Python, Ruby, Terraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Software Engineer will enhance system reliability, manage projects for scalability, develop automation tools, and mentor engineering teams.
Top Skills:
AWS, Azure, Docker, Ec2, GCP, Go, Kubernetes, Ruby, Terraform
Artificial Intelligence • Cloud • Fintech • Professional Services • Software • Analytics • Financial Services
As a Senior Software Engineer in the SRE team, you will design and develop solutions for reliability and performance, collaborating with engineering teams on internal tools and observability features.
Top Skills:
AWS, Dart, Docker, Git, Go, Java, Kafka, Kubernetes, MySQL, Nginx, Opentelemetry, Postgres, Python, React, Snowflake, Terraform, Typescript
Top Los Angeles Companies Hiring Reliability Engineers