Top Reliability Engineer Jobs in Los Angeles, CA
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
The role involves developing and managing scalable cloud infrastructure, automating tasks, and leading technical projects in a 24/7 on-call environment.
Top Skills:
Ansible, Apache Airflow, Argo, AWS, Debian, Docker, IaaS, Luigi, Python, Ruby, Scala, Terraform, Ubuntu
Reposted 17 Hours Ago
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
Lead the design and operation of large-scale observability systems for Meraki's cloud services, ensuring performance and availability.
Top Skills:
Ansible, Bash, Consul, Cortex, Elasticsearch, ELK, Go, Graphite, Kafka, Kibana, Linux, Logstash, Prometheus, Python, Ruby, Scala, Terraform, Thanos
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The Site Reliability Engineer will design, build, and optimize the AWS cloud platform and infrastructure, ensuring reliability, security, and cost-efficiency while implementing automation and operational excellence practices.
Top Skills:
Ansible (AWX), Argo CD, AWS, AWS CDK, AWS CloudTrail, AWS CloudWatch, Bash, CloudFormation, Consul, Docker, EC2, EKS, GitLab, Grafana, Helm, IAM, Lambda, Loki, Mimir, New Relic, Prometheus, Python, S3, Secrets Manager, SSM, Tempo, Terraform, Vault, VPC
Artificial Intelligence • Cloud • Fintech • Professional Services • Software • Analytics • Financial Services
Design and code solutions for reliability and performance as a Senior Software Engineer on the Site Reliability Engineering team, focusing on internal tooling and collaboration with product teams.
Top Skills:
AWS, Docker, Git, Go, Java, Kafka, Kubernetes, MySQL, Nginx, OpenTelemetry, Postgres, Python, React, Snowflake, Terraform, TypeScript
Reposted Yesterday
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Lead system reliability, scalability, and automation initiatives. Build high-quality software while mentoring teams and enhancing cloud services performance.
Top Skills:
AWS, Azure, Docker, EC2, GCP, Go, Kubernetes, Ruby, Terraform
Fintech • HR Tech
Design and implement reliability tools and systems, drive DevOps practices, mentor peers, and enhance observability for operational excellence.
Top Skills:
AWS, Datadog, Kubernetes, Linux, Python, Ruby, Terraform, TypeScript
Fintech • Social Impact • Financial Services
As an SRE I Engineer, you will automate AWS deployments, enhance observability, collaborate with engineers, and migrate legacy technologies.
Top Skills:
Argo CD, AWS, Bash, C#, C/C++, CI/CD, CircleCI, Datadog, ECS, EKS, ELK, GitHub Actions, Go, Java, Jenkins, Kubernetes, Lambda, Prometheus, Python, Ruby, Terraform, YAML
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Design and develop software applications for observability systems. Collaborate with teams, review code, and enhance system reliability and performance.
Top Skills:
AWS, Azure, GCP, Go, Grafana, Kubernetes, Linux, Prometheus, Python, TSDBs
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will improve system reliability, automate processes, collaborate on cloud infrastructure, and mentor engineers to enhance the engineering culture at Coinbase.
Top Skills:
AWS, Azure, Datadog, Docker, EC2, GCP, Go, Kibana, Kubernetes, Ruby, Terraform
Artificial Intelligence • Enterprise Web • Machine Learning • Natural Language Processing • Software • Conversational AI • Automation
As a Site Reliability Engineer, you will build systems and abstractions, maintain cloud security, optimize CI/CD processes, and manage on-call practices while collaborating across teams and driving best practices.
Top Skills:
AWS Cloud, Elasticsearch, Go, JavaScript, MongoDB, Node.js, React, Redis
Reposted 8 Days Ago
Big Data • Cloud • Software • Database
The role involves maintaining and improving CI/CD infrastructure using Argo Workflows and Kubernetes, ensuring effective deployment for engineering teams.
Top Skills:
AWS, Azure, Go, GCP, Kubernetes, Python
Big Data • Software
The Performance & Reliability Engineer will enhance Aerospike's graph database performance, troubleshoot issues, and optimize features for scalability and reliability.
Top Skills:
Ansible, C/C++, CentOS, CoreOS, eBPF, Go, Grafana, Java, Kubernetes, Prometheus, Red Hat, Terraform, Unix/Linux
Consumer Web • Digital Media • Information Technology • News + Entertainment • Social Media
The Senior Site Reliability Engineer will optimize and enhance infrastructure performance, analyze system stability, and develop automation tools while supporting the company’s growth in digital innovation.
Top Skills:
Ansible, Bash, C, C++, Django, Docker, Flask, Go, Java, Kubernetes, Laravel, Linux, MVC, ORM, Python, Rust, Terraform
Automotive • Fintech • Hardware • Payments • Travel • Financial Services
As a Senior Site Reliability Engineer, you will enhance cloud platform security and efficiency, automate deployments, troubleshoot issues, and collaborate with teams.
Top Skills:
Ansible, AWS, Bash, Java, PowerShell, Python, SQL Server 2019, Terraform, Windows Server 2019
Insurance
As a Senior Site Reliability Engineer, you'll build and maintain Openly's infrastructure, automate tasks, implement security practices, and ensure system reliability.
Top Skills:
ArcGIS, CI/CD, CircleCI, Datadog, Go, GCP, Jupyter Notebooks, Kubernetes, Nuxt, Postgres, Python, R, Tailwind, Terraform, Vue.js, Webpack
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
As a Senior Software Engineer, you will enhance system reliability and scalability while ensuring secure service configurations and improving deployment capabilities across the company.
Top Skills:
AWS, Azure, Debugging, GCP, Go, Observability, Performance Tuning, Ruby, Service Oriented Architecture, Terraform
Big Data • Cloud • Software • Database
This role involves supporting, maintaining, and growing the MongoDB Atlas platform, collaborating with teams, and resolving operational issues in a 24/7 on-call environment.
Top Skills:
AWS, Azure, DNS, GCP, Go, HTTP, Linux, Python, Ruby, TLS
Reposted 17 Days Ago
Big Data • Fintech • Mobile • Payments • Financial Services
Manage technical strategy and engineering operations ensuring application reliability. Collaborate cross-functionally and develop talent within the team while advocating for quality and ownership.
Top Skills:
AWS, Kotlin, Kubernetes, MySQL, Python, Spark
23 Days Ago
Big Data • Cloud • Software • Database
As a Site Reliability Engineer in the DevInfra team, you will enhance infrastructure tools and workflows, ensuring safe and efficient delivery of infrastructure as code while collaborating with various teams.
Top Skills:
AWS, Azure, Bazel, Crossplane, GCP, GitHub Actions, Kubernetes, Terraform
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Senior Site Reliability Engineer will improve system reliability, optimize cloud deployments, automate incident responses, and mentor engineering teams at Coinbase.
Top Skills:
AWS, Azure, Docker, EC2, GCP, Go, Kubernetes, Ruby, Terraform
Reposted 24 Days Ago
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
The role involves enhancing site reliability through tooling, coding, and automation while instilling best practices across teams to improve system health and incident response.
Top Skills:
CI/CD, Datadog, Go, Infrastructure as Code, JavaScript, Kubernetes, Prometheus, Python, Terraform, TypeScript
Information Technology
As a Database Reliability Engineer, you will maintain and improve PostgreSQL infrastructure, resolve production incidents, collaborate with developers, and implement infrastructure as code.
Top Skills:
CI/CD, GitHub Actions, Grafana, MySQL, Postgres, Prometheus, Salt, SQL, Terraform
Blockchain • Software
As a Site Reliability Engineer at Offchain Labs, you will manage infrastructure in cloud environments, design CI/CD workflows, and enhance system reliability with a focus on blockchain technology.
Top Skills:
Argo CD, AWS, Azure, CodeBuild, GCP, GitHub Actions, Go, Grafana, Kubernetes, Loki, Prometheus, Python, Terraform
Cloud • Greentech • Other • Energy
As a Senior Site Reliability Engineer, you'll optimize virtualization and kernel-level performance for AI workloads, develop automation tools, and support compute infrastructure, ensuring scalability and reliability.
Top Skills:
C, CI/CD, Go, Infrastructure as Code, KVM, Linux, QEMU, Rust
Other • Social Impact
Design, develop, and maintain machine learning infrastructure while enhancing reliability and scalability, mentoring team members and collaborating across teams.
Top Skills:
Ansible, Argo CD, Docker, ELK Stack, GPU Acceleration, Grafana, Helm, Kubernetes, Prometheus, Python, PyTorch, scikit-learn, TensorFlow, Terraform