Get the job you really want.
Maximum of 25 job preferences reached.
Top Remote Site Reliability Engineer Jobs in Los Angeles, CA
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The AWS Cloud Architect will design, build, and optimize cloud infrastructure, ensuring scalability and security while mentoring junior SREs and defining cloud strategy.
Top Skills:
AnsibleAws Api GatewayAws CloudfrontAws CloudtrailAws CloudwatchAws DocumentdbAws Ec2Aws EksAws LambdaAws RdsAws S3Aws Secrets ManagerAws SsmDockerGrafanaHashicorp ConsulHashicorp TerraformHashicorp VaultKubernetesNew RelicPrometheus
Sales • Software • Automation
Join the Infrastructure Team to build and maintain critical systems, automating database lifecycles and enhancing disaster recovery with a focus on resilience and simplicity.
Top Skills:
AnsibleArgocdAWSClickhouseDockerElasticsearchFlaskGithub ActionsGrafanaKubernetesMongoDBPostgresPythonRedisTerraform
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills:
AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
Reposted 17 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
This role involves setting technical strategies, collaborating across teams, managing operations and availability, and fostering a culture of quality and ownership within the Site Reliability Engineering team.
Top Skills:
AWSKotlinKubernetesMySQLPythonSpark
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance reliability and observability, automate processes, support engineering teams, and promote a culture of reliability at Coinbase.
Top Skills:
AWSAzureDockerEc2GCPGoKubernetesRubyTerraform
Big Data • Cloud • Healthtech • Software • Big Data Analytics
The Senior Site Reliability Engineer will ensure the reliability and scalability of enterprise applications, lead incident management, develop automation tools, mentor team members, and collaborate with cross-functional teams.
Top Skills:
AnsibleAWSBashDockerGitGoHibernateJavaKubernetesLinuxMavenMySQLPythonRubyShellSolrSpringTomcatVagrant
Blockchain • Software
As a Site Reliability Engineer at Offchain Labs, you will manage infrastructure in cloud environments, design CI/CD workflows, and enhance system reliability with a focus on blockchain technology.
Top Skills:
ArgocdAWSAzureCodebuildGCPGithub ActionsGoGrafanaKubernetesLokiPrometheusPythonTerraform
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Principal Staff SRE will lead initiatives in building and optimizing core infrastructure services on-prem and cloud, deploying and managing services at scale, and improving performance with automation and monitoring tools.
Top Skills:
DhcpDnsEbpfGoLdapLinuxNtpPythonTerraformXdp
Software
The Lead Site Reliability Engineer will oversee the architecture and operational excellence of Mattermost's infrastructure, mentoring teams and driving strategic initiatives for performance and reliability in regulated sectors.
Top Skills:
AWSGrafanaKubernetesPrometheusTerraform
Information Technology
As a Site Reliability Engineer at New Era Technology, you'll focus on ensuring operational efficiency, creating reliable systems, and enhancing service performance through AWS expertise.
Top Skills:
AWS
Blockchain • Information Technology • Internet of Things
The Site Reliability Engineer will ensure system reliability, security, and performance by implementing infrastructure as code, CI/CD, and monitoring solutions.
Top Skills:
AWSAzureBashGCPGoKubernetesPythonRustTerraform
Aerospace • Manufacturing
As a Site Reliability Engineer, you'll build and manage observability platforms for satellite communications, define SLOs/SLIs, and collaborate on incident response and deployment automation.
Top Skills:
ArgocdAWSElkGCPGoGrafanaIstioJaegerKubernetesLinkerdLokiOpentelemetryPrometheusPythonTempoTerraform
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Aerospace • Manufacturing
The Staff Site Reliability Engineer will design and manage Aalyria's centralized observability platform, focus on metrics, logging, and tracing systems, implement SLOs and SLIs, automate deployments, and drive incident response strategies for enhanced reliability across satellite and cloud platforms.
Top Skills:
AWSElkGCPGitopsGoGrafanaJaegerJavaKubernetesLokiOpentelemetryPrometheusPythonTempoTerraform
Automotive
Design and implement scalable cloud infrastructure, monitor performance, automate processes, ensure security and compliance, and lead a DevOps team.
Top Skills:
AWSBashCi/CdDockerElk StackGCPGrafanaKubernetesPrometheusPythonTerraform
Reposted 3 Days AgoSaved
Easy Apply
Easy Apply
Artificial Intelligence • Information Technology • Logistics • Machine Learning • Software
Lead reliability initiatives for the production platform, manage incident response, define SLIs/SLOs, and enhance security by embedding it into delivery pipelines. Drive platform improvements in AWS and CI/CD processes.
Top Skills:
AuroraAWSBazelCi/CdDagsterDbtDuckdbDynamoDBEcsJavaJavaScriptKubernetesPythonSpaceliftSqsSsmTerraformTrinoTypescript
Big Data • Healthtech • Information Technology • Analytics
As a Lead Site Reliability Engineer, you'll design and manage scalable cloud infrastructure on GCP, optimize CI/CD processes, and ensure system reliability through observability and incident response, while mentoring others in a cross-product SRE group.
Top Skills:
BashGitlab Ci/CdGkeGoogle Cloud PlatformJenkinsPythonSentrySumo LogicTerraform
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
The Site Reliability Engineer will build and maintain infrastructure, improve software systems, develop scalable microservices, and ensure quality software delivery.
Top Skills:
AWSGoGoogle Cloud PlatformJavaKubernetesAzureSQL
Cloud • Fintech • Information Technology • Software • Business Intelligence
As a Site Reliability Engineer, you will ensure production system reliability, optimize performance, respond to incidents, and collaborate on infrastructure improvements.
Top Skills:
AnsibleAWSBashDatadogDockerElkGitGrafanaKubernetesNew RelicOpentelemetryPrometheusPythonReactRubyRuby On RailsTerraform
Information Technology • Software • Web3
As a Software Engineer focused on SRE and DevSecOps, you will design scalable infrastructure, implement CI/CD pipelines, and automate processes while collaborating with teams to enhance performance and security.
Top Skills:
AnsibleBashDatadogDockerGCPGrafanaKubernetesPythonReactRustSolidityTerraformWeb3
Cloud • Security • Software
The Site Reliability Engineer will design, automate and scale cloud infrastructure while ensuring uptime, performance, and security best practices.
Top Skills:
AnsibleAWSAzureChefDockerGCPGoJavaScriptKubernetesLinuxPuppetPythonRubySaltstackTerraform
Hardware • Machine Learning • Security • Software
The Site Reliability Engineer will manage software deployment for IoT devices, improve observability, maintain dashboards, automate processes, and collaborate on incident responses.
Top Skills:
AnsibleAWSBashC/C++DatadogGrafanaGroovyJavaJavaScriptNoSQLPostgresPrometheusPythonRSigmaSQLTerraform
Artificial Intelligence • Cloud • Fintech • Machine Learning • Mobile • Software
The Staff Site Reliability Engineer will design, implement, and optimize infrastructure for AI services, ensure reliability and performance, and drive automation and observability excellence across engineering teams.
Top Skills:
AzureAzure DevopsDockerElk StackGithub ActionsGrafanaKubernetesMimirPostgresPrometheusSQL ServerTeamcityTerraform
Greentech • Software • Energy
This role involves managing cloud infrastructure, improving system reliability, automation, incident response, and mentoring engineers, requiring deep technical expertise and leadership skills.
Top Skills:
AWSBashDatadogDockerGCPJavaScriptKubernetesLinuxPythonTypescript
Security • Software • Cybersecurity
Seeking a Site Reliability Engineer to manage software development tools for DevOps, optimize workflows, and ensure system performance and reliability while integrating AI-driven solutions.
Top Skills:
ArtifactoryAWSAzureBashClickupConfluenceDockerFigmaFullstoryGCPGitGrafanaJIRAKubernetesPower BIPrometheusPythonSplunkTerraform
Reposted 9 Days AgoSaved
Easy Apply
Easy Apply
Software • Database
The Senior Site Reliability Engineer will manage AWS infrastructures, improve CI/CD pipelines, and assist teams with scaling solutions. Responsibilities include overseeing logging, monitoring, and high-quality software development with strong security and reliability considerations.
Top Skills:
AnsibleAWSChefCloudFormationDatadogDockerDynamoElasticsearchGithub ActionsMySQLOpensearchPostgresPuppetPythonRedisS3Terraform
Top Los Angeles, CA Companies Hiring Remote Site Reliability Engineers
See AllPopular Job Searches
All Filters
Total selected ()
No Results
No Results




.png)




























