Top Remote Site Reliability Engineer Jobs in Los Angeles, CA
Artificial Intelligence • Software • Generative AI
As a Site Reliability Engineer, you'll design and maintain cloud infrastructure, automate provisioning, ensure system reliability, and mentor junior engineers while leveraging various technologies to optimize performance and security.
Top Skills:
AWS, Azure, Docker, ELK Stack, GCP, Go, Grafana, Java, Kubernetes, Prometheus, Python, Scala, Terraform
Cloud • Security • Software • Cybersecurity
As a Staff Site Reliability Engineer, you will lead SRE initiatives, mentor engineers, ensure system reliability, and drive strategic engineering practices globally.
Top Skills:
C#, Go, Grafana, Java, JavaScript, Kubernetes, OpenTelemetry, Prometheus, Pulumi, Terraform, TypeScript
Cloud • Security • Software • Cybersecurity
The Principal Site Reliability Engineer will lead Veeam's global SRE efforts, focusing on architecture, reliability strategies, and mentorship while influencing cross-functional teams.
Top Skills:
Automation Tooling, Cloud Infrastructure, Cloud-Native Development, Distributed Systems
Cloud • Software
As a Site Reliability Engineer, you'll manage technical escalations, ensure system reliability, collaborate with engineering teams, and participate in on-call rotations.
Top Skills:
Ansible, Azure, Bash, C#, Chef, ELK, Git, GitHub Actions, GitLab, Grafana, Jenkins, Linux/Unix, Prometheus, Pulumi, Python, Splunk, SVN, TCP/IP, Terraform
Cloud • Software
The Site Reliability Engineer at Redis will handle technical escalations, ensure system reliability, collaborate with engineering teams during incidents, and participate in on-call rotations.
Top Skills:
Ansible, Azure, Bash, C#, Chef, ELK, Git, GitHub Actions, GitLab, Grafana, Jenkins, Linux, Prometheus, Pulumi, Python, Splunk, SVN, TCP/IP, Terraform
Cloud • Software
As a Site Reliability Engineer, you will ensure system reliability, handle technical escalations, and collaborate with engineering teams while providing on-call support.
Top Skills:
Ansible, Azure, Bash, C#, Chef, ELK, Git, GitHub Actions, GitLab, Grafana, Jenkins, Linux, Prometheus, Pulumi, Python, Redis, Splunk, SVN, TCP/IP, Terraform
Artificial Intelligence • Blockchain • Internet of Things • Machine Learning • Software • App development • Automation
Join the Gigster Talent Network as an SRE Support Engineer, providing support for scalable applications and cloud services, including troubleshooting and improving internal tools.
Top Skills:
Ansible, AWS, Bash, Datadog, Docker, GCP, Grafana, Kafka, Kubernetes, Prometheus, Puppet, Python, Spark, Splunk, Terraform
Sports
Manage and improve the AWS infrastructure, deploy into new regions, monitor releases, and implement new technologies in a fast-paced environment.
Top Skills:
AWS, Docker, Grafana, Kubernetes, Prometheus, Python
Cloud • Information Technology • Marketing Tech • Software
Lead the scaling and reliability of AWS infrastructure, implement automation, optimize performance, manage incidents, and mentor engineers in a startup environment.
Top Skills:
AWS, CloudFormation, CloudFront, Datadog, Docker, EC2, EKS, ELK, GitHub Actions, Go, Grafana, IAM, Jenkins, Kubernetes, Loki, Prometheus, Python, RDS, S3, Terraform, VPC
Blockchain • Software
As a Senior Engineer, SRE/DevOps, you will enhance blockchain infrastructure reliability, automate deployment, and collaborate on CI/CD practices while ensuring security and performance optimization.
Top Skills:
Ansible, AWS, Bash, CloudTrail, CloudWatch, Cosmos, Docker, ELK Stack, Ethereum, GCP, K8s, Kubernetes, Opsgenie, Pingdom, Python, Terraform
Artificial Intelligence • Software • Generative AI
This role involves designing and maintaining cloud infrastructure, automating provisioning, and enhancing system reliability through monitoring, collaboration, and mentorship.
Top Skills:
AWS, Azure, Docker, ELK Stack, GCP, Go, Grafana, Java, Kubernetes, Prometheus, Python, Terraform
Productivity • Software • Conversational AI
Design, build, and maintain scalable data infrastructure and frameworks. Collaborate with teams to implement data solutions while ensuring performance and reliability.
Top Skills:
Apache Kafka, AWS, Datadog, Go, Grafana, Java, Kubernetes, OpenTelemetry, Prometheus, Python, Terraform
Artificial Intelligence • Fintech • Software • Financial Services
Seeking a seasoned SRE to lead reliability for a cloud-native platform, overseeing infrastructure, CI/CD pipelines, observability, and mentoring engineers.
Top Skills:
AWS, ClickHouse, Go, Java, Kafka, Kubernetes, Pulumi, Terraform
Software
The Sr Devops & SRE Engineer will manage Terraform IaC, optimize cloud reliability, enhance observability, and improve developer experience with a focus on AWS resources.
Top Skills:
AWS, Datadog, Docker, GitHub Actions, Postgres, Ruby on Rails, React, Redis, Sentry, Sidekiq, Terraform
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills:
AWS, Fluent Bit, GCP, Jaeger, Kubernetes, Azure, Quickwit, Splunk, Vector, VictoriaMetrics
Big Data • Cloud • Software • Database
As a Senior Site Reliability Engineer, you will design and maintain MongoDB Atlas, ensuring reliability and automation for customer applications.
Top Skills:
AWS, Azure, GCP, Go, Linux, Python, Ruby
Blockchain • Software
As a Site Reliability Engineer at Offchain Labs, you will manage infrastructure in cloud environments, design CI/CD workflows, and enhance system reliability with a focus on blockchain technology.
Top Skills:
Argo CD, AWS, Azure, CodeBuild, GCP, GitHub Actions, Go, Grafana, Kubernetes, Loki, Prometheus, Python, Terraform
Artificial Intelligence • Machine Learning • Natural Language Processing • Software
The Public Sector Site Reliability Engineer will manage cloud infrastructure, ensure compliance with regulations, implement observability tools, lead incident response, and enhance automation in federal environments.
Top Skills:
Ansible, AWS GovCloud, Azure Government, Bash, Datadog, Docker, Elastic, Go, Grafana, Kubernetes, Prometheus, Pulumi, Python, Terraform
Healthtech • Software
Lead Site Reliability Engineer responsible for designing resilient systems, managing performance monitoring, defining SLIs/SLOs, and leading incident responses. Collaborate with teams for reliability improvements in cloud-based applications.
Top Skills:
AWS, CloudWatch, EC2, Microsoft IIS, S3, SQL Server, Windows Server
Fintech • Financial Services
The Site Reliability Engineer will manage AWS infrastructure, optimize performance, maintain Kubernetes clusters, and improve CI/CD pipelines for Seedify's platform.
Top Skills:
Ansible, AWS, Bash, Docker, GitHub Actions, Grafana, Helm, Kubernetes, Kustomize, New Relic, OpenTelemetry, Prometheus, Terraform, Terragrunt
News + Entertainment
As an Ads Reliability Engineer, you'll enhance the reliability of the Netflix Ad Suite by designing scalable systems, automating processes, collaborating with teams for observability, and responding to incidents while promoting a culture of reliability.
Top Skills:
AWS, Azure, GCP, Go, Java, Kubernetes, Python, Terraform
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
The role involves developing and managing scalable cloud infrastructure, automating tasks, and leading technical projects in a 24/7 on-call environment.
Top Skills:
Ansible, Apache Airflow, Argo, AWS, Debian, Docker, IaaS, Luigi, Python, Ruby, Scala, Terraform, Ubuntu
Information Technology • Software
The Site Reliability Engineer will ensure system reliability, handle technical escalations, create automation tools, and collaborate with engineering teams while participating in on-call rotations.
Top Skills:
Ansible, Azure, Bash, C#, Chef, ELK, Git, GitHub Actions, GitLab, Grafana, Jenkins, Linux, Prometheus, Pulumi, Python, Splunk, SVN, Terraform
Information Technology • Software
As a Site Reliability Engineer at Redis, you will manage technical escalations, ensure system reliability, collaborate with engineering teams, and participate in on-call rotations to support production systems.
Top Skills:
Ansible, Azure, Bash, C#, Chef, ELK, Git, GitHub Actions, GitLab, Grafana, Jenkins, Linux, Prometheus, Pulumi, Python, Splunk, SVN, TCP/IP, Terraform
Legal Tech • Software
As a Site Reliability Engineer at Litera, you'll ensure the stability and efficiency of SaaS products through automation, incident response, and system monitoring, while collaborating across teams to improve operations.
Top Skills:
Ansible, Apache, Cloud Platforms, Datadog, Dynatrace, IIS, Linux, New Relic, Puppet, SaaS, SQL, Terraform, Windows