Get the job you really want.
Maximum of 25 job preferences reached.
Top Remote Site Reliability Engineer Jobs in Los Angeles, CA
Reposted YesterdaySaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
The role involves maintaining and improving CI/CD infrastructure using Argo Workflows and Kubernetes, ensuring effective deployment for engineering teams.
Top Skills:
AWSAzureGoGCPKubernetesPython
Sales • Software • Automation
Join the Infrastructure Team to build and maintain critical systems, automating database lifecycles and enhancing disaster recovery with a focus on resilience and simplicity.
Top Skills:
AnsibleArgocdAWSClickhouseDockerElasticsearchFlaskGithub ActionsGrafanaKubernetesMongoDBPostgresPythonRedisTerraform
Reposted 3 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Fintech • Mobile • Payments • Financial Services
This role involves setting technical strategies, collaborating across teams, managing operations and availability, and fostering a culture of quality and ownership within the Site Reliability Engineering team.
Top Skills:
AWSKotlinKubernetesMySQLPythonSpark
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
The Site Reliability Engineer will enhance reliability and observability, automate processes, support engineering teams, and promote a culture of reliability at Coinbase.
Top Skills:
AWSAzureDockerEc2GCPGoKubernetesRubyTerraform
Reposted 5 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Staff Engineer in the InfraSec team, you'll lead the design and deployment of security solutions for cloud platforms, automate monitoring, and manage security tooling while mentoring a small team of SREs.
Top Skills:
AnsibleAWSAzureCloudFormationGCPGoTerraform
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
The Sr. Engineer will manage CI/CD systems, lead project administration, enforce best practices, and improve service reliability while mentoring teams.
Top Skills:
Artifact Repository Services (ArtifactoryChefCi/Cd Tools (BazelGithub ActionsGithub)GitlabIac Provisioning Tools (AnsibleJenkins)NexusPuppetQuay.Io)Source Code Management (BitbucketTerraform)
Reposted 16 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
As a Staff Site Reliability Engineer, you will empower developers by optimizing MongoDB Atlas, ensuring seamless performance across multiple cloud platforms while fostering a supportive culture.
Top Skills:
AWSGCPAzureMongoDB
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Software
The AWS Cloud Architect will design, build, and optimize cloud infrastructure, ensuring scalability and security while mentoring junior SREs and defining cloud strategy.
Top Skills:
AnsibleAws Api GatewayAws CloudfrontAws CloudtrailAws CloudwatchAws DocumentdbAws Ec2Aws EksAws LambdaAws RdsAws S3Aws Secrets ManagerAws SsmDockerGrafanaHashicorp ConsulHashicorp TerraformHashicorp VaultKubernetesNew RelicPrometheus
Reposted 15 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
This role involves building and maintaining observability services, ensuring service reliability, and collaborating with other teams on best practices.
Top Skills:
AWSFluentbitGCPJaegerKubernetesAzureQuickwitSplunkVectorVictoriametrics
eCommerce • Legal Tech • Professional Services • Software • Data Privacy
The Site Reliability Engineer will ensure systems run smoothly, work with automation tools, resolve issues, and drive operational improvements.
Top Skills:
AWSAzureCloudFormationDockerGCPGrafanaKubernetesMemcachedNew RelicOpentelemetryPostgresPrometheusPulumiRedisSentryTerraform
Mobile • Software
Site Reliability Engineers will work on production infrastructure, focusing on AWS and Kubernetes while ensuring high availability and customer satisfaction.
Top Skills:
AirflowAWSCircleCICloudwatchEksGrafanaMongoDBPagerdutyPingdomRustScala SparkTerraformTypescript
Reposted 25 Days AgoSaved
Easy Apply
Easy Apply
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will support, maintain and grow the Atlas platform, focusing on automating processes and running multi-cloud environments.
Top Skills:
AWSAzureDnsGCPGoHTTPLinuxPythonRubyTls
New
Cut your apply time in half.
Use ourAI Assistantto automatically fill your job applications.
Use For Free
Software
As a Site Reliability Engineer, you'll ensure platform reliability through scalable systems, incident response, observability, and collaboration with engineering teams.
Top Skills:
AWSDatadogGrafanaKubernetesOpentelemetryPrometheusTypescript
Artificial Intelligence • Computer Vision • Hardware • Robotics • Metaverse
The Principal Staff SRE will lead initiatives in building and optimizing core infrastructure services on-prem and cloud, deploying and managing services at scale, and improving performance with automation and monitoring tools.
Top Skills:
DhcpDnsEbpfGoLdapLinuxNtpPythonTerraformXdp
Artificial Intelligence • Other • Sales • Software
As an Infrastructure Software Engineer, you'll design and improve core infrastructure for developer efficiency, manage Kubernetes, and automate operations.
Top Skills:
AWSAzureCi/CdCloudFormationGitopsGoGCPKubernetesPostgresPythonTerraform
Information Technology • Legal Tech
The role involves maintaining and improving Azure infrastructure, managing Infrastructure as Code with Terraform, enhancing security measures, and operating CI/CD pipelines.
Top Skills:
AzureAzure DevopsBashCircleCIDatadogEfkElkGithub ActionsPowershellPythonTerraform
Blockchain • Web3
As a Site Reliability Engineer, you'll enhance observability, logging, and tracing, collaborating with engineers to optimize performance and security of infrastructure.
Top Skills:
AnsibleAWSAws CdkGCPGitGoGrafanaKubernetesLgtmLokiMimirOpentelemetryPrometheusRustSentryTempoTerraformTypescriptWebassembly
Cloud • Security • Software • Analytics
The Site Reliability Engineer will develop and lead projects in areas like data platform architecture, capacity planning, disaster recovery, and cloud security, ensuring scalability and reliability of services.
Top Skills:
AnsibleBashGCPGkeGoKubernetesPulumiPython
Cloud • Information Technology • Analytics • Cybersecurity • Design
The Site Reliability Engineer will optimize GitLab, implement CI/CD practices, drive automation, and ensure compliance with federal security standards.
Top Skills:
AWSAzureBashConfluenceDockerGCPGitlabJIRAKubernetesLinuxPythonTerraform
Software
Improve monitoring, observability, and alerting; handle incidents; work with R&D; document actions; conduct on-call duties.
Top Skills:
AnsibleArgocdAWSAzureBashChefCoralogixDatadogDockerGitGitlabGCPHelmJavaScriptKubernetesNew RelicPrometheusPuppetPythonSplunk
Software • Cryptocurrency
Manage and scale Kubernetes clusters, automate infrastructure, optimize performance, maintain blockchain nodes, and improve system reliability while collaborating with product teams.
Top Skills:
Aws (Ec2Aws EksDatadogDockerIam)KubernetesOpentelemetryPulumiRdsS3Terraform
Software
Lead and manage engineering teams for ConductorOne's cloud infrastructure, ensuring reliability, security, and compliance while fostering team growth and culture.
Top Skills:
AICi/CdCloud InfrastructureIso 27001)KubernetesSecurity Compliance (Soc 2
Cloud • Security • Software • Cybersecurity
The Staff Site Reliability Engineer will enhance AI/ML infrastructure, manage CI/CD pipelines, ensure system reliability, and troubleshoot applications, focusing on cloud-based operations.
Top Skills:
AWSAzureBashDockerGitGitGCPGrafanaHuggingface TransformersKubernetesLlmPrometheusPythonPyTorchTensorrtTerraform
Artificial Intelligence • Software • Generative AI
As a Site Reliability Engineer, you'll design and maintain cloud infrastructure, automate provisioning, ensure system reliability, and mentor junior engineers while leveraging various technologies to optimize performance and security.
Top Skills:
AWSAzureDockerElk StackGCPGoGrafanaJavaKubernetesPrometheusPythonScalaTerraform
Cloud • Security • Software • Cybersecurity
As a Staff Site Reliability Engineer, you will lead SRE initiatives, mentor engineers, ensure system reliability, and drive strategic engineering practices globally.
Top Skills:
C#GoGrafanaJavaJavaScriptKubernetesOpentelemetryPrometheusPulumiTerraformTypescript
Popular Job Searches
All Filters
Total selected ()
No Results
No Results



.png)



























