StarCompliance Logo

StarCompliance

Site Reliability Engineer US

Posted Yesterday
Remote
Hiring Remotely in US
Senior level
Remote
Hiring Remotely in US
Senior level
The Site Reliability Engineer will enhance system reliability, implement observability tools, and collaborate with teams to improve SaaS applications.
The summary above was generated by AI
About StarCompliance
StarCompliance is on a mission to make compliance simple and easy. Trusted globally by enterprise financial institutions, the user-friendly STAR platform empowers organizations to achieve regulatory compliance while safeguarding their integrity and business reputations. Through a customizable, 360-degree view of employee activity, the STAR software enables firms to automate the detection and resolution of potential areas of conflict while streamlining daily workflows and increasing efficiency. 

Location: Candidates MUST be US East Coast  

We are seeking a highly skilled and pragmatic Site Reliability Engineer (SRE) to help lead our evolution from legacy single-tenant monoliths to modern, scalable, multi-tenant microservices. This is a pivotal role for our business, enabling faster delivery, improved reliability, and real scalability across our SaaS portfolio. 
While we’ve got a solid handle on infrastructure monitoring, we’re still in the early innings when it comes to application-level observability, autoscaling, and progressive delivery strategies (e.g., canary releases, blue/green deployments). That’s where you come in.

You’ll work closely with Infrastructure, Architecture, Engineering, and Support teams to design, build, and evangelize the next generation of SRE practices and tools that ensure uptime, resiliency, and customer trust. 

Responsibilities

  • Champion Reliability by Design: Collaborate with architects and engineers to build resilient, fault-tolerant systems across our evolving cloud-native stack. 
  • Observability Overhaul: Lead the charge on full-stack observability, leveraging modern APM tooling, meaningful SLOs/SLIs, and actionable alerts. 
  • Scaling Systems: Develop and implement auto-scaling strategies, load testing plans, and capacity forecasting for multi-tenant environments. 
  • Progressive Delivery: Help implement and automate deployment strategies such as canary releases, feature flags, and blue/green rollouts. 
  • Incident Response: Create and refine on-call processes, incident response playbooks, and blameless post-mortem routines. 
  • Monitoring & Tooling: Own and evolve our monitoring infrastructure, integrating metrics, logs, and traces into a cohesive ecosystem. 
  • Developer Empowerment: Build reusable templates, dashboards, and platform tooling to empower dev teams to “shift left” on reliability. 
  • Cross-functional Collaboration: Work hand-in-hand with Infrastructure, Architecture, Support, and Engineering teams to drive shared accountability for uptime and performance. 

Skills

  • 5+ years in SRE, DevOps, or Production Engineering roles, ideally within a SaaS or cloud-native environment. 
  • Deep experience with cloud platforms (preferably Azure or AWS), and Infrastructure-as-Code tools (e.g. Terraform). 
  • Hands-on experience with Azure DevOps is strongly preferred, as our CI/CD and project workflows are fully built around it. 
  • Proficiency with observability tools such as New Relic, Datadog, Prometheus, or similar. 
  • Strong understanding of software deployment strategies, CI/CD pipelines, and release engineering. 
  • Ability to code in at least one modern scripting or systems language (e.g., Python,PowerShell, Go, Bash). 
  • Experience operating multi-tenant environments with an emphasis on security, performance, and cost optimization. 
  • Excellent communicator who thrives in cross-functional settings and can influence engineering culture around reliability. 

Desirable Skills

  • Experience in regulated industries (e.g., financial services, healthcare). 
  • Background with service mesh architectures, distributed tracing, and gRPC/GraphQL. 
  • Familiarity with incident management platforms (e.g., PagerDuty, OpsGenie). 
  • Contributions to open-source SRE tooling or frameworks. 

StarCompliance Background Checks

All positions require pre-employment screening due to employees potentially having access to highly sensitive and confidential information involving finance and compliance; candidates must be trustworthy and have a heightened sensitivity to protecting confidential financial, professional information.  To be eligible for employment with StarCompliance, candidates must undergo a rigorous background investigation with checks including, but not limited to, criminal record history, consumer credit, employment history, qualifications, and education checks.  


Equal Opportunity Employer Statement

We prohibit discrimination and harassment of any kind based on race, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, gender identity or expression, marital/civil union/domestic partnership status, veteran status or any other protected characteristic as outlined by country, state, or local laws.

This policy applies to all employment practices within our organisation, including hiring, recruiting, promotion, termination, layoff, recall, leave of absence, compensation, benefits, training, and apprenticeship. StarCompliance makes hiring decisions based solely on qualifications, merit, and business needs at the time. For more information, please request a copy of our Equal Opportunities Policy.

Top Skills

AWS
Azure
Azure Devops
Bash
Datadog
Go
New Relic
Powershell
Prometheus
Python
Terraform

Similar Jobs

2 Days Ago
In-Office or Remote
4 Locations
124K-206K Annually
Senior level
124K-206K Annually
Senior level
Cloud • Information Technology
The Site Reliability Engineer will support IaaS services, monitor infrastructure health, perform root cause analysis, automate processes, and collaborate with teams for service reliability.
Top Skills: AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
3 Days Ago
Easy Apply
In-Office or Remote
3 Locations
Easy Apply
124K-206K Annually
Senior level
124K-206K Annually
Senior level
Analytics
The Site Reliability Engineer will ensure the reliability and performance of IaaS services, perform incident resolution, and enhance system reliability through automation while supporting mobility across hybrid infrastructures and collaborating extensively with various teams.
Top Skills: AnsibleAWSAzureBashGitlab CiJenkinsKubernetesLinuxOpenshiftPythonTerraformVmware Vsphere
8 Days Ago
In-Office or Remote
Boston, MA, USA
119K-165K Annually
Senior level
119K-165K Annually
Senior level
Security • Software
The Senior Site Reliability Engineer will support AWS infrastructure, manage SaaS reliability, implement automation, respond to incidents, and collaborate with teams to enhance performance.
Top Skills: AnsibleAWSCloudFormationCloudwatchDatadogGrafanaHelmKubernetesOpensearchPagerdutyPythonSaltTerraform

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account