Abnormal Security Logo

Abnormal Security

Senior Software Engineer - Site Reliability

Posted 4 Days Ago
Remote
Hiring Remotely in USA
176K-207K Annually
Senior level
Remote
Hiring Remotely in USA
176K-207K Annually
Senior level
The Senior Software Engineer - Site Reliability will enhance operational excellence, improve service reliability, and mentor engineers while defining goals and roadmaps for the SRE team.
The summary above was generated by AI
About the Role

Abnormal Security is looking for a Senior Software Engineer - Site Reliability to join our Infrastructure team. In this role, you will be responsible for the reliability, scalability, and operational excellence of our systems and services. You will lead initiatives to improve the operational maturity of both SRE-managed services and critical product systems, driving change across the organization in support of stable operations.

As a senior member of the team, you will independently define and execute quarterly goals, create forward-looking roadmaps, and own cross-functional projects aligned with company-level objectives. You will serve as a key advocate for reliability, providing technical leadership, deep analysis, and mentorship while embedding with product teams as needed to improve service ownership and incident response practices.

The ideal candidate:

  • Has strong technical depth in distributed systems and operational excellence
  • Possesses a product-focused mindset with the ability to translate business needs into reliability goals
  • Is a strong communicator and mentor, able to influence both within the SRE team and across engineering
  • Has demonstrated experience leading broad technical initiatives across teams and systems
What You Will Do
  • Own the operational maturity of services in the SRE software stack, driving architectural and tooling improvements
  • Proactively partner with product teams to embed SRE best practices and support services with operational challenges
  • Independently define and drive quarterly goals for the SRE team with measurable impact on system reliability and developer productivity
  • Design and maintain systems that promote observability, automated recovery, scalability, and resilience
  • Lead incident reviews and root cause analyses; ensure follow-up actions are implemented and shared across teams
  • Collaborate with engineering leadership to shape the team roadmap and contribute to company-wide reliability goals
  • Mentor other engineers and drive adoption of SRE principles throughout the engineering organization
Must Have
  • 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering roles
  • Deep knowledge of production-grade distributed systems and cloud-native architectures
  • Demonstrated experience managing service availability, latency, and incident response in production environments
  • Strong programming skills in Python, Go, or similar languages
  • Experience with Kubernetes, Terraform, and observability tools (e.g., Prometheus, Grafana, Datadog)
  • Proven ability to lead complex, multi-team initiatives and influence system design for reliability
Nice To Have
  • Prior experience embedding with product engineering teams to support operational goals
  • Familiarity with AWS and multi-cloud environments (e.g., Azure, GCP)
  • Experience in regulated environments or with FedRAMP-compliant systems
  • Contributions to open-source SRE tooling or community knowledge sharing

#LI-NT1

At Abnormal AI, certain roles are eligible for a bonus, restricted stock units (RSUs), and benefits. Individual compensation packages are based on factors unique to each candidate, including their skills, experience, qualifications and other job-related reasons. We know that benefits are also an important piece of your total compensation package. Learn more about our Compensation and Equity Philosophy on our Benefits & Perks page.

Base pay range:
$176,000$207,050 USD
San Francisco/New York Base pay range:
$195,000$230,000 USD

Abnormal AI is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status or other characteristics protected by law. For our EEO policy statement please click here. If you would like more information on your EEO rights under the law, please click here.

Top Skills

AWS
Azure
Datadog
GCP
Go
Grafana
Kubernetes
Prometheus
Python
Terraform

Similar Jobs

12 Days Ago
Remote
USA
176K-207K Annually
Senior level
176K-207K Annually
Senior level
Security • Cybersecurity
The Senior Software Engineer will ensure system reliability and scalability, lead initiatives for operational excellence, and mentor team members.
Top Skills: DatadogGoGrafanaKubernetesPrometheusPythonTerraform
16 Days Ago
Remote
USA
111K-178K Annually
Senior level
111K-178K Annually
Senior level
Artificial Intelligence • Cloud • Fintech • Professional Services • Software • Analytics • Financial Services
As a Senior Software Engineer in the SRE team, you will design and develop solutions for reliability and performance, collaborating with engineering teams on internal tools and observability features.
Top Skills: AWSDartDockerGitGitGoJavaKafkaKubernetesMySQLNginxOpentelemetryPostgresPythonReactSnowflakeTerraformTypescript
18 Days Ago
Remote
United States
Senior level
Senior level
Information Technology
Enhance the reliability and performance of Spreedly's payments platform by implementing monitoring, incident management, and optimizing application performance.
Top Skills: AWSCockroachdbDatadogElixirOpentelemetryPostgresRuby on RailsRuby

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account