SentinelOne

Director, Site Reliability Engineering

Posted 12 Days Ago

Be an Early Applicant

Remote

Hiring Remotely in United States

198K-273K Annually

Senior level

Remote

Hiring Remotely in United States

198K-273K Annually

Senior level

Lead the Site Reliability Engineering team, ensure product reliability, manage incidents, and collaborate with various engineering teams to improve system performance.

The summary above was generated by AI

About Us

At SentinelOne, we’re redefining cybersecurity by pushing the limits of what’s possible—leveraging AI-powered, data-driven innovation to stay ahead of tomorrow’s threats.

From building industry-leading products to cultivating an exceptional company culture, our core values guide everything we do. We’re looking for passionate individuals who thrive in collaborative environments and are eager to drive impact. If you’re excited about solving complex challenges in bold, innovative ways, we’d love to connect with you.

What are we looking for?

We are seeking an experienced engineering and operational director to lead our Site Reliability Engineering (SRE) team at SentinelOne. As the Director of SRE, you will manage a team of SRE professionals responsible for ensuring the reliability and scalability of our products and production services, focusing on the experience our customers have in production every day. You will work closely with other engineering teams to identify and address availability, performance, and capacity issues, and you’ll be a key partner for our externally facing teams including Support, Customer Success, and Sales Engineering. This is a highly visible role within S1 with frequent executive communication opportunities, and is a great opportunity to do good work with good people all around the world.

As a team we value

Thinking from first principles, understanding second order impacts
Curiosity to understand new systems, their operating principles and limitations
Strong operational ownership and a desire to reduce toil via automation
A drive to learn, especially from prior failures
Courage to take risks and make things happen
Empathy and humility to collaborate effectively with peers and across teams

What will you do?

Grow and lead a team of SRE professionals, including setting performance goals and measuring deliverables against key metrics, while evolving those metrics as S1 grows and needs develop
Invest in data-driven deep triage on recurring issues, collaborating with other engineering teams to identify and address issues related to reliability, performance, and capacity
Develop, improve, and implement processes for the full incident lifecycle including incident management, post-incident analysis, and learning from incidents Lead incident response efforts, including coordinating with other teams to investigate and resolve customer-impacting incidents
Design support model for SRE regarding service maturity and service ownership, including monitoring and alerting improvements and SLI / SLO design and implementation
Analyze production metrics and signals to identify areas for improvement and take proactive steps to mitigate issues
Develop and implement best practices and standards for Site Reliability Engineering, from day to day operations to hiring and planning
Communicate effectively with cross-functional teams to ensure alignment on objectives and priorities. Deliver outcomes, not just stories and tasks.

What experience or knowledge should you bring?

10+ years of engineering experience, with at least 5 years in a leadership role

Demonstrated experience leading technical and operational teams at various stages of maturity
Excellent analytical and problem-solving skills
Familiarity with modern software development methodologies, tools, and techniques including CI/CD
Experience working with cloud-native applications and large scale distributed systems including a working knowledge of technologies such as Kubernetes and Terraform/IaC and cloud providers such as AWS or GCP
Experience with various monitoring and alerting techniques and tools, including frameworks and concepts such as SLOs, OTel and Golden Signals as well as tooling such as Prometheus and Grafana
Extensive experience with incident response and management at various layers of the stack across different business needs and applications, including both hands on experience leading incidents/post-incident analysis and experience driving broader incident management initiatives
Ability to thrive in a fast-paced, dynamic environment
Driven by curiosity and humility - complex distributed systems are complex, so ask the “silly” question and seek out answers

Why us?

You will be joining a cutting-edge company where you will tackle extraordinary challenges and work with the very best in the industry.

Medical, Vision, Dental, 401(k), Commuter, Health and Dependent FSA
Unlimited PTO
Industry-leading gender-neutral parental leave
Paid Company Holidays
Paid Sick Time
Employee stock purchase program
Disability and life insurance
Employee assistance program
Gym membership reimbursement
Cell phone reimbursement
Numerous company-sponsored events, including regular happy hours and team-building events

This U.S. role has a base pay range that will vary based on the location of the candidate. For some locations, a different pay range may apply. If so, this range will be provided to you during the recruiting process. You can also reach out to the recruiter with any questions.

Base Salary Range

$198,000—$272,800 USD

SentinelOne is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

SentinelOne participates in the E-Verify Program for all U.S. based roles.

Top Skills

AWS

Ci/Cd

GCP

Grafana

Kubernetes

Prometheus

Terraform

Similar Jobs

GHX

Senior Security Engineer

48 Minutes Ago

Easy Apply

Remote or Hybrid

United States

Easy Apply

109K-146K

Mid level

109K-146K

Mid level

Cloud • Healthtech • Payments • Professional Services • Software • Analytics • Automation

The Senior Security Engineer will enhance security operations, manage DLP strategies, lead security projects, and collaborate on incident responses.

Top Skills: Application SecurityCloud SecurityData Loss PreventionData SecurityEmail SecurityIntrusion Detection SystemsNetwork Security

King's Hawaiian

Business Development Manager

49 Minutes Ago

Remote or Hybrid

Fort Lauderdale, FL, USA

Junior

Food • Sales • Manufacturing

The Business Development Manager will facilitate business growth for the Irresistible Foods Group, partnering with sales leadership to develop strategies, cultivate relationships, and leverage data insights to support new business integration and increase revenue.

Top Skills: Data ExplorationEtl ProcessesExcelIri,Synidcated DataMicrosoft 365Power BITableau

Comcast Advertising

Quality Assurance Engineer

2 Hours Ago

Remote or Hybrid

Pennsylvania, USA

63K-147K Annually

Senior level

63K-147K Annually

Senior level

AdTech • Digital Media • Marketing Tech

The QA Engineer will validate software solutions, develop test strategies, collaborate with engineers on defect resolution, and mentor junior team members to ensure quality assurance standards are met.

Top Skills: .NetAngularAWSAzureC#DockerKubernetesPythonSap AbapSQL

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
Key Industries: Artificial intelligence, adtech, media, software, game development
Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering