Zscaler

Senior Director, Site Reliability Engineering

Posted 51 Minutes Ago
Hybrid
San Jose, CA
231K-330K Annually
Senior level
The Senior Director of Site Reliability Engineering defines technical strategies, leads high-performing teams, and improves operational reliability for Zscaler's global platform, focusing on automation and observability standards.
The summary above was generated by AI

About Zscaler

Zscaler accelerates digital transformation to ensure our customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise, we are constantly pushing the envelope, leveraging the world’s largest security data lake to power our cloud-native Zero Trust Exchange platform. This innovation protects our customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location.

Here, impact in your role matters more than title and trust is built on results. We say, impact over activity. We seek innovators who actively use AI to amplify their impact and who thrive in an environment where we leverage intelligent systems to stay ahead of evolving threats. We believe in transparency and value constructive, honest debate—we’re focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession, collaboration, ownership, and accountability.

We value high impact and high accountability with a sense of urgency, in an environment where you’re enabled to do your best work and embrace your potential. If you’re driven by purpose, thrive on solving complex challenges, and want to be part of the team that’s helping to secure the AI age, we invite you to bring your talents to Zscaler and help shape the future of cybersecurity.

Role

We are looking for a Senior Director, Production Engineering to join our team. This role is available as a hybrid opportunity 3 days a week in San Jose, CA or as a remote position, reporting to VP, Engineering in the Cloud Infrastructure & Operations department. Join Zscaler to set the strategic direction and lead the organization responsible for the reliability and operational excellence of our global platform protecting over 15 million users.

In this role, you will define the long-term technical vision and operational strategy, leading high-priority investments to drive an "automation-first" culture and architect reliability into our next generation of products. You will mature observability standards and define company-wide SLIs/SLOs, acting as the executive owner for programs aligned to achieve availability goals and ensuring the scalability and resilience of our globally distributed, multi-cloud infrastructure.

What you’ll do (Role Expectations)

  • Define the multi-year technical strategy and roadmap for Production Engineering, focusing on platform architecture, automation, and operational standards across AWS, Azure, GCP, and bare-metal environments.
  • Lead, mentor, and grow a high-performing SRE organization, including hiring and developing leaders (e.g., Directors and Principal SREs), and championing a culture of execution, accountability, and continuous improvement.
  • Drive the "AI-first" (and "automation-first") mandate at an organizational level, sponsoring large-scale engineering initiatives to eliminate systemic toil and build advanced self-healing capabilities.
  • Establish and enforce enterprise-wide standards for observability (Prometheus, Grafana, OpenTelemetry), SLIs/SLOs, and error budget discipline across all engineering teams.
  • Serve as the executive owner for Service Health Reviews, partnering closely with Product, Security, and Engineering leadership to align priorities, manage critical dependencies, and drive company-wide maturity in post-incident analysis, systematic problem management, and the reliability of mission-critical customer-facing systems.

Who You Are (Success Profile)

  • You are a visionary leader who sets a compelling long-term technical and operational strategy.
  • You act like an owner, operating with integrity, a strong bias for action, and the ability to seamlessly navigate between executive strategy and tactical execution.
  • You are customer-obsessed, anchoring major decisions in solving real-world problems and championing availability as a core customer experience metric.
  • You champion simplicity by distilling highly complex, distributed systems problems into clear, actionable plans and organizational goals.
  • You lead with urgency and constructive energy, inspiring a large team to deliver high-impact results with relentless focus on both speed and quality.

What We’re Looking for (Minimum Qualifications)

  • 18+ years of relevant experience, including at least 10 years leading large-scale engineering or SRE organizations delivering mission-critical, production-grade systems.
  • Deep technical mastery and strategic understanding of distributed architecture, high-scale networking protocols, Linux systems, and multi-cloud environments (AWS, Azure, GCP).
  • Proven experience setting and executing an operational strategy across multiple departments, with a track record of significantly improving platform availability, performance, and MTTM.
  • Exceptional executive-level cross-functional leadership and communication skills, with demonstrated ability to influence product roadmaps and engineering culture across a global organization.
  • Strong production ownership experience: defining and meeting SLOs/SLIs, driving continuous reliability improvements, and managing high-stakes incident response programs.

What Will Make You Stand Out (Preferred Qualifications)

  • Cloud Migration & Infrastructure-as-Code: Proven track record of successfully migrating large, complex systems to cloud-native architectures while leveraging IaC (Ansible, Terraform) at an enterprise scale to manage global infrastructure.
  • Global Networking & L7 Proxy Architectures: Expertise in global routing (BGP, OSPF) and high-performance L7 proxy architectures (HAProxy, Envoy) within high-volume, multi-tenant environments.
  • Resilience & Disaster Recovery: Deep experience with large-scale chaos engineering, resilience testing, and comprehensive disaster recovery planning.

Zscaler’s salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training.

The base salary range listed for this full-time position excludes commission, bonus, and equity (if applicable), as well as benefits.

Base Pay Range
$231,000-$330,000 USD

At Zscaler, we are committed to building a team that reflects the communities we serve and the customers we work with. We foster an inclusive environment that values all backgrounds and perspectives, emphasizing collaboration and belonging. Join us in our mission to make doing business seamless and secure.

Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including:

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!

Learn more about Zscaler's hybrid working model and benefits here.

By applying for this role, you agree to adhere to applicable laws, regulations, and Zscaler policies, including those related to security and privacy standards and guidelines.

Zscaler is committed to providing equal employment opportunities to all individuals. We strive to create a workplace where employees are treated with respect and have the chance to succeed. All qualified applicants will be considered for employment without regard to race, color, religion, sex (including pregnancy or related medical conditions), age, national origin, sexual orientation, gender identity or expression, genetic information, disability status, protected veteran status, or any other characteristic protected by federal, state, or local laws. See more information by clicking on the Know Your Rights: Workplace Discrimination is Illegal link.

Pay Transparency

Zscaler complies with all applicable federal, state, and local pay transparency rules.

Zscaler is committed to providing reasonable support (called accommodations or adjustments) in our recruiting processes for candidates who are differently abled, have long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support.
