Pacific Life Logo

Pacific Life

Lead Site Reliability Engineer (SRE)

Posted 3 Days Ago
Be an Early Applicant
Newport Beach, CA
160K-195K Annually
Senior level
Newport Beach, CA
160K-195K Annually
Senior level
Lead Site Reliability Engineer responsible for platform engineering, system design, optimizing reliability, performance, and automation while mentoring team members.
The summary above was generated by AI

Job Description:

Providing for loved ones, planning rewarding retirements, saving enough for whatever lies ahead – our policyholders count on us to be there when it matters most. It’s a big ask, but it’s one that we have the power to deliver when we work together. We collaborate and innovate – pushing one another to transform not just Pacific Life, but the entire industry for the better. Why? Because it’s the right thing to do. Pacific Life is more than a job, it’s a career with purpose. It’s a career where you have the support, balance, and resources to make a positive impact on the future – including your own.
We’re actively seeking a talented Lead Site Reliability Engineer (SRE) to join our Engineering Excellence team in Newport Beach, CA
• This role is hybrid. We believe in empowering our employees to get work done both in and out of the office.

 

As a Lead SRE you’ll move Pacific Life, and your career, forward by providing technical leadership, direction and accountability for platform engineering, system design and end-to-end implementation to meet and exceed the product or platform non-functional requirements including quality, security, reliability, availability and performance. The main responsibilities include, but are not limited to, optimizing design and engineering for new system and enhancements, including processes and day to day activities, to reliably support product rollout and operation in production.  As a lead SRE, the role will include both oversight for production operations of our portfolio of systems, as well as development/engineering of solutions to optimize system reliability and automation.

How you’ll help move us forward:

  • Lead the design, build and implement orchestration and tooling solutions to ensure that repetitive administration tasks are performed at a high level of efficiency and free of defect
  • Establish best practices for structuring, automating, building, deploying and monitoring complex distributed software products and environments.
  • Ensure the reliability and traceability of software releases and deployments of software and infrastructure changes.
  • Create and maintain platform architecture and design specifications to aid development, testing and maintenance of software environments
  • Design and implement monitoring and recovery tools to provide for site high availability (HA) and disaster recovery (DR)
  • Design and develop highly available infrastructure and platform components to meet the needs of our growing and evolving product lines
  • Design and implement security engineering best practices in all our deployed platform and environments
  • Triage alerts & diagnose/resolve critical issues, manage the implementation of changes
  • Manage the coordination, documentation, and tracking of critical incidents and corresponding root cause analysis, ensuring rapid and complete issue resolution and appropriate closed loop to customers and other key stakeholders.
  • Collaborate with Delivery Engineers and DevExp Engineers to enhance and implement continuous integration/continuous deployment orchestration system to reduce friction for software delivery to production
  • Lead, grow, mentor other SRE team member.
  • Evangelize the DevSecOps culture and SRE mindset, and mentor others about reliability and best practices.
  • Identify and work with other engineering discipline to implement opportunities for:
    • Automation
    • Signal to noise reduction
    • Prevention of recurring issues, and other actions to reduce time to mitigate service-impacting events and increase the productivity of cloud operations and development resources
  • Maintain a strong understanding of IaaS, PaaS, and SaaS offerings with building and maintaining a state-of-the-art, cloud-based environment for large-scale data processing
  • Design and implement processes, technology and automation for performance testing.
  • Ensure that implementation and solution are fully documented, and solution deployed with fully operationalized processes to support the solution lifecycle

The experience you bring:

  • 10-15 years of experience in infrastructure, system engineering, software engineering
  • Advanced knowledge in software engineering in test, testing automation frameworks and tools for application and/or any-as-code (infrastructure, configuration, development tools such as documentation or diagram as code)
  • Advanced knowledge in at least 3 of the following key areas: Cloud native and IaaS Architecture (performance testing, monitoring, operations), Design (compliance, security), Cloud Engineering (planning, provision), Containers orchestration solutions.
  • Strong understanding of business technology drivers and their impact on architecture design, performance and monitoring
  • A systematic problem-solving approach, coupled with strong communications skills and a sense of ownership and drive.
  • Hands-on experience in designing, analyzing, scaling, and troubleshooting medium to large scale distributed systems.
  • Practice and well-versed with SRE methodologies and passionate about solving operation problems through automation and software engineering.
  • Ability to communicate effectively vertically and horizontally within the organization about technical strategy in clear, concise, understandable terms appropriate to the audience technical understanding and expertise
  • Demonstrated ability to conceptualize, launch and deliver multiple engineering projects on time and within budget
  • Demonstrated ability to understand and troubleshoot complex problems under pressure

What makes you stand out:

  • Subject matter expert in designing and supporting one of the 3 major public cloud provider – AWS is a plus will consider any other public cloud providers experience
  • Demonstrated expertise in microservices lifecycle management (integration, testing, deployment)
  • Strong experience in multiple technologies in the following set of logging and monitoring tools: ELK stack, Prometheus, Stackdriver, New Relic, Datadog, Dynatrace, Splunk, AWS logging and monitoring
  • Expert knowledge of release software tooling (e.g. Jenkins or Jenkins X, Spinnaker, Harness, Azure Devops service or other Cloud specific cloud environment)
  • Expert level knowledge of containerization technologies including experience in optimizing Docker image and managing Docker image lifecycle
  • Expert level of knowledge for Kubernetes preferred but will consider experienced in other orchestration solution
  • Advanced experience with algorithms, data structures, complexity analysis and software design
  • Expert level of Linux/Unix/Window OS experience.

Base Pay Range:

The base pay range noted represents the company’s good faith minimum and maximum range for this role at the time of posting. The actual compensation offered to a candidate will be dependent upon several factors, including but not limited to experience, qualifications and geographic location.

$159,660.00 - $195,140.00

Base Pay Range:

The base pay range noted represents the company’s good faith minimum and maximum range for this role at the time of posting. The actual compensation offered to a candidate will be dependent upon several factors, including but not limited to experience, qualifications and geographic location. Also, most employees are eligible for additional incentive pay.

Your Benefits Start Day 1  
 

Your wellbeing is important to Pacific Life, and we’re committed to providing you with flexible benefits that you can tailor to meet your needs. Whether you are focusing on your physical, financial, emotional, or social wellbeing, we’ve got you covered.

  • Prioritization of your health and well-being including Medical, Dental, Vision, and Wellbeing Reimbursement Account that can be used on yourself or your eligible dependents

  • Generous paid time off options including: Paid Time Off, Holiday Schedules, and Financial Planning Time Off

  • Paid Parental Leave as well as an Adoption Assistance Program

  • Competitive 401k savings plan with company match and an additional contribution regardless of participation

EEO Statement:

Pacific Life Insurance Company is an Equal Opportunity /Affirmative Action Employer, M/F/D/V. If you are a qualified individual with a disability or a disabled veteran, you have the right to request an accommodation if you are unable or limited in your ability to use or access our career center as a result of your disability. To request an accommodation, contact a Human Resources Representative at Pacific Life Insurance Company.

Top Skills

Cloud
Datadog
Docker
Elk Stack
Iaas
Jenkins
Kubernetes
Linux
Paas
Prometheus
SaaS
Spinnaker
HQ

Pacific Life Newport Beach, California, USA Office

700 Newport Center Drive, Newport Beach, CA, United States, 92660

Similar Jobs

2 Days Ago
Easy Apply
Hybrid
San Francisco, CA, USA
Easy Apply
199K-283K
Senior level
199K-283K
Senior level
Cloud • Software
Lead the Production Engineering SRE team, focusing on DevSecOps, system reliability, security architecture, and team mentorship in cloud-native technologies.
Top Skills: ArgocdAWSDockerGoKubernetesOpentelemetryPrometheusPythonTerraform
5 Days Ago
Easy Apply
Remote
Hybrid
2 Locations
Easy Apply
148K-236K Annually
Senior level
148K-236K Annually
Senior level
Hardware • Information Technology • Security • Software • Cybersecurity • Conversational AI
As a Lead Site Reliability Engineer, you will enhance cloud infrastructure, automate operations, and troubleshoot complex production issues in a secure environment.
Top Skills: AnsibleAWSBashChefDirect ConnectDockerGoKubernetesPuppetPythonRestRubyScalaSoapTlsTransit GatewayUnix/LinuxVpc
8 Days Ago
Hybrid
San Mateo, CA, USA
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
Lead the Product Site Reliability Engineering team to enhance system reliability and performance, drive automation, and ensure observability best practices.
Top Skills: AWSAzureGCPSre Tools

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account