GoodLeap Logo

GoodLeap

Site Reliability Engineer

Reposted 25 Days Ago
In-Office or Remote
7 Locations
97K-141K Annually
Mid level
In-Office or Remote
7 Locations
97K-141K Annually
Mid level
The Site Reliability Engineer will ensure application reliability and performance by designing automation, leading incident responses, and improving system stability in collaboration with development and operations teams.
The summary above was generated by AI
About GoodLeap:
GoodLeap is a technology company delivering best-in-class financing and software products for sustainable solutions, from solar panels and batteries to energy-efficient HVAC, heat pumps, roofing, windows, and more. Over 1 million homeowners have benefited from our simple, fast, and frictionless technology that makes the adoption of these products more affordable, accessible, and easier to understand. Thousands of professionals deploying home efficiency and solar solutions rely on GoodLeap’s proprietary, AI-powered applications and developer tools to drive more transparent customer communication, deeper business intelligence, and streamlined payment and operations. Our platform has led to more than $30 billion in financing for sustainable solutions since 2018.
 
GoodLeap is also proud to support our award-winning nonprofit, GivePower, which is building and deploying life-saving water and clean electricity systems, changing the lives of more than 1.6 million people across Africa, Asia, and South America.

Position Summary
The Site Reliability Engineer (SRE) role is a hybrid position that combines elements of software engineering and systems engineering to ensure the reliability, scalability, and performance of applications and services. This role is responsible for designing and implementing automation, monitoring, and incident response processes that support high service availability. The SRE will work closely with both development and operations teams to drive best practices in infrastructure management, reduce manual toil, and enable continuous improvement in system stability and operational efficiency. The position plays a critical role in supporting our DevOps initiatives and improving the overall health of our production environments.

Essential Job Duties and Responsibilities

  • Partner with engineering, DevOps, and product teams to understand system requirements, communicate reliability best practices, and embed a culture of shared ownership. Strong communication, empathy, and influence are key to success.
  • Lead incident response efforts, facilitate root cause analysis, and drive continuous improvements post-incident. Requires composure under pressure, clear decision-making, and the ability to bring teams together in critical moments.
  • Identify opportunities to reduce manual work by building and maintaining internal tools and automation pipelines. Emphasizes problem-solving, initiative, and a continuous improvement mindset.
  • Leverage DataDog to enhance system visibility, improve alerting strategies, and ensure observability across services. Requires proactive thinking, a focus on end-user impact, and the ability to coach teams on effective usage of monitoring tools.
  • Develop and maintain documentation including runbooks, service readiness guides, and knowledge articles to support operational excellence. Strong written communication and a focus on clarity are essential.
  • Collaborate with teams to support scaling initiatives and optimize system performance using data-informed insights. Requires strategic thinking, collaboration, and attention to long-term growth needs.

Required Skills, Knowledge and Abilities

  • Solid understanding of the Software Development Lifecycle (SDLC), including source control, defect tracking, automated build systems, and production control processes
  • Strong knowledge of CI/CD and DevOps principles, tools, and integrations
  • Hands-on experience with Amazon Web Services (AWS), including services such as DynamoDB, CloudFormation, CloudFront, S3, Route53, Lambda, and YAML configuration
  • Proficiency with containerization and serverless technologies
  • Experience with infrastructure as code tools, particularly Terraform and Kubernetes
  • Strong understanding of observability concepts, including tracing, structured logging, and metrics
  • Experience using application and infrastructure monitoring tools—specifically DataDog—to ensure system health and performance
  • Familiarity with designing and implementing self-healing, fault-tolerant, and autoscaling systems
  • Experience working with SQL and relational databases; familiarity with MongoDB Cloud Atlas is a plus
  • Proficiency with Git and source control workflows; understanding of change management best practices
  • Demonstrated problem-solving and analytical skills in fast-paced environments
  • Excellent verbal and written communication skills, with the ability to explain complex technical topics to both technical and non-technical stakeholders
  • Self-motivated with a strong sense of ownership, accountability, and follow-through

Additional Information Regarding Job Duties and Job Descriptions:

Job duties include additional responsibilities as assigned by one's supervisor or other managers related to the position/department. This job description is meant to describe the general nature and level of work being performed; it is not intended to be construed as an exhaustive list of all responsibilities, duties and other skills required for the position. The Company reserves the right at any time with or without notice to alter or change job responsibilities, reassign or transfer job position or assign additional job responsibilities, subject to applicable law. The Company shall provide reasonable accommodations of known disabilities to enable a qualified applicant or employee to apply for employment, perform the essential functions of the job, or enjoy the benefits and privileges of employment as required by the law.

If you are an extraordinary professional who thrives in a collaborative work culture and values a rewarding career, then we want to work with you!  Apply today!

Top Skills

AWS
Datadog
Git
Kubernetes
MongoDB
SQL
Terraform

GoodLeap Irvine, California, USA Office

22 Executive Park, Irvine, CA, United States, 92614

Similar Jobs

2 Days Ago
Remote or Hybrid
San Diego, CA, USA
111K-172K Annually
Junior
111K-172K Annually
Junior
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
As a Site Reliability Engineer, you'll enhance the reliability and performance of ServiceNow's infrastructure, troubleshooting issues, driving automation, and employing DevOps practices.
Top Skills: AutomationAWSAzureCloud TechnologiesDevOpsJavaScriptLinuxMySQLPythonRuby
3 Days Ago
Remote or Hybrid
San Diego, CA, USA
156K-273K Annually
Senior level
156K-273K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead Site Reliability Engineering efforts within the DevSecOps team, focusing on operational excellence, security, reliability, and cost optimization for security services, while mentoring a team of SREs.
Top Skills: AIAnsibleAWSAzureBashDockerElkGCPGrafanaKubernetesPrometheusPythonTerraform
3 Days Ago
Remote or Hybrid
Chicago, IL, USA
156K-273K Annually
Senior level
156K-273K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead the Site Reliability Engineering efforts for the DevSecOps team, focusing on operational excellence, security, reliability, and cost optimization of security services.
Top Skills: AnsibleAWSAzureBashDockerElkGCPGrafanaKubernetesPrometheusPythonServicenowTerraform

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account