Runwise

Senior Site Reliability Engineer

Posted 11 Days Ago

Remote

Hiring Remotely in USA

140K-190K

Senior level

Responsible for maintaining system stability and performance. Collaborate with engineering teams to automate workflows, improve reliability, and manage incident response. Conduct capacity planning and advocate for best engineering practices.

The summary above was generated by AI

Runwise is looking for a Senior Site Reliability Engineer (Sr. SRE) who is highly motivated, results-oriented, and passionate about building products that customers love.

Runwise (www.runwise.com) is a fast-paced, customer-focused climate-tech startup that controls and runs the key energy systems (heating, water, etc.) in 6,000+ buildings throughout the US. Runwise’s unique hardware and software service significantly reduces energy usage, substantially lowering costs and carbon output. As of today, Runwise’s technology cuts carbon emissions by the equivalent of taking 50,000 cars off the road each year. Runwise has offices in New York and Boston but has been a remote-first product/engineering company since its creation.

We’re looking for a Senior Site Reliability Engineer (Sr. SRE) to join our infrastructure team. You will be responsible for maintaining the stability and performance of our services, ensuring they are reliable, scalable, and fault-tolerant. You’ll work closely with hardware and software engineers to build and maintain tools that improve the reliability and efficiency of our systems.

Responsibilities will include, but are not limited to:

Design and maintain scalable infrastructure in AWS cloud and distributed on-prem systems
Automate infrastructure provisioning, deployment pipelines, and operational workflows using tools like Terraform, Ansible, or Helm
Build and improve monitoring, alerting, and observability systems (e.g., Cloud Health, Grafana)
Collaborate with development teams to improve service reliability, performance, and scalability
Participate in on-call rotation and manage incident response, including root cause analysis and postmortems
Define and track service-level objectives (SLOs) and service-level indicators (SLIs)
Conduct capacity planning, chaos testing, and disaster recovery exercises
Advocate for engineering best practices across CI/CD, security, and fault tolerance

You have:

5+ years of experience in Site Reliability Engineering, DevOps, or infrastructure-focused roles.
Proven success managing production systems in cloud environments like AWS, with a strong understanding of scalability and fault tolerance. Additional experience with distributed IoT systems is a huge plus.
Experience using infrastructure-as-code tools like AWS CloudFormation and Ansible to manage and automate deployments.
Strong scripting or development skills in Python, Go, and Bash for building tools and automating workflows.
Hands-on experience with observability and alerting systems like Prometheus, Grafana, or CloudWatch.
Deep familiarity with CI/CD practices and tools, especially GitHub Actions, and a track record of improving build and release automation.
Comfort participating in on-call rotations and managing incident response, including postmortems and service recovery.
Ability to collaborate effectively across remote, distributed teams, with strong asynchronous communication and documentation habits.
A proactive mindset with a focus on continuous improvement, resilience, and customer impact.
Excitement about working in a fast-paced climate-tech company making a measurable environmental difference.

Salary range: $140,000-$190,000 based on experience level

What you believe:

No job is too small.
Sincerity builds trust.
Setbacks fuel progress.
Efficiency is vital.

Benefits:

Medical, dental, and vision insurance
HSA & FSA options
Paid Parental Leave
Access to Talkspace & Health Advocate
Flexible PTO
Commuter Benefits
401K
Company-paid life insurance
Voluntary supplemental life insurance
Free in-office lunch on Wednesdays
Hybrid work environment
Summer Fridays
Monthly L&D Series
Employee Resource Groups (e.g. DEIB Committee, Run Club)

This is an excellent opportunity to join a fast-growing company, one of the true leaders in energy efficiency in the Northeast. You will be surrounded by talented people and will work very closely with our co-founder and sales leader. Your success will also make a tangible impact on reducing carbon emissions in the cities where we operate across the country.

Top Skills

Ansible

AWS

Bash

Cloud Health

CloudWatch

GitHub Actions

Grafana

Helm

Prometheus

Python

Terraform
