Runwise

Senior Site Reliability Engineer

Posted 11 Days Ago
Remote
Hiring Remotely in USA
140K-190K
Senior level
Responsible for maintaining system stability and performance. Collaborate with engineering teams to automate workflows, improve reliability, and manage incident response. Conduct capacity planning and advocate for best engineering practices.
The summary above was generated by AI

Runwise is looking for a Senior Site Reliability Engineer (Sr. SRE) who is highly motivated, results-oriented, and passionate about building products that customers love. 

Runwise (www.runwise.com) is a fast-paced, customer-focused climate-tech startup that controls and runs the key energy systems (heating, water, etc.) in 6,000+ buildings throughout the US. Runwise’s unique hardware and software service significantly reduces energy usage, substantially lowering costs and carbon output. As of today, Runwise’s technology takes the equivalent of 50,000 cars’ worth of carbon emissions off the road each year. Runwise has offices in New York and Boston but is a remote-first product/engineering company, and has been since its creation.

We’re looking for a Senior Site Reliability Engineer (Sr. SRE) to join our infrastructure team. You will be responsible for maintaining the stability and performance of our services, ensuring they are reliable, scalable, and fault-tolerant. You’ll work closely with hardware and software engineers to build and maintain tools that improve the reliability and efficiency of our systems.

Responsibilities will include, but are not limited to:

  • Design and maintain scalable infrastructure in AWS cloud and distributed on-prem systems 
  • Automate infrastructure provisioning, deployment pipelines, and operational workflows using tools like Terraform, Ansible, or Helm
  • Build and improve monitoring, alerting, and observability systems (e.g., Cloud Health, Grafana)
  • Collaborate with development teams to improve service reliability, performance, and scalability
  • Participate in on-call rotation and manage incident response, including root cause analysis and postmortems
  • Define and track service-level objectives (SLOs) and service-level indicators (SLIs)
  • Conduct capacity planning, chaos testing, and disaster recovery exercises
  • Advocate for engineering best practices across CI/CD, security, and fault tolerance

You have:

  • 5+ years of experience in Site Reliability Engineering, DevOps, or infrastructure-focused roles.
  • Proven success managing production systems in cloud environments like AWS, with a strong understanding of scalability and fault tolerance. Additional experience with distributed IoT systems is a huge plus.
  • Experience using infrastructure-as-code tools like AWS CloudFormation and Ansible to manage and automate deployments.
  • Strong scripting or development skills in Python, Go, and Bash for building tools and automating workflows.
  • Hands-on experience with observability and alerting systems like Prometheus, Grafana, or CloudWatch.
  • Deep familiarity with CI/CD practices and tools, especially GitHub Actions, and a track record of improving build and release automation.
  • Comfort participating in on-call rotations and managing incident response, including postmortems and service recovery.
  • Ability to collaborate effectively across remote, distributed teams, with strong asynchronous communication and documentation habits.
  • A proactive mindset with a focus on continuous improvement, resilience, and customer impact.
  • Excitement about working in a fast-paced climate-tech company making a measurable environmental difference.

Salary range: $140,000-$190,000 based on experience level 

What you believe: 

  • No job is too small.
  • Sincerity builds trust.
  • Setbacks fuel progress.
  • Efficiency is vital.

Benefits:

  • Medical, dental, and vision insurance
  • HSA & FSA options
  • Paid Parental Leave
  • Access to Talkspace & Health Advocate
  • Flexible PTO
  • Commuter Benefits
  • 401K
  • Company-paid life insurance
  • Voluntary supplemental life insurance
  • Free in-office lunch on Wednesdays
  • Hybrid work environment
  • Summer Fridays
  • Monthly L&D Series
  • Employee Resource Groups (e.g. DEIB Committee, Run Club)

This is an excellent opportunity to join a fast-growing company, one of the true leaders in energy efficiency in the Northeast. You will be surrounded by talented people and will work very closely with our co-founder and sales leader. Your success will also make a tangible impact on reducing carbon emissions in the cities where we operate across the country.

Top Skills

Ansible
AWS
Bash
Cloud Health
CloudWatch
GitHub Actions
Go
Grafana
Helm
Prometheus
Python
Terraform


