Blink Health Logo

Blink Health

Staff Site Reliability Engineer

Reposted 9 Days Ago
Easy Apply
Remote
Hiring Remotely in USA
Senior level
Easy Apply
Remote
Hiring Remotely in USA
Senior level
The Staff Site Reliability Engineer will establish SRE best practices, drive observability strategy, implement software solutions, and mentor engineers. Responsibilities include improving platform resilience, managing risks, and participating in incident response processes.
The summary above was generated by AI

Company Overview:

Blink Health is the fastest growing healthcare technology company that builds products to make prescriptions accessible and affordable to everybody.  Our two primary products – BlinkRx and Quick Save – remove traditional roadblocks within the current prescription supply chain, resulting in better access to critical medications and improved health outcomes for patients. 

BlinkRx is the world’s first pharma-to-patient cloud that offers a digital concierge service for patients who are prescribed branded medications. Patients benefit from transparent low prices, free home delivery, and world-class support on this first-of-its-kind centralized platform. With BlinkRx, never again will a patient show up at the pharmacy only to discover that they can’t afford their medication, their doctor needs to fill out a form for them, or the pharmacy doesn’t have the medication in stock. 

We are a highly collaborative team of builders and operators who invent new ways of working in an industry that historically has resisted innovation. Join us!

Responsibilities
  • Establish and evolve SRE best practices across the organization, including reliability principles, error budgets, incident response, postmortems, and operational readiness standards.
  • Define and drive observability strategy for system health, performance, and reliability, including SLIs/SLOs, alerting quality, dashboards, and service health indicators.
  • Design and implement software-driven solutions within the infrastructure domain, automating manual processes and eliminating operational complexity and toil.
  • Act as a technical leader and force multiplier, helping set priorities and influencing decision-making across core cloud infrastructure, reliability tooling, and platform architecture.
  • Take ownership of large, ambiguous initiatives, driving them from concept to delivery while aligning stakeholders across engineering, security, and product.
  • Combine deep knowledge of software development, infrastructure, and security to improve platform resilience, scalability, performance, and compliance.
  • Proactively identify systemic risks and reliability gaps, recommending and leading platform upgrades and architectural improvements before they become incidents.
  • Partner with engineering teams to improve developer workflows, tooling, and operational maturity, increasing productivity while reducing cognitive load.
  • Provide technical mentorship, architecture guidance, and high-quality design and code reviews for engineers across infrastructure and product teams.
  • Lead by example in documentation and knowledge sharing, ensuring systems and processes are well-understood and not dependent on individual ownership.
  • Participate in and help mature incident response, escalation practices, and post-incident learning across the organization.
Desired Experience
  • Bachelor’s or Master’s degree in Computer Science or equivalent practical experience.
  • 7+ years of experience in site reliability engineering, infrastructure engineering, or platform engineering roles, with demonstrated impact at scale.
Reliability & Troubleshooting
  • Expert-level, methodical troubleshooting across the entire stack, from application to kernel to network.
  • Strong command-line proficiency and deep expertise in Linux systems and operating system fundamentals.
  • Advanced understanding of networking concepts including load balancing, proxies, DNS, TCP/IP, NAT, and service-to-service communication.
Software & Automation
  • Experience working across multiple languages (e.g., Python, Go, Bash, and familiarity troubleshooting application stacks such as React or similar).
  • Strong track record of automating repetitive and complex operational work to reduce toil and increase reliability.
  • Ability to design and build internal tools (Python or Go) that standardize and scale engineering practices.
  • Comfortable operating in an agile environment, with disciplined testing and quality practices.
Cloud & Platform Engineering
  • Deep experience with cloud platforms (AWS preferred, GCP/Azure acceptable), particularly managed services and production-grade architectures.
  • Strong expertise in Kubernetes and container orchestration (EKS, Helm), including lifecycle management and operational best practices.
  • Proven experience designing and implementing observability systems, including metrics, logging, tracing, dashboards, and alerting.
  • Deep understanding of container technologies, security scanning, secrets management, dynamic configuration, and microservices architectures.
  • Familiarity with service meshes and advanced traffic management concepts.
Infrastructure as Code
  • Experience designing and maintaining company-wide IaC codebases using tools such as Terraform, Pulumi, CloudFormation, or Ansible.
  • Ability to think holistically about infrastructure design, cost, reliability, security, and long-term maintainability.

Why Join Us:

It is rare to have a company that both deeply impacts its customers and is able to provide its services across a massive population.  At Blink, we have a huge impact on people when they are most vulnerable: at the intersection of their healthcare and finances. We are also the fastest growing healthcare company in the country and are driving that impact across millions of new patients every year.  Our business model not only helps people, but drives economics that allow us to build a generational company. We are a relentlessly learning, constantly curious, and aggressively collaborative cross-functional team dedicated to inventing new ways to improve the lives of our customers.

We are an equal opportunity employer and value diversity of all kinds. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Applicants who provide their phone number and consent to receive text messages may receive SMS or MMS updates from Blink Health regarding their application.

Top Skills

Ansible
AWS
Azure
Bash
CloudFormation
GCP
Go
Kubernetes
Pulumi
Python
Terraform

Similar Jobs

2 Days Ago
Easy Apply
Remote or Hybrid
San Jose, CA, USA
Easy Apply
119K-170K Annually
Senior level
119K-170K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
As a Staff Site Reliability Engineer, you'll oversee Zscaler production data center services, optimize code, and ensure cloud service availability and performance. Collaborate with cross-functional teams to improve processes and resolve escalated issues.
Top Skills: BashDnsFirewallsGrafanaHTTPIcmpLoad BalancingNagiosOsi ModelPrometheusPythonTcp/Ip
8 Days Ago
In-Office or Remote
2 Locations
180K-225K Annually
Senior level
180K-225K Annually
Senior level
Consumer Web • eCommerce • Food • Healthtech • Natural Language Processing • Social Impact
Lead and define the DevOps strategy, oversee migration and architecture of Kubernetes-based platforms, and mentor engineering teams.
Top Skills: AnsibleAWSBashChefCloudFormationDatadogGoGrafanaKubernetesPrometheusPuppetPythonRubyTerraform
9 Days Ago
Easy Apply
Remote or Hybrid
USA
Easy Apply
136K-170K Annually
Senior level
136K-170K Annually
Senior level
Cloud • Security • Software
As a Staff Site Reliability Engineer, you will design, build, and maintain cloud infrastructure, improve deployment processes, and collaborate across teams.
Top Skills: Ci/CdDockerGoKubernetes

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account