Stellar Cyber Logo

Stellar Cyber

Senior DevOps Engineer/Site Reliability Engineer-East Coast

Posted 3 Days Ago
Remote
Hiring Remotely in United States
165K-215K Annually
Senior level
Remote
Hiring Remotely in United States
165K-215K Annually
Senior level
This role involves managing Kubernetes clusters, cloud infrastructure, and CI/CD pipelines. The engineer will enhance system reliability and efficiency while troubleshooting production issues.
The summary above was generated by AI

Join a fast-growing global leader in cybersecurity, trusted by some of the biggest names in the industry. In addition to some of the world’s largest enterprises and government agencies, more than 30% of the world’s top MSSPs rely on our platform. We’re at the forefront of protecting organizations against sophisticated cyber threats using cutting-edge AI and automation technologies. Our culture is built on diversity, openness, and collaboration, fostering creativity and innovation that drives real impact in the market..

We are seeking a highly skilled Senior DevOps / Site Reliability Engineer (SRE) to join our globally distributed engineering organization. This is a hands-on senior-level role focused on building, operating, and scaling reliable cloud-native infrastructure and distributed data platforms.

The ideal candidate will have strong expertise in Kubernetes, cloud infrastructure, observability, automation, CI/CD, incident management, and infrastructure reliability. This role combines DevOps engineering practices with SRE principles to improve scalability, resiliency, operational efficiency, and platform performance across production environments.

The engineer will work closely with platform, development, and operations teams to drive automation, operational excellence, and reliability best practices for mission-critical systems.

Key Responsibilities

  • Administer and maintain Kubernetes clusters and containerized workloads.
  • Manage cloud infrastructure across OCI, AWS, GCP, or Azure environments.
  • Develop and maintain CI/CD pipelines for reliable application deployments.
  • Implement and manage Infrastructure as Code (IaC) using Terraform and Helm.
  • Build automation tooling and operational workflows using Python, Go, or Bash.
  • Drive observability initiatives including monitoring, logging, tracing, and alerting improvements.
  • Monitor, troubleshoot, and resolve production incidents while participating in on-call rotations.
  • Support and optimize distributed data platforms including Kafka, Elasticsearch, Spark, Redis, and MongoDB.
  • Improve platform reliability, scalability, and operational efficiency using SRE best practices.
  • Collaborate with cross-functional teams across multiple time zones.
  • Perform Linux system administration and networking troubleshooting.
  • Contribute to incident response processes, postmortems, and reliability improvements.
  • Support GitOps and deployment workflows using tools such as ArgoCD and GitHub Actions.
  • Evaluate and implement AI-assisted operational tooling for auto-remediation, alert correlation, and operational intelligence.

Requirements
    • 5+ years of experience in DevOps, SRE, or Platform Engineering roles.
    • Strong expertise with Kubernetes, Docker, and container orchestration.
    • Hands-on experience managing production cloud environments.
    • Strong Infrastructure as Code experience with Terraform and Helm.
    • Experience with CI/CD tools and deployment automation.
    • Advanced troubleshooting skills in Linux systems, networking, and distributed systems.
    • Experience with observability platforms including Prometheus, Grafana, Loki, Alertmanager, and Elastic Stack.
    • Strong programming and scripting skills in Python, Bash, or Go.
    • Experience supporting high-availability production systems and on-call operations.
    • Knowledge of incident management and reliability engineering practices.
    • Familiarity with data platform technologies such as Kafka, Spark, Elasticsearch, Redis, or MongoDB.
    • Understanding of AI-driven operational tooling and automated remediation concepts.
    • Excellent communication, collaboration, and problem-solving skills.
    • Resides on the East Coast

Benefits

We pride ourselves in recognizing our employees. Here are some examples of our benefits program:

  • Pre-IPO Stock Options
  • Medical, Dental & Vision care
  • 401(k)
  • Employee Assistance Program
  • Employee Discount Program
  • Life Insurance
  • Paid time off
  • Referral Program
  • Rewards and Recognition Program

The base compensation range for this role is USD 165,000-215,000 per year. Total compensation includes bonus opportunity and equity, and will vary based on candidate location.

Similar Jobs

6 Minutes Ago
Remote
US
85K-90K Annually
Senior level
85K-90K Annually
Senior level
Artificial Intelligence • Healthtech • Mobile • Software • Telehealth • Generative AI
Own and steward Pager Health's visual brand across marketing and sales. Create high-impact assets—presentations, social, web, motion, infographics—that translate complex healthcare and AI concepts into persuasive narratives. Build templates and visual systems, manage the design library, support sales and executive storytelling, and drive consistency and efficiency across cross-functional teams.
Top Skills: Adobe Creative CloudBrazeCanvaFigmaHubspotPowerPoint
8 Minutes Ago
Remote or Hybrid
140K-215K Annually
Senior level
140K-215K Annually
Senior level
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
Senior back-end engineer for CrowdStrike Identity product. Build and scale cloud-deployed, distributed systems, design architecture, mentor junior engineers, collaborate with UI and Sensor teams, manage priorities, drive data-informed decisions, and communicate effectively across the organization and with senior leadership.
Top Skills: C#C++Cloud-Deployed ApplicationsDistributed SystemsGoJava
10 Minutes Ago
Remote
United States
120K-135K Annually
Senior level
120K-135K Annually
Senior level
Fintech • Financial Services
Lead design and maintenance of automated end-to-end tests across internal and external systems, build AI/LLM evaluation frameworks, embed testing in CI/CD pipelines, promote shift-left quality practices, and transform manual processes into automation suites while collaborating with engineers and product managers.
Top Skills: APIsAWSCi/CdDatadogGenerative AiGitGoJIRAKatalon StudioLeapworkLlmLoad TestingMablPerformance TestingPlaywrightPostmanPythonReactRubySalesforceSpasTestsigmaTricentis ToscaTypescript

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account