Clarifai Logo

Clarifai

Senior Site Reliability Engineer

Posted 23 Days Ago
Easy Apply
Remote
Hiring Remotely in Canada
Senior level
Easy Apply
Remote
Hiring Remotely in Canada
Senior level
The Senior Site Reliability Engineer will ensure high availability of core services, optimize system performance, manage cloud infrastructure, and collaborate with teams to solve engineering challenges.
The summary above was generated by AI
Senior Site Reliability EngineerAbout the Company

Clarifai is a leading, compute orchestration AI platform specializing in computer vision and generative AI. We empower organizations to transform unstructured image, video, text, and audio data into actionable insights, significantly faster and more accurately than manual processes. Founded in 2013 by Matt Zeiler, Ph.D., Clarifai has been at the forefront of AI innovation since achieving the top five placements in the 2013 ImageNet Challenge. Our diverse, globally distributed team operates across the United States, Canada, Estonia, Argentina, and India.

We have secured $100M in funding, including a $60M Series C round, backed by industry leaders such as Menlo Ventures, Union Square Ventures, Lux Capital, NEA, LDV Capital, Corazon Capital, Google Ventures, NVIDIA, Qualcomm, and Osage.

Clarifai is proud to be an equal-opportunity workplace committed to building and maintaining a diverse and inclusive team.

Your Impact

Clarifai’s platform is a kubernetes-native distributed system that requires the orchestration of many components. Efficiently serving and training large neural networks presents unique design and infrastructure challenges. 

You will be critical to solving these challenges both in the context of the cloud and in on premise environments. Additionally, you will be responsible for our broader cloud infrastructure and development tools and environments.

The Opportunity
  • Ensure the smooth operation and high availability of Clarifai's core services
  • Monitor system performance, identify bottlenecks, and implement optimizations to enhance reliability and efficiency
  • Develop Kubernetes resources and custom tooling for seamless cloud and on-premise deployments
  • Design and implement scalable, secure, and cost-effective infrastructure solutions.
  • Partner with teams across the organization to identify & solve engineering challenges
Requirements
  • BS/BA in Computer Science or related degree
  • Good knowledge of cloud providers (AWS, GCP or similar)
  • Expertise with Kubernetes (EKS, GKE, self-hosted) and Infrastructure as Code using Terraform, Helm
  • Solid understanding of web and networking (HTTP, TLS, DNS, Certificates, etc)
  • Experience with CI/CD pipelines using tools such as GitHub Actions, ArgoCD, and Atlantis
  • Strong interpersonal skills working with teams across different time zones and regions
Great to Have
  • Knowledge of basic Microservice Architecture principles
  • Familiarity with security best practices for cloud-based systems.
  • Experience with relational databases, message queues, key value stores
  • Experience writing python, golang, or any other popular programming language
  • Familiarity with any RPC framework
  • Experience developing & building custom Kubernetes operators

Top Skills

Argocd
Atlantis
AWS
GCP
Github Actions
Go
Helm
Kubernetes
Python
Terraform

Similar Jobs

7 Hours Ago
Easy Apply
Remote or Hybrid
7 Locations
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
Manage continuous delivery infrastructure for reliable code deployment. Collaborate with teams to streamline onboarding, support deployment systems, and participate in on-call rotations.
Top Skills: Argo WorkflowsArgocdAWSAzureGoGoogle Cloud PlatformKubernetesPython
Yesterday
In-Office or Remote
Toronto, ON, CAN
150K-175K Annually
Senior level
150K-175K Annually
Senior level
Marketing Tech
The Senior Site Reliability Engineer ensures system reliability, performance, and scalability while automating infrastructure and processes. Responsibilities include incident response, monitoring, team collaboration, and continuous improvement in service stability.
Top Skills: AnsibleAWSAzureBashCi/CdIacPythonSnowflakeTerraformTerragrunt
8 Days Ago
Remote
3 Locations
140K-230K Annually
Senior level
140K-230K Annually
Senior level
Cloud • Information Technology • Internet of Things • Software • Consulting • Infrastructure as a Service (IaaS) • Automation
As a Senior Site Reliability Engineer, you'll develop and manage OpenShift services, improve reliability, automate processes, and troubleshoot issues to enhance customer experiences.
Top Skills: AnsibleAzureDockerGoJavaKubernetesOpenshiftPrometheusPythonRed Hat Enterprise Linux

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account