Our mission at Tensorwave Cloud is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation.
About the role
We are seeking a Principal Platform Engineer to lead the design, development, and deployment of our next-generation Kubernetes platform.
In this role, you will define what production excellence looks like at scale: a global, self-healing, autoscaling Kubernetes platform with strong observability, security, and cost efficiency, capable of supporting millions of users.
As a technical leader and hands-on architect, you will build and evolve cloud-native and serverless systems on Kubernetes, writing complex manifests, operators, and controllers from scratch.
You will set standards and best practices across the company, ensuring platform tooling is well-documented, reliable, and continuously improved, while enabling developer teams to deploy applications with speed, confidence, and minimal friction.
Responsibilities
Architect and implement end-to-end Kubernetes infrastructure for large-scale, cloud-native applications
Design and build serverless platforms on top of Kubernetes using technologies such as Knative, OpenFaaS, or KEDA
Develop and maintain Kubernetes custom resources (CRDs), controllers, operators, and admission controllers in Go or Python
Define multi-tenant, multi-region architecture supporting millions of users with high availability and low latency
Lead Kubernetes cluster lifecycle management - provisioning, upgrades, scaling, monitoring, troubleshooting
Collaborate closely with engineering teams to containerize applications, write Helm charts or Kustomize overlays, and standardize deployment practices
Implement infrastructure as code using tools like Terraform, Pulumi, or Crossplane
Lead efforts around observability, policy enforcement, cost optimization, and RBAC/security hardening within the cluster
Evaluate and integrate Kubernetes ecosystem tools - Istio/Linkerd, ArgoCD, Flux, Prometheus, Grafana, OPA
Mentor and upskill DevOps engineers and SREs in Kubernetes best practices
Required Experience
Bachelor of Science in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
8+ years of experience in cloud infrastructure, DevOps, or platform engineering roles
8+ years of hands-on Kubernetes experience, including deep knowledge of the Kubernetes API, internals, networking, and storage
Proficiency in writing Kubernetes manifests, Helm charts, and custom Kubernetes controllers/operators
Proven experience designing cloud-native systems that scale globally - multi-region, multi-cloud or hybrid setups
Experience with serverless technologies in production - Knative, OpenFaaS, AWS Lambda
Strong knowledge of cloud platforms such as AWS, GCP, or Azure
Experience with GitOps tools - ArgoCD, Flux
Deep understanding of security, compliance, and resilience in containerized workloads
Preferred Experience
Contributions to Kubernetes open-source projects or CNCF-related tooling
Experience with service mesh design (Istio, Linkerd)
Familiarity with eBPF, Cilium, or network-level observability
Background in building PaaS or developer platforms on top of Kubernetes
What We Bring
Mission driven company
Competitive Salary
Stock Options
100% paid Medical, Dental, and Vision insurance
Flexible PTO
Paid Holidays
401(k)
Parental Leave
Flexible Spending Account
Short Term Disability Insurance
Life and Voluntary Supplemental Insurance
Mental Health Benefits through Spring Health
We’re looking for resilient, adaptable people to join our team, people who believe in the mission and think at massive scale. The solutions that worked on a handful of devices will not work at Exascale. Be prepared to be pushed daily, to learn a lot, and literally build the future.
Tensorwave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, national origin, or veteran status.
Top Skills
Similar Jobs
What you need to know about the Los Angeles Tech Scene
Key Facts About Los Angeles Tech
- Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
- Key Industries: Artificial intelligence, adtech, media, software, game development
- Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
- Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

