Principal Software Engineer, AI Platform Engineering

Saviynt

Reposted 3 Days Ago

Hybrid

El Segundo, CA, USA

Senior level

The Principal Software Engineer leads the architectural design of a data platform, ensuring isolated and compliant data flows, overseeing AI data lakes, batch and streaming pipelines, and developing service APIs.

The summary above was generated by AI

ABOUT SAVIYNT

Saviynt is a leader in identity security, delivering an AI-powered platform that governs and secures access to applications, data, and business processes for global enterprises and government institutions. Built for the AI era, Saviynt helps organizations move faster — securely and compliantly.

ABOUT THE ROLE

You set the architectural direction for how training data flows, evolves, and is governed across the AI Platform. You define the standards ML engineers and scientists build on, and ensure every training signal is tenant-isolated, PII-free, and traceable from source to model.

WHAT YOU'LL OWN

AI Data Lake on GCS: bucket layout, raw → silver → gold tier separation, CMEK encryption, lifecycle rules
Batch pipelines: Spark on Dataproc for TB-scale feature backfills, Iceberg compaction, and daily S3→GCS incremental sync
Streaming pipelines: Apache Beam on Dataflow for sub-5-min CDC ingestion with exactly-once semantics and PII assertion gates
Schema registry: Avro / Protobuf schema versioning, compatibility modes, and migration playbooks for safe schema evolution
Orchestration: Flyte as primary DAG layer — task authoring standards, domain isolation, retry policies, DataCatalog memoization; evaluate Kubeflow Pipelines where relevant
Multi-tenancy: strict per-tenant GCS prefix isolation, quota policies, and cross-tenant contamination validation
Data Anonymizer and Data Labeler microservices: strip PII and attach ML labels before signals leave each customer environment
Feature store: Feast offline (GCS Parquet) and online (Redis) stores with point-in-time correctness and an online/offline inconsistency SLA of < 0.1%
Vector database: operate pgvector (Cloud SQL) for proofs of concept and Qdrant on GKE for production-scale embedding storage; design index strategies (IVFFlat, HNSW) and manage ANN query latency SLAs
RAG data pipeline: build embedding generation pipelines that chunk, encode, and upsert document embeddings into the vector store; own the data refresh cadence and staleness SLAs for retrieval context
Service APIs: expose data platform services (feature serving, embedding upsert, schema validation) over HTTPS with mTLS and gRPC where low-latency streaming is required
Synthetic data pipelines for dev/staging where real customer data is not permitted
Data quality gates: Great Expectations / dbt checks as Flyte tasks, blocking on schema and PII-absence failures
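
As an illustration of the cross-tenant contamination validation named in the multi-tenancy item above, here is a minimal sketch in plain Python. It assumes a hypothetical object layout of `tenants/<tenant_id>/<tier>/...` and a hypothetical manifest that maps each object name to the tenant ID recorded in its metadata; neither the layout nor the function names come from the posting, and a real check would read object listings from GCS rather than from a dict.

```python
# Hypothetical layout (an assumption, not taken from the posting):
#   tenants/<tenant_id>/<tier>/<path...>
TIERS = {"raw", "silver", "gold"}


def tenant_of(object_name: str) -> str:
    """Return the tenant ID encoded in an object name, or raise ValueError."""
    parts = object_name.split("/")
    well_formed = (
        len(parts) >= 4
        and parts[0] == "tenants"
        and parts[1] != ""
        and parts[2] in TIERS
    )
    if not well_formed:
        raise ValueError(f"object outside the tenant layout: {object_name!r}")
    return parts[1]


def find_contamination(manifest: dict[str, str]) -> list[str]:
    """Return object names that violate per-tenant prefix isolation.

    `manifest` maps object name -> tenant ID recorded in the object's metadata.
    An object is flagged when it sits outside the tenant layout or when the
    tenant in its prefix differs from the recorded tenant.
    """
    violations = []
    for name, recorded_tenant in sorted(manifest.items()):
        try:
            prefix_tenant = tenant_of(name)
        except ValueError:
            violations.append(name)
            continue
        if prefix_tenant != recorded_tenant:
            violations.append(name)
    return violations
```

A check of this shape could run as a blocking quality gate: an empty result lets the pipeline proceed, and any flagged object fails the run.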

YOU'LL THRIVE HERE IF YOU HAVE

8+ years of data engineering at production scale across multiple companies
Demonstrated principal impact: platform standards you defined adopted org-wide, or major cross-team pipeline/schema migrations you led
Data lake ownership (essential): you have designed and operated a production data lake end-to-end — storage layout, partitioning strategy, tiered retention (hot/warm/cold), table format (Iceberg or Delta Lake), compaction, and access control; not just consumed one
Deep Spark (PySpark / Scala): executor tuning, shuffle diagnosis, Iceberg table maintenance
Hands-on Beam / Dataflow: windowing, exactly-once, side inputs, autoscaling
Schema registry experience: Protobuf / Avro compatibility rules, breaking-change migrations in production
Orchestration at scale: Flyte, Kubeflow Pipelines, Airflow, or Prefect, operated in production; ideally you have benchmarked at least two against each other
Multi-tenant data architecture: per-tenant isolation as a hard requirement, not a post-hoc concern
Feature store operations: Feast or Tecton, point-in-time joins, online/offline consistency
Vector databases: pgvector or Qdrant in production, including index tuning, ANN search, and embedding upsert pipelines
RAG data fundamentals: chunking strategies, embedding model selection, retrieval quality evaluation, and context freshness management
API transport: gRPC and HTTPS/mTLS for service-to-service communication; comfortable defining proto contracts and managing certificate lifecycle
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical or military experience

NICE TO HAVE

Differential privacy or k-anonymity for ML training datasets
Open source contributions: Feast, Great Expectations, Apache Beam, or dbt
Familiarity with IAM / access governance data: entitlements, provisioning events, access graphs
Iceberg or Delta Lake at petabyte scale

WHY JOIN SAVIYNT

Work on a large-scale, Kubernetes-based SaaS platform
Solve challenging cloud and reliability problems at scale
Collaborate with strong engineers in a reliability-focused culture
Competitive compensation, benefits, and growth opportunities

SECURITY & COMPLIANCE

This role requires adherence to Saviynt's information security and privacy policies, including annual security training.

1301 E El Segundo Blvd, El Segundo, CA 90245, United States
