Caylent Logo

Caylent

Principal AI/ML Architect

Posted 3 Days Ago
Easy Apply
Remote
Hiring Remotely in United States
165K-185K Annually
Senior level
Easy Apply
Remote
Hiring Remotely in United States
165K-185K Annually
Senior level
The role involves leading ML client engagements, shaping strategies, advising on ML architecture, and influencing project outcomes, while guiding teams and driving business value.
The summary above was generated by AI

Caylent is a cloud native services company that helps organizations bring the best out of their people and technology using Amazon Web Services (AWS). We provide a full-range of AWS services including workload migrations and modernization, cloud native application development, DevOps, data engineering, security and compliance, and everything in between.

At Caylent, our people always come first.  We are a global company and operate fully remote with employees in Canada, the United States, and Latin America. We celebrate the culture of each of our team members and foster a community of technological curiosity. Come talk to us to learn more about what it means to be a Caylien!

The Opportunity

This is a senior technical client leadership role that blends deep hands-on ML expertise with strategic advisory and consulting skills. You will be the most experienced ML voice across a diverse and expanding book of customer engagements — from early-stage companies bringing ambitious ML ideas to market, to established enterprises modernizing how they build and operate AI systems on AWS.

You will shape strategy, influence architecture, and leave every team you touch better than you found it. You bring the scientific depth to design and evaluate models rigorously, the engineering depth to architect production ML systems at scale, and the consulting instincts to translate both into business value for customers.

If you have led the hard conversations, shaped the architecture decisions that mattered, and built the things others benchmark against — and you are looking to do that across a growing portfolio of varied and interesting customers — this is the role for you.

What You'll Do
  • Lead end-to-end ML assessments across infrastructure, data pipelines, model lifecycle, and organizational readiness — producing recommendations that drive executive decision-making and earn Caylent the next engagement.
  • Partner with sales and solutions teams through the proposal and scoping phase, contributing the technical depth needed to shape well-grounded statements of work.
  • Serve as the senior technical authority on client engagements — possibly across multiple projects simultaneously — providing architectural guidance, ensuring technical quality from your project team members, and getting hands-on when the engagement demands it, without owning day-to-day implementation responsibilities.
  • Own or orchestrate high-quality POCs that give customers confidence before committing to a larger initiative.
  • Advise customers on ML operations standards and architecture — covering MLOps pipeline design, model lifecycle management, LLMOps patterns, and production monitoring frameworks — translating operational complexity into decisions and guardrails their teams can own and sustain.
  • Shape how Caylent wins its most technically complex opportunities — contributing the architectural thinking and credibility that turns prospects into customers.
  • Strengthen the ML practice from the inside — through peer guidance, technical interviews, and contributions to accelerators, reference architectures, and thought leadership content.

What You Bring

The non-negotiables
  • 10+ years in machine learning or AI, with a proven track record of leading client-facing engagements in a consulting or advisory capacity.
  • Deep, current knowledge of the AWS ML and GenAI ecosystem, with the ability to make and defend architectural decisions across the full ML lifecycle — from data and feature engineering through training, deployment, and monitoring.
  • Deep expertise in at least two or three ML domains — whether traditional ML, computer vision, NLP, time series, or others — combined with the judgment to assess, architect, and advise across the broader ML landscape.
  • Proven ability to architect and govern production ML systems end-to-end, translating MLOps, LLMOps, and broader AI operations complexity into standards and decisions that engineering teams can execute and executives can act on.
  • Deep expertise across foundation model adaptation — fine-tuning (LoRA, QLoRA, PEFT), alignment (RLHF, DPO), inference optimization (quantization, vLLM), and distributed training (DeepSpeed, FSDP) — combined with RAG and agentic system design, including multi-agent architectures, event-driven workflows, MCP integration, and human-in-the-loop patterns on AWS. Technical authority to prescribe the right approach and set architectural standards that teams can execute against.
  • Proven ability to operate independently in complex customer environments — navigating ambiguity, aligning stakeholders, and translating ML tradeoffs into business risk and value for both technical and executive audiences.
Strong differentiators
  • AWS Certified Machine Learning – Specialty and/or AWS Certified Solutions Architect – Professional.
  • Experience shaping practice-level standards, reference architectures, and reusable ML accelerators across multiple engagements.
  • Exposure to varied industries and problem types in a consulting or client-facing context.
  • Deep fluency in responsible AI practices — model evaluation, bias detection, fairness frameworks, and AI governance — applied in enterprise deployments.
  • Hands-on experience designing and deploying SRE agents and AI-driven operations workflows in production — spanning automated incident detection, triage, and remediation — with the ability to integrate across observability platforms and translate AI operations outcomes into measurable business value.

Technical Stack

Our practice spans a broad range of ML domains. Candidates are expected to prescribe — not just recognize — with the judgment to maximize what AWS makes possible and the experience to know how open-source tooling strengthens it.

ML Domains: Classical ML, Computer Vision, NLP, Generative AI & LLMs, AI Agents & Autonomous Systems, Intelligent Document Processing, Video Understanding, Speech & Audio, Time Series & Forecasting, Recommender Systems, Graph ML, Reinforcement Learning, Multimodal AI

AWS ML Platform: SageMaker, SageMaker Pipelines, SageMaker Feature Store, SageMaker Model Registry, SageMaker Clarify, Bedrock (Agents, Knowledge Bases, Guardrails, AgentCore, Model Evaluation)

Multi-provider LLM: Bedrock, Anthropic API, OpenAI API, Google Gemini API, Azure OpenAI — with the judgment to reason across provider tradeoffs in enterprise contexts

AWS AI Services: Rekognition, Comprehend, Transcribe, Textract, Translate, Personalize, Neptune, Kinesis Video Streams, Polly

Data Platform: Apache Spark / PySpark, Apache Kafka, Amazon Kinesis, Apache Iceberg, Delta Lake, Apache Hudi, AWS Glue

Vector Databases: Pinecone, pgvector, Amazon OpenSearch (vector), Weaviate

Frameworks: PyTorch, TensorFlow, JAX, Scikit-learn, XGBoost, HuggingFace (Transformers, PEFT, TRL), LangChain, LlamaIndex, DSPy, Ollama

MLOps & Governance: MLflow, W&B, Airflow / MWAA (data orchestration), Dagster (asset-based pipelines), Kubeflow Pipelines, CI/CD, IaC (CloudFormation, CDK, Terraform), Docker, Kubernetes, ML Governance (lineage, data contracts, audit), Responsible AI / Bias & Fairness

LLM Evaluation & Safety: RAGAS, LLM-as-judge patterns, DeepEval, NeMo Guardrails, Constitutional AI patterns, structured output validation

Inference & Optimization: Triton, vLLM, SGLang, Trainium, Inferentia, Quantization (GPTQ, AWQ, bitsandbytes), SageMaker Neo

Benefits
  • Medical Insurance for you and eligible dependents 
  • 100% remote work
  • 401k plan with company match up to 4% and immediate vesting
  • Competitive phantom equity
  • Company issued laptop
  • Dental and Vision insurance
  • Term Disability Insurance
  • Term Life Insurance
  • Flexible Spending Account
  • Equipment & Office Stipend
  • Annual stipend for Learning and Development
  • Unlimited Paid Time Off, following a 90-day probationary period
  • 10 Paid Holidays

Base Salary Range: The expected base salary range for this position is $165,000 - $185,000 per year, commensurate with experience and qualifications.

Additional Compensation Components: In addition to the base salary, the compensation package may include bonuses, commissions, equity, and other incentives. The specific components will vary depending on the role and individual and/or company performance.

NOTE: We’re unable to provide visa sponsorship now or at any time in the future.

At Caylent, we are committed to fair, transparent, and inclusive hiring practices. As part of our recruitment process, we may use artificial intelligence (AI) tools or automated systems to assist with the screening and evaluation of applications to help match candidate qualifications with job requirements.
These tools are designed to support — not replace — human decision-making. Final hiring decisions are always made by our trained recruitment professionals.
If an AI or automated tool is used during your application process, it will only be in accordance with applicable laws and regulations, and your information will be handled in a secure and confidential manner.
If you have any questions, please contact [email protected] 

Caylent is a place where everyone belongs. We celebrate diversity and are committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at Caylent.  

We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at [email protected].

Top Skills

Amazon Kinesis
Apache Kafka
Spark
AWS
Docker
Genai
Huggingface
Jax
Kubeflow
Kubernetes
Llmops
Ml
Mlflow
Mlops
Pyspark
PyTorch
Scikit-Learn
TensorFlow
Xgboost
HQ

Caylent Irvine, California, USA Office

4521 Campus Dr, Suite 344, Irvine, CA, United States, 92612

Similar Jobs

12 Days Ago
In-Office or Remote
160K-210K Annually
Expert/Leader
160K-210K Annually
Expert/Leader
Cloud • Security • Cybersecurity
The AI/ML Principal Solutions Architect will design tailored AI solutions, lead proposal efforts, and advise clients on technical architecture and integration, ensuring sales success and customer satisfaction.
Top Skills: Amazon BedrockAzure OpenaiGoogle Vertex AiHugging FaceLangchain
12 Days Ago
Remote
United States of America
161K-350K Annually
Expert/Leader
161K-350K Annually
Expert/Leader
AdTech • Digital Media • Information Technology • Other
The Senior Principal AI/ML Architect will drive AI architectural vision, influence multi-squad strategy, design intelligent systems, and set operational standards for Yahoo Mail's AI capabilities.
Top Skills: AIDistributed Ml SystemsLlmsMlPersonalization Systems
An Hour Ago
Easy Apply
Remote or Hybrid
District of Columbia, USA
Easy Apply
177K-221K Annually
Senior level
177K-221K Annually
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
The Senior Sales Engineer will lead technical sales processes, architect Zero Trust solutions, and enhance customer success in Federal and A&D sectors.
Top Skills: AWSAzureCloud-Native Zero Trust Security ModelsGCPNetwork SecurityZero Trust Architecture

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account