Caylent Logo

Caylent

AI/ML Engineering Manager

Reposted 24 Days Ago
Remote
Hiring Remotely in United States
140K-215K Annually
Senior level
Remote
Hiring Remotely in United States
140K-215K Annually
Senior level
The AI/ML Engineering Manager leads a team of ML engineers and architects, managing hiring, performance, and customer engagements while providing technical guidance in ML operations and architecture.
The summary above was generated by AI

Caylent is a cloud native services company that helps organizations bring the best out of their people and technology using Amazon Web Services (AWS). We provide a full-range of AWS services including workload migrations and modernization, cloud native application development, DevOps, data engineering, security and compliance, and everything in between.

At Caylent, our people always come first.  We are a global company and operate fully remote with employees in Canada, the United States, and Latin America. We celebrate the culture of each of our team members and foster a community of technological curiosity. Come talk to us to learn more about what it means to be a Caylien!

The Mission

This is a senior role for someone who leads from both directions at once — deeply technical on customer engagements, and fully accountable for the growth and performance of a team of ML engineers and architects. You will report to the Director of AI/ML.

You own hiring, development, and team health alongside leading complex customer engagements, shaping architecture, and driving pre-sales. Both parts of this job are real and ongoing. The right candidate will find energy in that combination, not tension.

Your Assignment

Leading Your Team
  • Hire and build: Set the technical bar for ML roles on your team, lead or oversee technical assessments, and make hiring decisions you can stand behind. Build a team that raises the practice's overall standard.
  • Develop people: Run regular structured 1:1s, provide candid feedback at meaningful milestones, and actively invest in each person's growth — whether they are early in their career or highly experienced.
  • Manage performance: Recognize strong contributors and address performance gaps directly and early. Partner with HRBPs and the Director of AI/ML when situations require a structured path, and advocate for your team when they deserve it.
  • Stay close to staffing: Understand how your team is utilized across engagements, keep the staffing team informed of each person's skills evolution and preferences, and ensure people are placed in work that stretches them appropriately.
Strategic Advisory
  • Lead ML assessments: Evaluate customer environments end-to-end — infrastructure, data pipelines, model lifecycle, and organizational readiness — and produce recommendations that drive executive decisions and open the door to the next engagement.
  • Shape architecture: Serve as the senior technical authority on engagements, setting architectural direction, ensuring technical quality across the team, and making the calls that matter when tradeoffs are hard.
  • Advise on ML operations: Help customers build ML systems they can actually own and sustain — translating MLOps, LLMOps, and production monitoring complexity into standards their engineering teams can execute and their leadership can act on.
  • Drive pre-sales: Partner with sales and solutions teams during scoping and proposal phases, contributing the technical depth needed to scope work accurately and give prospects confidence in Caylent's ability to deliver.
Hands-On Delivery
  • Lead engagements end-to-end: Drive architecture and solution design from kickoff through delivery — setting technical direction, unblocking the team on hard problems, and ensuring the work meets Caylent's quality standards.
  • Own the technical relationship: Depending on the engagement, you are either the primary client contact owning all architect-level outcomes, or the senior technical authority providing oversight across the team. The expectation is the same in both cases — you are the person the engagement depends on technically.
Growing the Practice
    • Raise the bar internally: Mentor engineers and architects through real work, contribute to technical interviews, and build reference architectures and accelerators that make the broader ML practice better.

Your Qualifications (non-negotiables)

  • 10+ years in machine learning or AI, with a proven track record of leading client-facing engagements in a consulting or advisory capacity.
  • Demonstrated people management experience — hiring, performance calibration, career development, and the ability to have difficult conversations directly and constructively.
  • Deep, current knowledge of the AWS ML and GenAI ecosystem, with the ability to make and defend architectural decisions across the full ML lifecycle — from data and feature engineering through training, deployment, and monitoring.
  • Deep expertise in at least two or three ML domains — whether classical ML, computer vision, NLP, time series, or others — combined with the judgment to assess, architect, and advise across the broader ML landscape.
  • Proven ability to architect and govern production ML systems end-to-end, translating MLOps, LLMOps, and broader AI operations complexity into standards that engineering teams can execute and executives can act on.
  • Deep expertise across foundation model adaptation — fine-tuning (LoRA, QLoRA, PEFT), alignment (RLHF, DPO), inference optimization, and distributed training — combined with RAG and agentic system design, including multi-agent architectures, MCP integration, and human-in-the-loop patterns on AWS.
  • Proven ability to operate independently in complex, ambiguous customer environments — navigating competing priorities, aligning stakeholders, and translating ML tradeoffs into business risk and value for both technical and executive audiences.
Strong differentiators
  • AWS Certified Machine Learning – Specialty and/or AWS Certified Solutions Architect – Professional.
  • Experience shaping practice-level standards, reference architectures, and reusable ML accelerators across multiple engagements.
  • Exposure to varied industries and problem types in a consulting or client-facing context.
  • Deep fluency in responsible AI practices — model evaluation, bias detection, fairness frameworks, and AI governance — applied in enterprise deployments.
  • Fluency in AIOps patterns — designing agentic workflows for anomaly detection, automated root cause analysis, and remediation across observability platforms — and the ability to translate AI operations outcomes into measurable business value for customers.

Technical Stack

Our practice spans a broad range of ML domains. Candidates are expected to prescribe — not just recognize — with the judgment to maximize what AWS makes possible and the experience to know how open-source tooling strengthens it.

  • ML Domains: Classical ML, Computer Vision, NLP, Generative AI & LLMs, AI Agents & Autonomous Systems, Intelligent Document Processing, Video Understanding, Speech & Audio, Time Series & Forecasting, Recommender Systems, Graph ML, Reinforcement Learning, Multimodal AI
  • AWS ML Platform: SageMaker, SageMaker Pipelines, SageMaker Feature Store, SageMaker Model Registry, SageMaker Clarify, Bedrock (Agents, Knowledge Bases, Guardrails, AgentCore, Model Evaluation)
  • Multi-provider LLM: Bedrock, Anthropic API, OpenAI API, Google Gemini API, Azure OpenAI — with the judgment to reason across provider tradeoffs in enterprise contexts
  • AWS AI Services: Rekognition, Comprehend, Transcribe, Textract, Translate, Personalize, Neptune, Kinesis Video Streams, Polly
  • Data Platform: Apache Spark / PySpark, Apache Kafka, Amazon Kinesis, Apache Iceberg, Delta Lake, Apache Hudi, AWS Glue
  • Vector Databases: Pinecone, pgvector, Amazon OpenSearch (vector), Weaviate
  • Frameworks: PyTorch, TensorFlow, JAX, Scikit-learn, XGBoost, HuggingFace (Transformers, PEFT, TRL), LangChain, LlamaIndex, DSPy, Ollama
  • MLOps & Governance: MLflow, W&B, Airflow / MWAA (data orchestration), Dagster (asset-based pipelines), Kubeflow Pipelines, CI/CD, IaC (CloudFormation, CDK, Terraform), Docker, Kubernetes, ML Governance (lineage, data contracts, audit), Responsible AI / Bias & Fairness
  • LLM Evaluation & Safety: RAGAS, LLM-as-judge patterns, DeepEval, NeMo Guardrails, Constitutional AI patterns, structured output validation
  • Inference & Optimization: Triton, vLLM, SGLang, Trainium, Inferentia, Quantization (GPTQ, AWQ, bitsandbytes), SageMaker Neo

Benefits

  • Medical Insurance for you and eligible dependents 
  • 401k plan with company match up to 4% and immediate vesting
  • Competitive phantom equity
  • Company issued laptop
  • Dental and Vision insurance
  • Term Disability Insurance
  • Term Life Insurance
  • Flexible Spending Account
  • Equipment & Office Stipend
  • Annual stipend for Learning and Development
  • Unlimited Paid Time Off, following a 90-day probationary period
  • 10 Paid Holidays

Base Salary Range: The expected base salary range for this position is $140,000 - $215,000 per year, commensurate with experience and qualifications.

Additional Compensation Components: In addition to the base salary, the compensation package may include bonuses, commissions, equity, and other incentives. The specific components will vary depending on the role and individual and/or company performance.

NOTE: We are unable to provide sponsorship for this position.  

NOTE: We’re unable to provide visa sponsorship now or at any time in the future.

At Caylent, we are committed to fair, transparent, and inclusive hiring practices. As part of our recruitment process, we may use artificial intelligence (AI) tools or automated systems to assist with the screening and evaluation of applications to help match candidate qualifications with job requirements.
These tools are designed to support — not replace — human decision-making. Final hiring decisions are always made by our trained recruitment professionals.
If an AI or automated tool is used during your application process, it will only be in accordance with applicable laws and regulations, and your information will be handled in a secure and confidential manner.
If you have any questions, please contact [email protected] 

Caylent is a place where everyone belongs. We celebrate diversity and are committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at Caylent.  

We are proud to be an equal opportunity employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at [email protected].
HQ

Caylent Irvine, California, USA Office

4521 Campus Dr, Suite 344, Irvine, CA, United States, 92612

Similar Jobs

8 Days Ago
Remote or Hybrid
149K-255K Annually
Senior level
149K-255K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead and grow an AI/ML engineering team to design, build, and deploy production-grade ML/AI solutions (NLP, NLU, deep learning, CV, ASR). Oversee architecture, MLOps, experiments, model monitoring/versioning, cross-functional collaboration, and ethical AI practices to translate research into scalable business solutions.
Top Skills: Automatic Speech RecognitionAzureComputer VisionDatabricksDeep LearningHadoopKafkaNlpNluPythonPyTorchScikit-LearnSparkTensorFlow
13 Days Ago
In-Office or Remote
149K-255K Annually
Senior level
149K-255K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead and grow an AI/ML engineering team delivering agentic, reusable AI services for healthcare payment integrity and clinical operations. Drive platform design, cloud-based deployment, CI/CD/DevSecOps, production observability, ethical AI practices, cross-functional partnership, and repeatable customer deployments.
Top Skills: Ai AgentsAWSAzureCi/CdClaudeDevsecopsDockerGCPGeminiGitJenkinsKubernetesLangchainLarge Language ModelsModel MonitoringOpenaiPrompt EngineeringPythonRetrieval-Augmented GenerationVector Databases
13 Days Ago
In-Office or Remote
149K-255K Annually
Senior level
149K-255K Annually
Senior level
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
Lead an AI platform team to design, build, and deploy agentic, reusable AI services for healthcare payment integrity and clinical workflows. Provide technical and strategic leadership, drive deployment automation, CI/CD, production observability, ethical AI practices, and cross-functional partnership to ensure secure, scalable delivery and measurable business outcomes.
Top Skills: Ai AgentsAWSAzureCi/CdClaudeDevsecopsDockerGCPGeminiGitJenkinsKubernetesLangchainLlmsModel Fine-TuningOpenaiPrompt EngineeringPythonRagVector Databases

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account