Fabrion

ML/AI Research Engineer — Agentic AI Lab (Founding Team)

Posted 3 Days Ago
In-Office or Remote
Hiring Remotely in CA
Senior level
Design, train, evaluate, and optimize agent-native LLMs and RAG pipelines for enterprise use. Build training and RL pipelines (RLHF/DPO/PPO), embedding-based memory, evaluation harnesses, observability, and inference optimization across cloud and on-prem environments.
The summary above was generated by AI
Location: San Francisco Bay Area
Type: Full-Time
Compensation: Competitive salary + meaningful equity (founding tier)

Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems.

About the Role

We’re designing the future of enterprise AI infrastructure — grounded in agents, retrieval-augmented generation (RAG), knowledge graphs, and multi-tenant governance.

We’re looking for an ML/AI Research Engineer to join our AI Lab and lead the design, training, evaluation, and optimization of agent-native AI models. You'll work at the intersection of LLMs, vector search, graph reasoning, and reinforcement learning — building the intelligence layer that sits on top of our enterprise data fabric.

This isn’t a prompt engineer role. It’s full-cycle ML: from data curation and fine-tuning to evaluation, interpretability, and deployment — with cost-awareness, alignment, and agent coordination all in scope.

Core Responsibilities

  • Fine-tune and evaluate open-source LLMs (e.g. LLaMA 3, Mistral, Falcon, Mixtral) for enterprise use cases with both structured and unstructured data

  • Build and optimize RAG pipelines using LangChain, LangGraph, LlamaIndex, or Dust — integrated with our vector DBs and internal knowledge graph

  • Train agent architectures (ReAct, AutoGPT, BabyAGI, OpenAgents) using enterprise task data

  • Develop embedding-based memory and retrieval chains with token-efficient chunking strategies

  • Create reinforcement learning pipelines to optimize agent behaviors (e.g. RLHF, DPO, PPO)

  • Establish scalable evaluation harnesses for LLM and agent performance, including synthetic evals, trace capture, and explainability tools

  • Contribute to model observability, drift detection, error classification, and alignment

  • Optimize inference latency and GPU resource utilization across cloud and on-prem environments

Desired Experience

Model Training:

  • Deep experience fine-tuning open-source LLMs using HuggingFace Transformers, DeepSpeed, vLLM, FSDP, LoRA/QLoRA

  • Worked with both base and instruction-tuned models; familiar with SFT, RLHF, DPO pipelines

  • Comfortable building and maintaining custom training datasets, filters, and eval splits

  • Understand tradeoffs in batch size, token window, optimizer, precision (FP16, bfloat16), and quantization

RAG + Knowledge Graphs:

  • Experience building enterprise-grade RAG pipelines integrated with real-time or contextual data

  • Familiar with LangChain, LangGraph, LlamaIndex, and open-source vector DBs (Weaviate, Qdrant, FAISS)

  • Experience grounding models with structured data (SQL, graph, metadata) + unstructured sources

  • Bonus: Worked with Neo4j, PuppyGraph, RDF, OWL, or other semantic modeling systems

Agent Intelligence:

  • Experience training or customizing agent frameworks with multi-step reasoning and memory

  • Understand common agent loop patterns (e.g. Plan→Act→Reflect), memory recall, and tools

  • Familiar with self-correction, multi-agent communication, and agent ops logging

Optimization:

  • Strong background in token cost optimization, chunking strategies, reranking (e.g. Cohere, Jina), compression, and retrieval latency tuning

  • Experience running models under quantized (int4/int8) or multi-GPU settings with inference tuning (vLLM, TGI)

Preferred Tech Stack

  • LLM Training & Inference: HuggingFace Transformers, DeepSpeed, vLLM, FlashAttention, FSDP, LoRA

  • Agent Orchestration: LangChain, LangGraph, ReAct, OpenAgents, LlamaIndex

  • Vector DBs: Weaviate, Qdrant, FAISS, Pinecone, Chroma

  • Graph Knowledge Systems: Neo4j, PuppyGraph, RDF, Gremlin, JSON-LD

  • Storage & Access: Iceberg, DuckDB, Postgres, Parquet, Delta Lake

  • Evaluation: OpenLLM Evals, TruLens, Ragas, LangSmith, Weights & Biases

  • Compute: Ray, Kubernetes, TGI, SageMaker, Lambda Labs, Modal

  • Languages: Python (core), optionally Rust (for inference layers) or JS (for UX experimentation)

Soft Skills & Mindset

  • Startup DNA: resourceful, fast-moving, and capable of working in ambiguity

  • Deep curiosity about agent-based architectures and real-world enterprise complexity

  • Comfortable owning model performance end-to-end: from dataset to deployment

  • Strong instincts around explainability, safety, and continuous improvement

  • Enjoy pair-designing with product and UX to shape capabilities, not just APIs

Why This Role Matters

This role is foundational to our thesis: that agents + enterprise data + knowledge modeling can create intelligent infrastructure for real-world, multi-billion-dollar workflows. Your work won’t be buried in research reports; it will be productionized and activated by hundreds of users and hundreds of thousands of decisions. If this is your dream role, we would love to hear from you.
