BJAK Logo

BJAK

Founding AI/ML Research Engineer

Posted 18 Days Ago
Remote
Hiring Remotely in United States
Mid level
Remote
Hiring Remotely in United States
Mid level
The role involves fine-tuning machine learning models, designing datasets and pipelines, and ensuring product quality and safety for a new AI application.
The summary above was generated by AI
Transform language models into real-world, high-impact product experiences.

A1 is a self-funded AI group, operating in full stealth. We’re building a new global consumer AI application focused on an important but underexplored use case.

You will shape the core technical direction of A1 - model selection, training strategy, infrastructure, and long-term architecture. This is a founding technical role: your decisions will define our model stack, our data strategy, and our product capabilities for years ahead.

You won’t just fine-tune models - you’ll design systems: training pipelines, evaluation frameworks, inference stacks, and scalable deployment architectures. You will have full autonomy to experiment with frontier models (LLaMA, Mistral, Qwen, Claude-compatible architectures) and build new approaches where existing ones fall short.

Why This Role Matters
  • You are creating the intelligence layer of A1’s first product, defining how it understands, reasons, and interacts with users.

  • Your decisions shape our entire technical foundation — model architectures, training pipelines, inference systems, and long-term scalability.

  • You will push beyond typical chatbot use cases, working on a problem space that requires original thinking, experimentation, and contrarian insight.

  • You influence not just how the product works, but what it becomes, helping steer the direction of our earliest use cases.

  • You are joining as a founding builder, setting engineering standards, contributing to culture, and helping create one of the most meaningful AI applications of this wave.

What You’ll Do
  • Build end-to-end training pipelines: data → training → eval → inference

  • Design new model architectures or adapt open-source frontier models

  • Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation)

  • Architect scalable inference systems using vLLM / TensorRT-LLM / DeepSpeed

  • Build data systems for high-quality synthetic and real-world training data

  • Develop alignment, safety, and guardrail strategies

  • Design evaluation frameworks across performance, robustness, safety, and bias

  • Own deployment: GPU optimization, latency reduction, scaling policies

  • Shape early product direction, experiment with new use cases, and build AI-powered experiences from zero

  • Explore frontier techniques: retrieval-augmented training, mixture-of-experts, distillation, multi-agent orchestration, multimodal models

What It’s Like to Work Here
  • You take ownership - you solve problems end-to-end rather than wait for perfect instructions

  • You learn through action - prototype → test → iterate → ship

  • You’re calm in ambiguity - zero-to-one building energises you

  • You bias toward speed with discipline - V1 now > perfect later

  • You see failures and feedback as essential to growth

  • You work with humility, curiosity, and a founder’s mindset

  • You lift the bar for yourself and your teammates every day

Requirements
  • Strong background in deep learning and transformer architectures

  • Hands-on experience training or fine-tuning large models (LLMs or vision models)

  • Proficiency with PyTorch, JAX, or TensorFlow

  • Experience with distributed training frameworks (DeepSpeed, FSDP, Megatron, ZeRO, Ray)

  • Strong software engineering skills — writing robust, production-grade systems

  • Experience with GPU optimization: memory efficiency, quantization, mixed precision

  • Comfortable owning ambiguous, zero-to-one technical problems end-to-end

Nice to Have
  • Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer)

  • Contributions to open-source ML libraries

  • Background in scientific computing, compilers, or GPU kernels

  • Experience with RLHF pipelines (PPO, DPO, ORPO)

  • Experience training or deploying multimodal or diffusion models

  • Experience in large-scale data processing (Apache Arrow, Spark, Ray)

  • Prior work in a research lab (Google Brain, DeepMind, FAIR, Anthropic, OpenAI)

What You’ll Get
  • Extreme ownership and autonomy from day one - you define and build key model systems.

  • Founding-level influence over technical direction, model architecture, and product strategy.

  • Remote-first flexibility

  • High-impact scope—your work becomes core infrastructure of a global consumer AI product.

  • Competitive compensation and performance-based bonuses

  • Backing of a profitable US$2B group, with the speed of a startup

  • Insurance coverage, flexible time off, and global travel insurance

  • Opportunity to shape a new global AI product from zero

  • A small, senior, high-performance team where you collaborate directly with founders and influence every major decision.

Our Team & Culture

We operate as a dense, senior, high-performance team. We value clarity, speed, craftsmanship, and relentless ownership. We behave like founders — we build, ship, iterate, and hold ourselves to a high technical bar.

If you value excellence, enjoy building real systems, and want to be part of a small team creating something globally impactful, you’ll thrive here.

About A1

A1 is a self-funded, independent AI group backed by BJAK, focused on building a new consumer AI product with global impact. We’re assembling a small, elite team of ML and engineering builders who want to work on meaningful, high-impact problems.

Top Skills

Deep Learning
PyTorch
TensorFlow
Transformers

Similar Jobs

3 Hours Ago
Remote or Hybrid
United States
119K-222K Annually
Senior level
119K-222K Annually
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Design and build a machine learning platform, deploy ML models, collaborate across teams, and establish monitoring standards for AI solutions.
Top Skills: AIAmazon BedrockAmazon SagemakerAWSDockerFeastMicroservicesMlRestful Apis
3 Hours Ago
Remote or Hybrid
United States
88K-163K Annually
Mid level
88K-163K Annually
Mid level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
Develop and maintain identity security training programs, collaborating cross-functionally to enhance training effectiveness and learner success while incorporating customer feedback and market trends.
Top Skills: Agile MethodologiesArticulate 360AsanaConfluenceDemo And Simulation ToolsIdentity SecurityLearning Management SystemsSaaSSlackTeamsVideo Production
3 Hours Ago
Remote or Hybrid
United States
115K-213K Annually
Mid level
115K-213K Annually
Mid level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
The Advisory Solutions Consultant will support sales teams by understanding customer needs, providing product demonstrations, and participating in the sales process, focusing on Identity Security solutions.
Top Skills: AWSAzureGCPJavaJSONLdapSQLXML

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account