Prime Intellect Logo

Prime Intellect

Research Engineer - Distributed Training

Job Posted 10 Days Ago Reposted 10 Days Ago
Be an Early Applicant
In-Office or Remote
2 Locations
Mid level
In-Office or Remote
2 Locations
Mid level
The Research Engineer will lead research on decentralized AI training, optimize performance, contribute to open-source libraries, and communicate technical outcomes to a broad audience.
The summary above was generated by AI

At Prime Intellect, we are on a mission to accelerate open and decentralized AI progress by enabling anyone to contribute compute, code or capital to train powerful, open models. Our ultimate goal? Openly accessible AGI that benefits everyone. But we can't do it alone and we want to do this together with you.

We are building the infrastructure for decentralized AI development at scale. We aggregate global compute and enable researchers to collaboratively train state-of-the-art models through distributed training across clusters.

As a Research Engineer working on Distributed Training, you'll play a crucial role in shaping our technological direction, focusing on our decentralizing AI training stack. If you love scaling things and maximizing training efficiency, this role is for you.

Responsibilities
  • Lead and participate in novel research to build a massive scale, highly reliable and secure decentralized training orchestration solution

  • Optimize the performance, cost, and resource utilization of AI workloads by leveraging the most recent advances for compute & memory optimization techniques.

  • Contribute to the development of our open-source libraries and frameworks for distributed model training.

  • Publish research in top-tier AI conferences such as ICML & NeurIPS.

  • Distill highly technical project outcomes in layman approachable technical blogs to our customers and developers.

  • Stay up-to-date with the latest advancements in AI/ML infrastructure and tools, decentralized training research and proactively identify opportunities to enhance our platform's capabilities and user experience.

Requirements
  • Strong background in AI/ML engineering, with extensive experience in designing and implementing end-to-end pipelines for training and deploying large-scale AI models.

  • Deep expertise in distributed training techniques, frameworks (e.g., PyTorch Distributed, DeepSpeed, MosaicML’s LLM Foundry), and tools (e.g. Ray) for optimizing the performance and scalability of AI workloads.

  • Experience in large-scale model training incl. distributed training techniques such as data, tensor & pipeline parallelism

  • Solid understanding of MLOps best practices, including model versioning, experiment tracking, and continuous integration/deployment (CI/CD) pipelines.

  • Passion for advancing the state-of-the-art in decentralized AI model training and democratizing access to AI capabilities for researchers, developers, and businesses worldwide.

  • If you're not familiar with these, but feel like that you can contribute to our mission and you're a high-energy person, get familiar with these resources (here, here and here) and please reach out!

Benefits & Perks
  • Competitive compensation, including equity and token incentives, aligning your success with the growth and impact of Prime Intellect.

  • Flexible work arrangements, with the option to work remotely or in-person at our offices in San Francisco.

  • Visa sponsorship and relocation assistance for international candidates.

  • Quarterly team off-sites, hackathons, conferences and learning opportunities.

  • Opportunity to work with a talented, hard-working and mission-driven team, united by a shared passion for leveraging technology to accelerate science and AI.

We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others.

If you're excited about the opportunity to build the foundation for the future of decentralized AI and create a platform that empowers developers and researchers to push the boundaries of what's possible, we'd love to hear from you.

Top Skills

AI
Ci/Cd
Deepspeed
Ml
Mosaicml
Pytorch Distributed
Ray

Similar Jobs

4 Hours Ago
Remote or Hybrid
US
105K-148K Annually
Senior level
105K-148K Annually
Senior level
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
The Sr Engineer-Mainframe DB2 is responsible for installing, maintaining, and supporting DB2 software, facilitating project planning, and providing technical assistance in disaster recovery and business continuity.
Top Skills: BmcComputer AssociatesCompuwareDb2Ibm MainframesQmfZ/Os
4 Hours Ago
Remote or Hybrid
WI, USA
174K-174K
Expert/Leader
174K-174K
Expert/Leader
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
The Enterprise Architect collaborates with clients to develop infrastructure solutions, leveraging deep technical expertise to strengthen relationships and drive business outcomes.
Top Skills: AWSAzureCloudConverged InfrastructureDataDigitalManaged SolutionsNetworkingSecurityVmc On Aws
4 Hours Ago
Remote or Hybrid
US
99K-147K Annually
Senior level
99K-147K Annually
Senior level
Artificial Intelligence • eCommerce • Information Technology • Internet of Things • Automation
The Sr Software Engineer I - AI designs and implements AI solutions, managing projects and technical aspects while driving innovation and compliance.
Top Skills: AIDatabase Schema DesignMicrosoft AiObject-Oriented Analysis

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

  • Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
  • Key Industries: Artificial intelligence, adtech, media, software, game development
  • Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
  • Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account