LlamaIndex

Multimodal AI Engineer, Document Understanding

Posted 20 Hours Ago

In-Office or Remote

2 Locations

180K-250K Annually

Mid level

In-Office or Remote

2 Locations

180K-250K Annually

Mid level

Develop and optimize machine learning models for document understanding, handle production ML systems, and integrate innovations into APIs.

The summary above was generated by AI

Join us and help shape the future of AI by redefining document workflows with AI agents.

About the Role:

We are seeking exceptional AI engineers to join our core document understanding team. You will work at the intersection of computer vision, natural language processing, and production ML systems to push the boundaries of what's possible in document parsing and understanding.

Our document understanding team builds the intelligence behind LlamaParse, LlamaExtract, and our other processing products. These systems are processing millions of complex documents including PDFs, PowerPoints, Word documents, and spreadsheets. Your work will directly impact thousands of developers building RAG applications and document agents, while also contributing to our open-source frameworks that shape how the industry approaches document processing.

Depending on your background and interests, you might focus more on data curation and evaluation, model fine-tuning and experimentation, or ML infrastructure and production systems. We're hiring multiple people and will work with you to find the best fit.

Responsibilities:

Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processing
Build robust data pipelines, evaluation frameworks, and experimentation infrastructure
Design and implement production ML systems that handle complex, real-world documents at scale
Stay current with latest advances in vision-language models, document AI, and multimodal learning
Collaborate with engineering teams to integrate ML innovations into production APIs
Contribute to both our open-source frameworks and enterprise offerings
Drive technical decisions while balancing research exploration with product delivery

Required Qualifications:

3-7 years of experience in machine learning engineering or applied research
Strong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)
Hands-on experience training, fine-tuning, or deploying ML models in production
Deep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning
Experience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructure
Ability to read and implement from research papers and technical specifications
Track record of executing with high intensity in fast-paced environments
Strong technical communication skills and comfort with open-source collaboration

Preferred Qualifications:

Experience with vision-language models, transformer architectures, or model fine-tuning (LoRA, QLoRA)
Experience building evaluation frameworks, benchmarks, or data quality pipelines
Experience with model serving frameworks (vLLM, TensorRT, ONNX) or MLOps tools
Experience specifically with document understanding, OCR, or layout analysis
Contributions to open-source ML projects or frameworks
Experience with LLM applications and RAG systems
Strong understanding of model optimization techniques (quantization, distillation, pruning)
Experience with Docker/Kubernetes and distributed systems
Active participation in ML research community

Location:

We offer a hybrid-friendly culture based out of our downtown San Francisco office. Remote candidates will be considered for exceptional fits.

Why Join Us?

Impactful Mission: Work on innovative AI products that redefine how knowledge is accessed and utilized. Your models will process millions of documents and directly impact thousands of developers.
Cutting-Edge Technology: Work with the latest vision-language models, contribute to open-source frameworks used industry-wide, and shape the future of document AI.
Collaborative Team: Join a focused team of passionate engineers and researchers committed to pushing the boundaries of what's possible in document understanding.
Technical Autonomy: Significant creative freedom to explore new approaches while maintaining focus on delivering high-quality, production-ready solutions.
Growth Opportunities: Be at the forefront of the AI revolution, with ample opportunities to grow alongside our scaling organization. Shape your role based on your interests and strengths.

Additional Benefits:

Competitive base salary and equity compensation
Comprehensive medical/dental/vision coverage for you and your family
Unlimited paid time off policy
Daily catered lunch and snacks in the San Francisco office
Budget for conferences, research materials, and professional development
Access to cutting-edge compute resources and research tools

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

LlamaIndex does not accept unsolicited agency resumes. Please do not forward resumes to our jobs alias, employees, or any other organization location. LlamaIndex is not responsible for any fees related to unsolicited resumes.

Top Skills

Docker

Kubernetes

Mypy

Onnx

Pydantic

Python

Ruff

Tensorrt

Similar Jobs

ActBlue

Staff Engineer

31 Seconds Ago

Easy Apply

Remote

USA

Easy Apply

192K-241K Annually

Senior level

192K-241K Annually

Senior level

Fintech • Social Impact • Software

The Staff Engineer will lead technical initiatives, architect full-stack solutions, mentor team members, and collaborate on strategic platform evolution.

Top Skills: PostgresReactRuby On RailsTypescript

ActBlue

Senior Software Engineer

34 Seconds Ago

Easy Apply

Remote

USA

Easy Apply

158K-183K Annually

Senior level

158K-183K Annually

Senior level

Fintech • Social Impact • Software

As a Senior Software Engineer, you'll build and improve platform capabilities using Ruby on Rails, React, TypeScript, and PostgreSQL, leading projects, mentoring others, and collaborating with cross-functional teams to design user-facing features and maintain code quality.

Top Skills: PostgresReactRuby On RailsTypescript

Flywire

Senior Software Engineer

2 Minutes Ago

Remote or Hybrid

Chicago, IL, USA

155K-175K Annually

Senior level

155K-175K Annually

Senior level

Fintech • Payments • Software

As a Senior Software Engineer II, you will develop and maintain a payments platform, ensuring high code quality, scalability, and collaboration with cross-functional teams while mentoring others.

Top Skills: AWSCSS3DockerHibernateHTML5JavaJavaScriptOpen TelemetryPostgresReactSpring Boot

What you need to know about the Los Angeles Tech Scene

Los Angeles is a global leader in entertainment, so it’s no surprise that many of the biggest players in streaming, digital media and game development call the city home. But the city boasts plenty of non-entertainment innovation as well, with tech companies spanning verticals like AI, fintech, e-commerce and biotech. With major universities like Caltech, UCLA, USC and the nearby UC Irvine, the city has a steady supply of top-flight tech and engineering talent — not counting the graduates flocking to Los Angeles from across the world to enjoy its beaches, culture and year-round temperate climate.

Key Facts About Los Angeles Tech

Number of Tech Workers: 375,800; 5.5% of overall workforce (2024 CompTIA survey)
Major Tech Employers: Snap, Netflix, SpaceX, Disney, Google
Key Industries: Artificial intelligence, adtech, media, software, game development
Funding Landscape: $11.6 billion in venture capital funding in 2024 (Pitchbook)
Notable Investors: Strong Ventures, Fifth Wall, Upfront Ventures, Mucker Capital, Kittyhawk Ventures
Research Centers and Universities: California Institute of Technology, UCLA, University of Southern California, UC Irvine, Pepperdine, California Institute for Immunology and Immunotherapy, Center for Quantum Science and Engineering