LlamaIndex
To automate knowledge work, starting with complex document workflows. LlamaIndex is a data framework and agent development platform that connects large language models with external data sources to create retrieval-augmented generation (RAG) applications and AI workflows.
At a Glance
- Enterprise businesses
- Fortune 500 companies
- Healthcare and medical providers
- Financial services and investment firms
- +8 more
AI Tools by LlamaIndex
(2)SemTools
Semantic Data Extraction Library
LlamaIndex
Enterprise Document Parsing Framework
Discussions
No discussions yet
Be the first to start a discussion about LlamaIndex
Latest News
LlamaIndex Secures $19 Million Series A and Launches LlamaCloud General Availability
LlamaParse v2 API Launch with New LlamaCloud SDKs
Revamped n8n Integration with Stable Nodes for LlamaParse, LlamaExtract, LlamaCloud Index, LlamaClassify, and LlamaSheets
LlamaIndex Achieves SOC 2 Type II, GDPR, and HIPAA Compliance Certifications
Products & Services
A simple, flexible open-source data framework available in Python and TypeScript for building knowledge assistants and RAG applications using LLMs connected to enterprise data. Includes data connectors, indexing, query engines, and advanced retrieval capabilities.
Turn-key commercial knowledge management platform for agentic knowledge management over unstructured data. Features automated parsing, ingestion, indexing, and retrieval with SaaS and on-premise/VPC deployment options. Includes enterprise-grade security with RBAC, SSO, SOC 2 Type II, GDPR, and HIPAA compliance.
Self-serve API for transforming complex unstructured documents (PDFs, Word, PowerPoint, images, charts) into RAG-ready structured data. Features layout-aware parsing with LLMs/VLMs, handles tables, charts, and visual elements. Premium mode adds diagram-to-Mermaid and equation-to-LaTeX conversion.
Collection of data loaders and connectors for various platforms including Notion, Slack, Google Drive, Salesforce, MongoDB, S3, Azure Blob Storage, Microsoft OneDrive, SharePoint, Box, and Confluence.
Market Position
Leading open-source framework for RAG and agentic AI with industry-leading document parsing capabilities. Positioned as a unified, end-to-end enterprise-grade solution for complex unstructured data, addressing accuracy and scaling issues in fragmented toolsets. Competes with LangChain (more suited for prototyping), CrewAI (multi-agent automation), Cohere (enterprise foundation models), Contextual AI (RAG), Dify (generative AI development), and Haystack. LlamaIndex's event-driven workflows provide advantages over rigid graph-based frameworks, and LlamaParse excels at extracting insights from visual-heavy documents where standard Python parsers fail.
Leadership
Founders
Jerry Liu
Previously Machine Learning Engineering Manager at Robust Intelligence, Research Scientist and AI Resident at Uber, Machine Learning Engineer at Quora, and software engineering internships at Two Sigma, Quora, and Apple. Co-president of The Princeton Entrepreneurship Club and Co-Director of HackPrinceton at Princeton University.
Simon Suo
Previously Senior Research Scientist at Waabi, Research Scientist at Uber Advanced Technologies Group, and software engineering/research internships at Facebook, Citadel LLC, LinkedIn, and Bloomberg LP. Undergraduate Research Assistant at the University of Waterloo. Holds BCS '18 from University of Waterloo.
Executive Team
Jerry Liu
Co-Founder and CEO
Previously Machine Learning Engineering Manager at Robust Intelligence, Research Scientist at Uber, and Machine Learning Engineer at Quora. Princeton University alumnus.
Simon Suo
Co-Founder and CTO
Previously Senior Research Scientist at Waabi and Research Scientist at Uber Advanced Technologies Group. Holds BCS '18 from University of Waterloo. Named Forbes 30 Under 30.
Board of Directors
Founding Story
Started in late 2022 as an open-source project called GPT Index to overcome context size limitations of large language models by enabling them to access and feed on larger, private databases of knowledge through Retrieval-Augmented Generation (RAG). The company was officially incorporated in April 2023 by former Uber research scientists Jerry Liu and Simon Suo.
Business Model
Revenue Model
Subscription-based SaaS and pay-as-you-go credit system for LlamaCloud, commercial enterprise licenses, and self-serve APIs. Credits are billed per page processed or minute of audio, varying by parsing mode and model. Commercial offering built on top of open-source project.
Pricing Tiers
10K credits (~1,000 pages), 1 user, 1 project, 5 indexes, 50 files, 0 data sources, 1 data sink, 2 extraction agents, basic support
40K credits plus pay-as-you-go up to 400K credits, 5 users, 1 project, 50 indexes, 250 files, 50 data sources, 5 data sinks, 5 extraction agents, basic support. 1,000 credits cost $1.25
400K credits plus pay-as-you-go up to 4,000K credits, 10 users, 5 projects, 100 indexes, 1,250 files, 100 data sources, 25 data sinks, 15 extraction agents, Slack support
Custom credits, unlimited users/projects/indexes/files/data sources, volume discounts, 5x higher rate limits, Enterprise SSO, SaaS or Hybrid cloud/VPC deployment, dedicated account manager
Target Markets
- Enterprise businesses
- Fortune 500 companies
- Healthcare and medical providers
- Financial services and investment firms
- Legal industry
- Biopharma and life sciences
- Document research and information extraction
- Automating knowledge workflows
- Synthesizing insights and generating reports
- Building RAG applications
- Medical record and underwriting automation
- Enterprise document parsing
- Salesforce
- Rakuten
- The Carlyle Group
- KPMG