LlamaIndex
To automate knowledge work, starting with complex document workflows. LlamaIndex is a data framework and agent development platform that connects large language models with external data sources to create retrieval-augmented generation (RAG) applications and AI workflows.
Founding Story
Started in late 2022 as an open-source project called GPT Index to overcome context size limitations of large language models by enabling them to access and feed on larger, private databases of knowledge through Retrieval-Augmented Generation (RAG). The company was officially incorporated in April 2023 by former Uber research scientists Jerry Liu and Simon Suo.
Discussions
No discussions yet
Be the first to start a discussion about LlamaIndex
Leadership
Founders
Jerry Liu
Previously Machine Learning Engineering Manager at Robust Intelligence, Research Scientist and AI Resident at Uber, Machine Learning Engineer at Quora, and software engineering internships at Two Sigma, Quora, and Apple. Co-president of The Princeton Entrepreneurship Club and Co-Director of HackPrinceton at Princeton University.
Simon Suo
Previously Senior Research Scientist at Waabi, Research Scientist at Uber Advanced Technologies Group, and software engineering/research internships at Facebook, Citadel LLC, LinkedIn, and Bloomberg LP. Undergraduate Research Assistant at the University of Waterloo. Holds BCS '18 from University of Waterloo.
Executive Team
Jerry Liu
Co-Founder and CEO
Previously Machine Learning Engineering Manager at Robust Intelligence, Research Scientist at Uber, and Machine Learning Engineer at Quora. Princeton University alumnus.
Simon Suo
Co-Founder and CTO
Previously Senior Research Scientist at Waabi and Research Scientist at Uber Advanced Technologies Group. Holds BCS '18 from University of Waterloo. Named Forbes 30 Under 30.
Business Model
Revenue Model
Subscription-based SaaS and pay-as-you-go credit system for LlamaCloud, commercial enterprise licenses, and self-serve APIs. Credits are billed per page processed or minute of audio, varying by parsing mode and model. Commercial offering built on top of open-source project.
Pricing Tiers
10K credits (~1,000 pages), 1 user, 1 project, 5 indexes, 50 files, 0 data sources, 1 data sink, 2 extraction agents, basic support
40K credits plus pay-as-you-go up to 400K credits, 5 users, 1 project, 50 indexes, 250 files, 50 data sources, 5 data sinks, 5 extraction agents, basic support. 1,000 credits cost $1.25
400K credits plus pay-as-you-go up to 4,000K credits, 10 users, 5 projects, 100 indexes, 1,250 files, 100 data sources, 25 data sinks, 15 extraction agents, Slack support
Custom credits, unlimited users/projects/indexes/files/data sources, volume discounts, 5x higher rate limits, Enterprise SSO, SaaS or Hybrid cloud/VPC deployment, dedicated account manager
Target Markets
- Enterprise businesses
- Fortune 500 companies
- Healthcare and medical providers
- Financial services and investment firms
- Legal industry
- Biopharma and life sciences
- Document research and information extraction
- Automating knowledge workflows
- Synthesizing insights and generating reports
- Building RAG applications
- Medical record and underwriting automation
- Enterprise document parsing
- Salesforce
- Rakuten
- The Carlyle Group
- KPMG
History & Milestones
Launched LlamaParse v2 API with new LlamaCloud SDKs for Python and TypeScript
Revamped n8n integration with stable nodes for LlamaParse, LlamaExtract, LlamaCloud Index, LlamaClassify, and LlamaSheets
Introduced Agentic Document Workflows
Secured $19M Series A funding led by Norwest Venture Partners; reached 3M monthly downloads, 38K GitHub stars, 230K LinkedIn followers; announced LlamaCloud General Availability
LlamaCloud reached one-year anniversary; community reached 300K LlamaCloud users, 25M monthly package downloads, 1,500 contributors, 20K community members
2 AI Tools by LlamaIndex
SemTools
23dA Python library by LlamaIndex for building semantic tools and structured data extraction pipelines using LLMs.

LlamaIndex
2moEnterprise document processing and AI agent framework for building GenAI applications with parsing, extraction, indexing, and retrieval capabilities.
