Vespa.ai

Name: Vespa.ai
Price: 20000.00 USD
Availability: OnlineOnly
Author: Vespa.ai AS

An AI Search Platform for building large-scale applications combining vector search, text search, machine-learned ranking, and real-time inference at enterprise scale.

Visit Website

At a Glance

Pricing

Open Source

Get started with Vespa.ai at no cost

Startup: Custom/contact

Basic: Custom/contact

Commercial: Custom/contact

+2 more plans

Engagement

Available On

Web

API

CLI

SDK

Vespa.ai ASTrondheim, NorwayEst. 2023$31M raised

Listed Mar 2026

About Vespa.ai

Vespa.ai is an AI Search Platform designed for developing and operating large-scale applications that combine big data, vector search, machine-learned ranking, and real-time inference. It provides native tensor support for complex ranking and decisioning, enabling real-time AI applications like RAG, recommendation, and intelligent search at enterprise scale. Vespa supports querying, organizing, and making inferences across vectors, tensors, text, and structured data, scaling to billions of constantly changing data items with thousands of queries per second at sub-100ms latencies. The platform is open source at its core and also available as a fully managed cloud service.

Vector & Text Search — Combines leading open text search with a capable vector database, enabling hybrid search applications with superior relevance.
Generative AI (RAG) — Supports hybrid search, relevance models, and multi-vector representations for high-quality retrieval-augmented generation pipelines.
Recommendation & Personalization — Combines retrieval of eligible content with machine-learned model evaluation for recommendation, personalization, and ad targeting at any scale.
Semi-Structured Navigation — Handles e-commerce and similar use cases that blend structured data, text, and images with seamless search and navigation.
Personal/Private Search — Streaming search mode delivers full Vespa capabilities for personal data use cases at up to 20x lower cost than indexed search.
Distributed Machine-Learned Ranking — Integrates distributed ML model inference directly into the serving layer for relevance ranking without external round-trips.
Infinite Automated Scalability — Auto-scales to handle billions of documents and thousands of queries per second with continuous deployment and upgrades.
Vespa Cloud — Fully managed cloud offering with strong security, operational monitoring, and support tiers for production deployments.
Open Source Core — The Vespa engine is open source on GitHub, allowing self-hosting and community contributions alongside the managed cloud option.

Community Discussions

Be the first to start a conversation about Vespa.ai

Share your experience with Vespa.ai, ask questions, or help others learn from your insights.

Pricing

Startup

For testing and getting started. Managed operations with restrictions including shared resources, no SSO, no autoscaling, and dev zones only.

Custom

contact sales

vCPU at $0.05/hour
Memory GB at $0.005/hour
Disk GB at $0.0002/hour
GPU Memory GB at $0.03/hour
Community support only, no SLA
Runs on shared resources
No redundancy by default
No CI/CD pipeline
Dev zones only

Basic

Cloud plan suitable for applications that don't need 24/7 operational support.

Custom

contact sales

vCPU at $0.10/hour
Memory GB at $0.01/hour
Disk GB at $0.0004/hour
GPU Memory GB at $0.07/hour
Pro-active remediation of issues
Production support: next business day
Deployment support: next business day
Other support: next 2 business days
Prices go down with volume

Commercial

Popular

Cloud plan suitable for production applications with 24/7 support included.

Custom

contact sales

vCPU at $0.145/hour
Memory GB at $0.0145/hour
Disk GB at $0.0005/hour
GPU Memory GB at $0.10/hour
Unlimited support cases
Production support: 1 hour 24/7
Deployment support: next business day
Other support: next 2 business days
Automated ops, deployments and upgrades
Prices go down with volume

Enterprise

Cloud plan for enterprises with 24/7 deployment support, dedicated services, and minimum monthly spend of $20,000.

$20000

per month

vCPU at $0.18/hour
Memory GB at $0.018/hour
Disk GB at $0.0007/hour
GPU Memory GB at $0.125/hour
Production support: 15 minutes 24/7
Deployment support: 1 hour 24/7
Other support: next business day
Single sign-on (SSO)
Named support representative
Tune-up program participation
Dedicated Slack channel
On-site visits
Prices go down with volume

Self Managed

Self-managed Vespa deployment with dedicated support including a support representative and Slack channel.

Custom

contact sales

Self-managed Vespa deployment
Unlimited support cases
Dedicated support representative
Dedicated Slack channel
Support response time per contract

View official pricing

Capabilities

Key Features

Vector search
Text search
Hybrid search
Machine-learned ranking
Real-time inference
RAG (Retrieval-Augmented Generation)
Recommendation and personalization
Streaming search for personal data
Tensor formalism
Distributed ML model inference
Auto-scaling
Continuous deployment
Fully managed cloud (Vespa Cloud)
Semi-structured navigation
Visual retrieval

Integrations

Voyage AI

API Available

View Docs

Back to all tools Suggest an edit

About Vespa.ai

Vector & Text Search — Combines leading open text search with a capable vector database, enabling hybrid search applications with superior relevance.
Generative AI (RAG) — Supports hybrid search, relevance models, and multi-vector representations for high-quality retrieval-augmented generation pipelines.
Recommendation & Personalization — Combines retrieval of eligible content with machine-learned model evaluation for recommendation, personalization, and ad targeting at any scale.
Semi-Structured Navigation — Handles e-commerce and similar use cases that blend structured data, text, and images with seamless search and navigation.
Personal/Private Search — Streaming search mode delivers full Vespa capabilities for personal data use cases at up to 20x lower cost than indexed search.
Distributed Machine-Learned Ranking — Integrates distributed ML model inference directly into the serving layer for relevance ranking without external round-trips.
Infinite Automated Scalability — Auto-scales to handle billions of documents and thousands of queries per second with continuous deployment and upgrades.
Vespa Cloud — Fully managed cloud offering with strong security, operational monitoring, and support tiers for production deployments.
Open Source Core — The Vespa engine is open source on GitHub, allowing self-hosting and community contributions alongside the managed cloud option.

Vespa.ai