Zvec
An open-source, in-process vector database from Alibaba Group that delivers millisecond semantic search at billion-vector scale with a simple Python API and no external services required.
At a Glance
About Zvec
Zvec is an open-source, in-process vector database developed by Alibaba Group and released under the Apache 2.0 license. It is designed to embed directly into applications — running as a library rather than a separate server — and the project states it has been battle-tested within Alibaba Group for production-grade, low-latency similarity search. The repository reached v0.5.0 in June 2026 and has accumulated over 11,800 GitHub stars.
What It Is
Zvec is a lightweight vector database that runs in-process alongside application code, eliminating the need for external services, network round-trips, or complex configuration. It supports dense and sparse vector embeddings, full-text search, hybrid retrieval, and structured scalar filters — all queryable in a single call. The core engine is written in C++ and exposed through official SDKs for Python, Node.js, Go, Rust, and Dart/Flutter. Its primary use cases include RAG (Retrieval-Augmented Generation) pipelines, image search, and code search.
In-Process Architecture
Unlike client-server vector databases, Zvec runs directly inside the host process. This means:
- No external daemon or container to manage
- No network latency between the application and the index
- Works in notebooks, CLI tools, servers, and edge devices
- Multiple processes can read the same collection simultaneously; writes are single-process exclusive
- Write-ahead logging (WAL) guarantees persistence even on process crash or power failure
The DiskANN index type (added in v0.5.0) keeps the bulk of the index on disk, reducing memory pressure for large-scale datasets.
Performance Benchmarks
The project publishes benchmark results on the Cohere 10M vector dataset. According to the Zvec documentation, the database indexes 10 million vectors in approximately one hour and sustains over 8,500 queries per second (QPS). The homepage describes this as "millisecond search at billion-vector scale," though the published benchmark covers the 10M dataset specifically.
SDK and Platform Coverage
Zvec ships official SDKs across five languages:
- Python (
pip install zvec, Python 3.10–3.14) - Node.js (
npm install @zvec/zvec) - Go (via the
zvec-gorepository) - Rust (via the
zvec-rustrepository) - Dart/Flutter (
flutter pub add zvec)
Supported platforms include Linux (x86_64, ARM64), macOS (ARM64), Windows (x86_64), and RISC-V (added in v0.5.0). A companion visual tool, Zvec Studio, allows browsing data and debugging queries without writing code.
Update: v0.5.0 (June 2026)
The v0.5.0 release, published June 12, 2026, introduced several significant capabilities:
- Full-Text Search (FTS): Native keyword-based full-text search attached to any string field, with no external search engine required
- Hybrid Retrieval: A single
MultiQuerycan combine dense vectors, sparse vectors, scalar filters, and full-text search - DiskANN Index: On-disk index type that drastically reduces memory usage for large datasets
- New SDKs: Official Go and Rust bindings, plus RISC-V platform support
- Zvec Studio: A visual GUI tool for data browsing and query debugging
The project roadmap is tracked publicly on GitHub issues, and the repository shows active CI/CD pipelines and ongoing community contributions.
Community Discussions
Be the first to start a conversation about Zvec
Share your experience with Zvec, ask questions, or help others learn from your insights.
Pricing
Open Source
Fully free and open-source under Apache 2.0. All features included.
- In-process vector database
- Dense and sparse vector support
- Full-text search
- Hybrid retrieval
- DiskANN on-disk index
Capabilities
Key Features
- In-process vector database (no external server required)
- Dense and sparse vector support
- Multi-vector queries
- Full-text search (FTS) on string fields
- Hybrid retrieval combining vector, full-text, and scalar filters in a single query
- Filtered vector search
- Grouped search (GROUP BY-style)
- DiskANN on-disk index for large-scale datasets
- HNSW index support
- Write-ahead logging (WAL) for durability
- Concurrent multi-process read access
- Python, Node.js, Go, Rust, and Dart/Flutter SDKs
- Zvec Studio visual GUI tool
- RISC-V platform support
- Benchmarked at 8500+ QPS on 10M vectors
