Weaviate is an open-source, cloud-native vector database designed for storing, searching, and managing data objects alongside their vector embeddings. It supports semantic search, hybrid search (combining vector and keyword search), generative AI workflows, and scales to billions of objects in production.
# Install the Python client
pip install weaviate-client
Create a docker-compose.yml file:
version: '3.4'
services:
weaviate:
image: cr.weaviate.io/semitechnologies/weaviate:1.27.0
restart: on-failure
ports:
- "8080:8080"
- "50051:50051"
environment:
QUERY_DEFAULTS_LIMIT: 25
AUTHENTICATION_ANONYMOUS_ACCESS_ENABLED: 'true'
PERSISTENCE_DATA_PATH: '/var/lib/weaviate'
DEFAULT_VECTORIZER_MODULE: 'text2vec-openai'
ENABLE_MODULES: 'text2vec-openai,generative-openai'
OPENAI_APIKEY: '${OPENAI_APIKEY}'
CLUSTER_HOSTNAME: 'node1'
volumes:
- weaviate_data:/var/lib/weaviate
volumes:
weaviate_data:
# Start Weaviate
docker compose up -d
# Verify it is running
curl http://localhost:8080/v1/.well-known/ready
import weaviate
from weaviate.classes.init import Auth
# --- Option 1: Local instance ---
client = weaviate.connect_to_local()
# --- Option 2: Weaviate Cloud ---
client = weaviate.connect_to_weaviate_cloud(
cluster_url="https://your-cluster.weaviate.network",
auth_credentials=Auth.api_key("your-weaviate-api-key"),
headers={"X-OpenAI-Api-Key": "your-openai-key"}
)
print(client.is_ready()) # True
Weaviate organizes data into collections (formerly called "classes"). Each collection has a name, properties (fields), and a vectorizer configuration.
import weaviate
import weaviate.classes.config as wc
client = weaviate.connect_to_local()
# Create a collection with explicit property definitions
client.collections.create(
name="Article",
description="News articles with semantic search",
vectorizer_config=wc.Configure.Vectorizer.text2vec_openai(
model="text-embedding-3-small"
),
generative_config=wc.Configure.Generative.openai(
model="gpt-4o-mini"
),
properties=[
wc.Property(name="title", data_type=wc.DataType.TEXT),
wc.Property(name="content", data_type=wc.DataType.TEXT),
wc.Property(name="category", data_type=wc.DataType.TEXT,
skip_vectorization=True),
wc.Property(name="published", data_type=wc.DataType.DATE),
wc.Property(name="word_count", data_type=wc.DataType.INT),
wc.Property(name="is_premium", data_type=wc.DataType.BOOL),
]
)
print("Collection 'Article' created.")
TEXT — strings (vectorized by default)
INT, NUMBER — integers and floats
BOOL — boolean values
DATE — RFC 3339 formatted dates
UUID — universally unique identifiers
TEXT_ARRAY, INT_ARRAY, NUMBER_ARRAY — arrays
OBJECT, OBJECT_ARRAY — nested objects
BLOB — base64-encoded binary data
GEO_COORDINATES — latitude/longitude pairs

# List all collection names
collections = client.collections.list_all()
for name in collections:
print(name)
# Get detailed config for a specific collection
article = client.collections.get("Article")
config = article.config.get()
print(f"Vectorizer: {config.vectorizer}")
print(f"Properties: {[p.name for p in config.properties]}")
from datetime import datetime
articles = client.collections.get("Article")
# --- Insert a single object ---
uuid = articles.data.insert(
properties={
"title": "Introduction to Vector Databases",
"content": "Vector databases store data as high-dimensional vectors, "
"enabling semantic similarity search across millions of records.",
"category": "Technology",
"published": datetime(2024, 6, 15),
"word_count": 1200,
"is_premium": False,
}
)
print(f"Inserted with UUID: {uuid}")
# --- Batch insert multiple objects ---
with articles.batch.dynamic() as batch:
data = [
{
"title": "Understanding HNSW Indexing",
"content": "Hierarchical Navigable Small World graphs provide "
"logarithmic search complexity for nearest neighbor queries.",
"category": "Engineering",
"published": datetime(2024, 7, 20),
"word_count": 2500,
"is_premium": True,
},
{
"title": "RAG Pipeline Best Practices",
"content": "Retrieval-Augmented Generation combines vector search "
"with large language models to produce grounded answers.",
"category": "AI",
"published": datetime(2024, 8, 5),
"word_count": 1800,
"is_premium": False,
},
{
"title": "Scaling Vector Search to Billions",
"content": "Horizontal sharding and product quantization enable "
"vector databases to handle billions of embeddings.",
"category": "Engineering",
"published": datetime(2024, 9, 10),
"word_count": 3200,
"is_premium": True,
},
]
for item in data:
batch.add_object(properties=item)
print(f"Batch inserted {len(data)} objects.")
articles = client.collections.get("Article")
# Fetch a specific object by UUID
obj = articles.query.fetch_object_by_id(uuid)
print(f"Title: {obj.properties['title']}")
print(f"UUID: {obj.uuid}")
# Fetch multiple objects with a limit
result = articles.query.fetch_objects(limit=10)
for o in result.objects:
print(f" {o.properties['title']} ({o.properties['category']})")
articles = client.collections.get("Article")
# Update specific properties (partial update)
articles.data.update(
uuid=uuid,
properties={
"word_count": 1350,
"is_premium": True,
}
)
print("Object updated.")
# Replace all properties (full replace — omitted fields become null)
articles.data.replace(
uuid=uuid,
properties={
"title": "Introduction to Vector Databases (Revised)",
"content": "Updated and expanded guide to vector databases.",
"category": "Technology",
"published": datetime(2024, 10, 1),
"word_count": 2000,
"is_premium": True,
}
)
articles = client.collections.get("Article")
# Delete by UUID
articles.data.delete_by_id(uuid)
# Delete by filter (bulk delete)
from weaviate.classes.query import Filter
articles.data.delete_many(
where=Filter.by_property("category").equal("Archived")
)
# Delete entire collection
client.collections.delete("Article")
Vector search finds objects whose embeddings are closest to a query vector. Weaviate exposes this as near_text (which vectorizes your query text automatically) and near_vector (where you supply the raw vector yourself).
from weaviate.classes.query import MetadataQuery
articles = client.collections.get("Article")
# Search by natural language query — Weaviate vectorizes it automatically
response = articles.query.near_text(
query="How do vector databases work internally?",
limit=5,
return_metadata=MetadataQuery(distance=True, certainty=True)
)
for obj in response.objects:
print(f"[{obj.metadata.distance:.4f}] {obj.properties['title']}")
print(f" {obj.properties['content'][:100]}...")
print()
import openai
# Generate your own embedding
oai = openai.OpenAI()
embedding = oai.embeddings.create(
input="scalable search architecture",
model="text-embedding-3-small"
).data[0].embedding
articles = client.collections.get("Article")
response = articles.query.near_vector(
near_vector=embedding,
limit=3,
return_metadata=MetadataQuery(distance=True)
)
for obj in response.objects:
print(f"[{obj.metadata.distance:.4f}] {obj.properties['title']}")
cosine (default) — measures angle between vectors. Best for normalized embeddings from most models.
dot — dot product. Faster, but sensitive to vector magnitude.
l2-squared — Euclidean distance. Good when absolute magnitude matters.
manhattan — L1 distance. Sometimes better for sparse or high-dimensional vectors.
hamming — for binary vectors. Counts differing dimensions.

Hybrid search combines BM25 keyword matching with vector similarity and fuses the results. The alpha parameter controls the balance: alpha=0 is pure keyword, alpha=1 is pure vector, and alpha=0.5 is an equal mix.
from weaviate.classes.query import MetadataQuery
articles = client.collections.get("Article")
response = articles.query.hybrid(
query="HNSW graph indexing for nearest neighbor",
alpha=0.75, # 75% vector, 25% keyword
limit=5,
return_metadata=MetadataQuery(score=True, explain_score=True)
)
for obj in response.objects:
print(f"[score: {obj.metadata.score:.4f}] {obj.properties['title']}")
from weaviate.classes.query import Filter
response = articles.query.hybrid(
query="production deployment best practices",
alpha=0.6,
limit=5,
filters=Filter.by_property("is_premium").equal(True),
)
for obj in response.objects:
print(f" {obj.properties['title']} (premium)")
from weaviate.classes.query import Filter
articles = client.collections.get("Article")
# Single filter
response = articles.query.fetch_objects(
filters=Filter.by_property("category").equal("Engineering"),
limit=10
)
# Compound filters with AND / OR
response = articles.query.fetch_objects(
filters=(
Filter.by_property("category").equal("AI") &
Filter.by_property("word_count").greater_than(1000) &
Filter.by_property("is_premium").equal(False)
),
limit=10
)
# OR filter
response = articles.query.fetch_objects(
filters=(
Filter.by_property("category").equal("AI") |
Filter.by_property("category").equal("Engineering")
),
limit=10
)
for obj in response.objects:
print(f" {obj.properties['title']} — {obj.properties['category']}")
.equal(value) — exact match
.not_equal(value) — not equal
.greater_than(value), .greater_or_equal(value)
.less_than(value), .less_or_equal(value)
.like(pattern) — wildcard text matching (* and ?)
.contains_any(list), .contains_all(list) — array membership
.is_none(True) — check for null values

articles = client.collections.get("Article")
# Count objects matching a filter
result = articles.aggregate.over_all(total_count=True)
print(f"Total articles: {result.total_count}")
# Aggregate with filters
from weaviate.classes.aggregate import Metrics
result = articles.aggregate.over_all(
filters=Filter.by_property("category").equal("Engineering"),
return_metrics=Metrics("word_count").integer(
count=True, sum_=True, mean=True, maximum=True, minimum=True
),
total_count=True
)
print(f"Engineering articles: {result.total_count}")
print(f"Avg word count: {result.properties['word_count'].mean}")
print(f"Total words: {result.properties['word_count'].sum_}")
Weaviate's generative module sends retrieved objects to an LLM to produce grounded answers — a built-in RAG pipeline with no external orchestration needed.
from weaviate.classes.query import MetadataQuery
articles = client.collections.get("Article")
# Retrieve + generate per object
response = articles.generate.near_text(
query="vector database indexing",
limit=3,
single_prompt=(
"Summarize this article in two sentences: "
"{title} — {content}"
),
return_metadata=MetadataQuery(distance=True)
)
for obj in response.objects:
print(f"Title: {obj.properties['title']}")
print(f"Summary: {obj.generated}")
print()
# Retrieve multiple objects, then generate ONE answer from all of them
response = articles.generate.near_text(
query="How do vector databases scale?",
limit=5,
grouped_task=(
"Using the following articles as context, write a comprehensive "
"paragraph explaining how vector databases achieve scale. "
"Cite specific techniques mentioned in the articles."
)
)
# The grouped answer is on the response object, not individual objects
print("Generated answer:")
print(response.generated)
print(f"\nBased on {len(response.objects)} source articles.")
from weaviate.classes.query import Filter
response = articles.generate.hybrid(
query="production deployment",
alpha=0.7,
limit=3,
filters=Filter.by_property("is_premium").equal(True),
grouped_task="Create a bullet-point checklist for deploying a "
"vector database in production based on these articles."
)
print(response.generated)
Multi-tenancy isolates data per tenant within a single collection. Each tenant gets its own vector index partition, so tenants cannot see each other's data and inactive tenants can be offloaded to cold storage.
import weaviate.classes.config as wc
client.collections.create(
name="UserDocument",
multi_tenancy_config=wc.Configure.multi_tenancy(
enabled=True,
auto_tenant_creation=True, # auto-create tenants on insert
auto_tenant_activation=True, # auto-activate on access
),
vectorizer_config=wc.Configure.Vectorizer.text2vec_openai(),
properties=[
wc.Property(name="title", data_type=wc.DataType.TEXT),
wc.Property(name="content", data_type=wc.DataType.TEXT),
]
)
from weaviate.classes.tenants import Tenant, TenantActivityStatus
collection = client.collections.get("UserDocument")
# Add tenants explicitly
collection.tenants.create([
Tenant(name="tenant_A"),
Tenant(name="tenant_B"),
Tenant(name="tenant_C"),
])
# Deactivate a tenant (offload to cold storage)
collection.tenants.update([
Tenant(name="tenant_C", activity_status=TenantActivityStatus.INACTIVE)
])
# List all tenants
tenants = collection.tenants.get()
for name, tenant in tenants.items():
print(f" {name}: {tenant.activity_status}")
# Get a tenant-scoped collection handle
tenant_a = client.collections.get("UserDocument").with_tenant("tenant_A")
# Insert data — only visible to tenant_A
tenant_a.data.insert(properties={
"title": "Tenant A's Private Document",
"content": "This data is isolated to tenant A only."
})
# Search within tenant_A's data only
response = tenant_a.query.near_text(
query="private document",
limit=5
)
for obj in response.objects:
print(f" {obj.properties['title']}")
Raise ef (search-time beam width) for higher recall at the cost of latency. Increase efConstruction and maxConnections at index build time for better graph quality.
Set skip_vectorization=True on properties that carry no semantic meaning — they add noise to the vector.

import weaviate.classes.config as wc
client.collections.create(
name="LargeScaleArticle",
vectorizer_config=wc.Configure.Vectorizer.text2vec_openai(),
vector_index_config=wc.Configure.VectorIndex.hnsw(
distance_metric=wc.VectorDistances.COSINE,
ef=200,
ef_construction=256,
max_connections=32,
quantizer=wc.Configure.VectorIndex.Quantizer.pq(
segments=128,
training_limit=100000,
)
),
properties=[
wc.Property(name="title", data_type=wc.DataType.TEXT),
wc.Property(name="content", data_type=wc.DataType.TEXT),
]
)
import weaviate.classes.config as wc
client.collections.create(
name="HighAvailabilityArticle",
vectorizer_config=wc.Configure.Vectorizer.text2vec_openai(),
replication_config=wc.Configure.replication(factor=3),
sharding_config=wc.Configure.sharding(
desired_count=3, # number of shards
virtual_per_physical=128, # virtual shards per physical
),
properties=[
wc.Property(name="title", data_type=wc.DataType.TEXT),
wc.Property(name="content", data_type=wc.DataType.TEXT),
]
)
from weaviate.classes.init import Auth
# API Key authentication
client = weaviate.connect_to_weaviate_cloud(
cluster_url="https://your-cluster.weaviate.network",
auth_credentials=Auth.api_key("your-api-key"),
)
# OIDC authentication (e.g., with Azure AD or Okta)
client = weaviate.connect_to_weaviate_cloud(
cluster_url="https://your-cluster.weaviate.network",
auth_credentials=Auth.client_credentials(
client_id="your-client-id",
client_secret="your-client-secret",
)
)
# Create a backup to a configured backend (S3, GCS, or filesystem)
result = client.backup.create(
backup_id="daily-backup-2024-09-15",
backend="s3",
include_collections=["Article", "UserDocument"],
wait_for_completion=True,
)
print(f"Backup status: {result.status}")
# Restore from backup
result = client.backup.restore(
backup_id="daily-backup-2024-09-15",
backend="s3",
wait_for_completion=True,
)
print(f"Restore status: {result.status}")
Monitor the /v1/meta and Prometheus metrics endpoints.
Mount /var/lib/weaviate to durable storage (EBS, persistent disks).

A self-contained example that creates a collection, inserts documents, and performs vector, hybrid, and generative searches:
import weaviate
import weaviate.classes.config as wc
from weaviate.classes.query import MetadataQuery, Filter
from datetime import datetime
# ── Connect ──────────────────────────────────────────────────
client = weaviate.connect_to_local()
# ── Create collection ────────────────────────────────────────
if client.collections.exists("KnowledgeBase"):
client.collections.delete("KnowledgeBase")
client.collections.create(
name="KnowledgeBase",
vectorizer_config=wc.Configure.Vectorizer.text2vec_openai(
model="text-embedding-3-small"
),
generative_config=wc.Configure.Generative.openai(model="gpt-4o-mini"),
properties=[
wc.Property(name="title", data_type=wc.DataType.TEXT),
wc.Property(name="content", data_type=wc.DataType.TEXT),
wc.Property(name="topic", data_type=wc.DataType.TEXT,
skip_vectorization=True),
wc.Property(name="created", data_type=wc.DataType.DATE),
]
)
# ── Batch insert ─────────────────────────────────────────────
kb = client.collections.get("KnowledgeBase")
docs = [
{"title": "What is HNSW?",
"content": "HNSW is a graph-based algorithm for approximate nearest "
"neighbor search with logarithmic complexity.",
"topic": "indexing", "created": datetime(2024, 1, 10)},
{"title": "Product Quantization Explained",
"content": "PQ compresses vectors by splitting them into sub-vectors "
"and quantizing each independently.",
"topic": "compression", "created": datetime(2024, 3, 22)},
{"title": "BM25 Scoring",
"content": "BM25 is a probabilistic ranking function used in keyword "
"search based on term frequency and document length.",
"topic": "search", "created": datetime(2024, 5, 14)},
{"title": "Cosine Similarity",
"content": "Cosine similarity measures the angle between two vectors, "
"producing a value from -1 to 1.",
"topic": "metrics", "created": datetime(2024, 6, 1)},
{"title": "Hybrid Search Strategies",
"content": "Hybrid search merges BM25 keyword scores with vector "
"similarity scores using reciprocal rank fusion.",
"topic": "search", "created": datetime(2024, 7, 18)},
]
with kb.batch.dynamic() as batch:
for doc in docs:
batch.add_object(properties=doc)
print(f"Inserted {len(docs)} documents.\n")
# ── Vector search ────────────────────────────────────────────
print("=== Vector Search ===")
response = kb.query.near_text(
query="How does approximate nearest neighbor work?",
limit=3,
return_metadata=MetadataQuery(distance=True)
)
for obj in response.objects:
print(f" [{obj.metadata.distance:.4f}] {obj.properties['title']}")
# ── Hybrid search ────────────────────────────────────────────
print("\n=== Hybrid Search ===")
response = kb.query.hybrid(
query="BM25 keyword scoring",
alpha=0.5,
limit=3,
return_metadata=MetadataQuery(score=True)
)
for obj in response.objects:
print(f" [{obj.metadata.score:.4f}] {obj.properties['title']}")
# ── Filtered search ──────────────────────────────────────────
print("\n=== Filtered Search (topic=search) ===")
response = kb.query.near_text(
query="ranking and retrieval",
limit=5,
filters=Filter.by_property("topic").equal("search"),
return_metadata=MetadataQuery(distance=True)
)
for obj in response.objects:
print(f" [{obj.metadata.distance:.4f}] {obj.properties['title']}")
# ── Generative search (RAG) ─────────────────────────────────
print("\n=== Generative Search (RAG) ===")
response = kb.generate.near_text(
query="vector search algorithms",
limit=3,
grouped_task="Based on these articles, explain in 3 sentences how "
"modern vector databases achieve fast similarity search."
)
print(f" Generated: {response.generated}")
# ── Cleanup ──────────────────────────────────────────────────
client.close()
print("\nDone.")
Three knobs matter. efConstruction controls index build quality (higher = better recall, slower build, more RAM); typical 128–256. maxConnections (M) is the graph fan-out per node (higher = better recall, larger index); typical 16–64. ef is query-time search breadth (higher = better recall, slower queries); typical 64–512, adjustable at runtime without rebuilding the index. Tune by fixing efConstruction and M at sane defaults, then sweep ef on a labeled query set until recall@10 plateaus — that's your production setting. Pushing ef beyond that point adds latency without buying recall.
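A rough sketch of that sweep, assuming you hold a small labeled query set (query text mapped to the UUIDs that should come back) and that ef is one of the runtime-mutable HNSW settings exposed through the v4 client's Reconfigure helper:
from weaviate.classes.config import Reconfigure

kb = client.collections.get("KnowledgeBase")

# Hypothetical ground truth: query text -> UUIDs expected in the top 10
ground_truth = {
    "approximate nearest neighbor search": {"<uuid-of-relevant-doc-1>", "<uuid-of-relevant-doc-2>"},
}

def recall_at_10(collection, labels):
    hits, total = 0, 0
    for query, expected in labels.items():
        found = {str(o.uuid) for o in collection.query.near_text(query=query, limit=10).objects}
        hits += len(found & expected)
        total += len(expected)
    return hits / total if total else 0.0

# Sweep ef and stop where recall@10 stops improving
for ef in (64, 128, 256, 512):
    kb.config.update(vector_index_config=Reconfigure.VectorIndex.hnsw(ef=ef))
    print(f"ef={ef}: recall@10 = {recall_at_10(kb, ground_truth):.3f}")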
Weaviate runs both a BM25 sparse search and an HNSW vector search in parallel, then fuses the results. The default fusion is rankedFusion (RRF-style: combines on rank position) but you can switch to relativeScoreFusion which normalizes raw scores. The alpha parameter slides between pure BM25 (alpha=0) and pure vector (alpha=1); 0.5 is the typical default. Hybrid is enabled per query, not per index, so you can A/B test it without re-indexing. The win on enterprise corpora is consistently 5–15 points of recall over either method alone, especially for queries containing rare exact tokens.
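Both the fusion strategy and alpha are query-time arguments in the Python client, so comparing them needs no re-indexing. A short sketch:
from weaviate.classes.query import HybridFusion, MetadataQuery

articles = client.collections.get("Article")

for fusion in (HybridFusion.RANKED, HybridFusion.RELATIVE_SCORE):
    response = articles.query.hybrid(
        query="product quantization memory savings",
        alpha=0.5,              # equal weight to BM25 and vector scores
        fusion_type=fusion,
        limit=3,
        return_metadata=MetadataQuery(score=True),
    )
    print(fusion)
    for obj in response.objects:
        print(f"  [{obj.metadata.score:.4f}] {obj.properties['title']}")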
Weaviate has first-class multi-tenancy: enable it on a class and each tenant gets its own physically separate shard with its own HNSW index. Inactivity makes shards offload to disk; activation reloads them. The wins are clean isolation (deleting a tenant is one operation), per-tenant backup/restore, and no cross-tenant query leakage by construction. The cost is operational: thousands of tenants means thousands of HNSW indexes to maintain, and small tenants with 100 vectors get the same index overhead as big ones. For tenant counts >10K you batch small tenants under a logical tenant_id filter inside one shard.
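A sketch of that fallback pattern: one ordinary collection, a tenant_id property excluded from vectorization, and a filter on every query. The collection and tenant names here are invented, and isolation is enforced only by the filter, so this belongs behind your own access-control layer rather than relying on Weaviate's physical tenant isolation.
import weaviate.classes.config as wc
from weaviate.classes.query import Filter

# One shared collection instead of one shard per tiny tenant
client.collections.create(
    name="SharedUserDocument",
    vectorizer_config=wc.Configure.Vectorizer.text2vec_openai(),
    properties=[
        wc.Property(name="tenant_id", data_type=wc.DataType.TEXT,
                    skip_vectorization=True),
        wc.Property(name="title", data_type=wc.DataType.TEXT),
        wc.Property(name="content", data_type=wc.DataType.TEXT),
    ]
)

docs = client.collections.get("SharedUserDocument")
docs.data.insert(properties={
    "tenant_id": "small_tenant_042",
    "title": "Shared-shard document",
    "content": "Stored alongside other small tenants in one index.",
})

# Every query for a small tenant carries the tenant_id filter
response = docs.query.near_text(
    query="shared shard",
    filters=Filter.by_property("tenant_id").equal("small_tenant_042"),
    limit=5,
)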
Weaviate wins when you need: native multi-tenancy out of the box (vs Postgres-pgvector where you build it), built-in hybrid search without bolting on Elasticsearch, modules that auto-vectorize at write time (text2vec-openai, text2vec-cohere) so you don't run an embedding pipeline yourself, and GraphQL queries with cross-references between objects (object-graph traversal alongside vector search). It loses to pgvector on operational simplicity if you already run Postgres, to Pinecone on managed-service smoothness, and to Milvus at the absolute extreme of scale (multi-billion vectors with strict latency).
Weaviate ships a backup module that snapshots all classes (or a subset) to S3, GCS, Azure Blob, or filesystem. Backups are online — reads continue, writes are briefly paused per-shard. Restore creates a new cluster (or restores into the same one) from the snapshot path. The gotcha is module compatibility: if the source used text2vec-openai and the target doesn't have OpenAI credentials configured, queries that auto-vectorize will fail. For multi-tenant deployments, backup individual tenants by name to avoid snapshotting the world. Always test restore in a staging cluster — an untested backup is a wish.
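A minimal restore drill along those lines, assuming a separate staging cluster with the same S3 backup backend configured (URLs and keys are placeholders):
import weaviate
from weaviate.classes.init import Auth

staging = weaviate.connect_to_weaviate_cloud(
    cluster_url="https://your-staging-cluster.weaviate.network",
    auth_credentials=Auth.api_key("your-staging-api-key"),
    # The module-compatibility gotcha: without this header, text2vec-openai
    # queries against the restored data will fail.
    headers={"X-OpenAI-Api-Key": "your-openai-key"},
)

result = staging.backup.restore(
    backup_id="daily-backup-2024-09-15",
    backend="s3",
    include_collections=["Article"],
    wait_for_completion=True,
)
print(f"Restore status: {result.status}")

# Sanity checks: object count and a vectorized query should both succeed
articles = staging.collections.get("Article")
print(articles.aggregate.over_all(total_count=True).total_count)
print(articles.query.near_text(query="smoke test", limit=1).objects)

staging.close()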
Weaviate schemas are flexible: you can add new properties to a class with no downtime — the existing objects get null for the new property until you backfill. You cannot rename or change the type of an existing property; that requires creating a new class and migrating. Re-vectorizing an existing corpus (new embedding model, larger dimensions) means a new class plus re-ingest because the vector dimension is fixed at class creation. Run the new class in parallel, dual-write for the transition window, swap reads, then drop the old class. Same playbook as a database column type migration.
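A sketch of the re-ingest half of that playbook, assuming a new ArticleV2 class on a different embedding model (text-embedding-3-large here is only an example choice); dual-writing and the read switch happen in your application code:
import weaviate.classes.config as wc

# New class with the new embedding model; the old class stays live for reads
client.collections.create(
    name="ArticleV2",
    vectorizer_config=wc.Configure.Vectorizer.text2vec_openai(
        model="text-embedding-3-large"
    ),
    properties=[
        wc.Property(name="title", data_type=wc.DataType.TEXT),
        wc.Property(name="content", data_type=wc.DataType.TEXT),
        wc.Property(name="category", data_type=wc.DataType.TEXT,
                    skip_vectorization=True),
        wc.Property(name="published", data_type=wc.DataType.DATE),
        wc.Property(name="word_count", data_type=wc.DataType.INT),
        wc.Property(name="is_premium", data_type=wc.DataType.BOOL),
    ]
)

old = client.collections.get("Article")
new = client.collections.get("ArticleV2")

# Stream every object out of the old class and re-insert it; Weaviate
# re-vectorizes on write with the new model. Keeping UUIDs stable makes
# the eventual read switch a pure configuration change.
with new.batch.dynamic() as batch:
    for obj in old.iterator():
        batch.add_object(properties=obj.properties, uuid=obj.uuid)

# After reads point at ArticleV2 and results are verified:
# client.collections.delete("Article")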
Curated talks and tutorials covering Weaviate setup, vector and hybrid search, generative RAG, multi-tenancy, and HNSW indexing — chosen to map to the sections above.
Fireship — A 2-minute primer on what vector databases are and why they exploded with LLMs. The fastest possible setup for the rest of this page.
Weaviate — Getting Started meetup: setup, vectorizers, schema definition, importing data, and the GraphQL API in one walkthrough. Maps to sections 2–3.
Weaviate — How to set up Weaviate Embedded directly inside a Python process. The lightweight alternative to the Docker workflow in section 2.
Weaviate — How to choose an embedding model. Dimensions, quality, cost, and locality trade-offs — the decision behind vectorizer_config.
Weaviate — Hello Weaviate Query Examples (Part 1). Live walkthrough of CRUD operations and vector search queries from the Python client.
Sam Witteveen — Advanced RAG 03: hybrid search combining BM25 keyword scoring with vector similarity, plus rank fusion. Section 6 in practice.
Kamalraj M M — Schema design plus optimizing results with querying and filtering. Direct match for section 7 on filters and aggregation.
AI Coffee Break — Generative Feedback Loops in Weaviate: writing LLM-generated content back into the index for richer retrieval. Section 8 deep-dive.
Etienne Dilocker (Weaviate CTO) — Why multi-tenancy in vector search is fundamentally different and how Weaviate’s tenant-per-shard model works. Section 9.
Weaviate — What is HNSW? Visual explainer of the Hierarchical Navigable Small World graph and the ef / maxConnections tuning levers from section 10.
James Briggs — Product Quantization (PQ) explained with Python. The compression technique behind Weaviate’s quantizer.pq config in section 10.
CMU Database Group — Etienne Dilocker walks through Weaviate’s internal architecture in an academic database lecture. The most technical end-to-end talk on this list.