Skip to main content

Menu

LEVEL 0
0/5 XP
HomeAboutTopicsPricingMy VaultStats

Categories

πŸ€– Artificial Intelligence
☁️ Cloud and Infrastructure
πŸ’Ύ Data and Databases
πŸ’Ό Professional Skills
🎯 Programming and Development
πŸ”’ Security and Networking
πŸ“š Specialized Topics
HomeAboutTopicsPricingMy VaultStats
LEVEL 0
0/5 XP
GitHub
Β© 2026 CheatGridβ„’. All rights reserved.
Privacy PolicyTerms of UseAboutContact

Weaviate (Vector Database) Cheat Sheet

Weaviate (Vector Database) Cheat Sheet

Back to Generative AI
Updated 2026-05-21
Next Topic: World Models and Neural Simulators Cheat Sheet

Weaviate is an open-source, cloud-native vector database that stores data objects together with their vector embeddings, enabling both semantic (vector) and keyword (BM25) search in a single system. It sits at the center of modern RAG (Retrieval-Augmented Generation) pipelines, combining retrieval with integrated generative AI modules so that search and generation happen in one query. Unlike standalone vector stores, Weaviate ships with a full schema system, inverted indexes, multi-tenancy, replication, and a pluggable module ecosystem β€” making the key mental model that each "collection" in Weaviate is simultaneously a vector index, an inverted index, and a structured object store.

What This Cheat Sheet Covers

This topic spans 17 focused tables and 166 indexed concepts. Below is a complete table-by-table outline of this topic, spanning foundational concepts through advanced details.

Table 1: Collection (Class) Definition β€” Top-Level ParametersTable 2: Property Data Types and TokenizationTable 3: Vectorizer Modules (text2vec-*)Table 4: Vector Index Types and ConfigurationTable 5: Vector Compression (Quantization)Table 6: BM25 and Hybrid SearchTable 7: Vector Similarity Search OperatorsTable 8: Conditional Filters (where)Table 9: Generative Modules (RAG)Table 10: Reranker ModulesTable 11: GraphQL Query APITable 12: REST API β€” CRUD and Admin EndpointsTable 13: Multi-Tenancy OperationsTable 14: Replication and ShardingTable 15: Authentication, Authorization, and RBACTable 16: Python Client v4 β€” Key PatternsTable 17: ref2vec-centroid Module

Table 1: Collection (Class) Definition β€” Top-Level Parameters

A Weaviate collection (formerly called a "class") is the core schema unit that defines how objects are stored, vectorized, and indexed. Every collection must be planned carefully before import because several top-level settings β€” vectorizer, index type, sharding, and multi-tenancy β€” are immutable after creation.

ParameterExampleDescription
class
"class": "Article"
Collection name; must start with an uppercase letter; immutable after creation.
description
"description": "News articles"
Human-readable documentation string; mutable.
vectorizer
"vectorizer": "text2vec-openai"
Module that auto-generates vectors at import/query time; immutable after creation; set "none" for BYOV.
vectorIndexType
"vectorIndexType": "hnsw"
Vector index algorithm: hnsw (default), flat, dynamic, or hfresh; immutable after creation.
vectorIndexConfig
"vectorIndexConfig": {"ef": 64, "efConstruction": 128}
Fine-tunes the chosen index (e.g., HNSW ef, efConstruction, maxConnections); partially mutable.
vectorConfig
"vectorConfig": {"title": {"vectorizer": {...}, "vectorIndexConfig": {...}}}
Named vectors β€” defines multiple independent vector spaces per object, each with its own vectorizer and index.
properties
"properties": [{"name": "title", "dataType": ["text"]}]
Array of property definitions; new properties can be added later but existing ones cannot be deleted.

More in Generative AI

  • vLLM (LLM Inference Engine) Cheat Sheet
  • World Models and Neural Simulators Cheat Sheet
  • Advanced RAG Patterns and Optimization Cheat Sheet
  • ColBERT and Late Interaction Retrieval Cheat Sheet
  • LangSmith Cheat Sheet
  • NL-to-SQL and Text-to-Code Generation Cheat Sheet
View all 95 topics in Generative AI