Knowledge Base Schema
JSON Schema for knowledge base definitions
Knowledge Base Definition
Schema Definition
BaseConfig declares a knowledge base and governs how it is ingested and retrieved.
chunking ChunkingConfig tunes how documents are split before embedding.
description embedder id ingest metadata MetadataConfig carries optional descriptive metadata for knowledge bases.
preprocess PreprocessConfig configures preprocessing steps applied to raw content.
retrieval RetrievalConfig manages how stored chunks are queried and injected into prompts.
sources vector_db Example
knowledge_bases:
- id: quickstart_docs
embedder: openai_default
vector_db: filesystem_faststart
sources:
- kind: markdown_glob
glob: "docs/**/*.md"
chunking:
strategy: token
size: 512
overlap: 64
retrieval:
top_k: 5
min_score: 0.2
filters:
tag: "policy"sourcessupportsmarkdown_globandurlin the MVP. Additional kinds appear when feature flags are enabled.- Omit
size,overlap, ortop_kto inherit defaults fromconfig.knowledge. - Use metadata under
sources[].metadatato attach tags for filtered retrieval.