Knowledge Base Schema
JSON Schema for knowledge base definitions
Knowledge Base Definition
Schema Definition
BaseConfig declares a knowledge base and governs how it is ingested and retrieved.
chunking
ChunkingConfig tunes how documents are split before embedding.
description
embedder
id
ingest
metadata
MetadataConfig carries optional descriptive metadata for knowledge bases.
preprocess
PreprocessConfig configures preprocessing steps applied to raw content.
retrieval
RetrievalConfig manages how stored chunks are queried and injected into prompts.
sources
vector_db
Example
knowledge_bases:
- id: quickstart_docs
embedder: openai_default
vector_db: filesystem_faststart
sources:
- kind: markdown_glob
glob: "docs/**/*.md"
chunking:
strategy: token
size: 512
overlap: 64
retrieval:
top_k: 5
min_score: 0.2
filters:
tag: "policy"
sources
supportsmarkdown_glob
andurl
in the MVP. Additional kinds appear when feature flags are enabled.- Omit
size
,overlap
, ortop_k
to inherit defaults fromconfig.knowledge
. - Use metadata under
sources[].metadata
to attach tags for filtered retrieval.