Playground

Playground

Document

/api/v1/knowledge/documents

A document indexed for search and RAG.

listgetcreateupdatedelete

Schema

FieldTypeNotes
id*uuidServer-assigned ULID with type prefix (e.g. per_…).
tenant_id*stringTenant scope — auto-stamped from the caller's JWT.
app_idstringApp scope. Stamped ONLY when the caller's JWT was minted for a specific Application (integration API keys). Absent for human-user sessions. Filters reads when present.
created_at*date-timeServer stamp.
updated_at*date-timeServer stamp; updated on every patch.
created_byuuidPerson id from the caller's JWT (sub).
updated_byuuidPerson id from the last writer's JWT (sub).
sourcestringProvenance tag — defaults to 'edm'.
source_typeenumfrontend | backend | server | system | apiWhere the write originated. Defaults to 'api'.
is_deletedbooleanSoft-delete flag. Excluded from default list queries.
deleted_atdate-timeStamped when soft-deleted; null otherwise.
deleted_byuuidPerson id who soft-deleted; null otherwise.
schema_versionnumberDocument schema version. Bumped on incompatible writes.
account_idstringOptional account scope.
collection_idstringLogical grouping, e.g. kb-security, kb-product.
kindstringDocument kind classification.
titlestringHuman-readable document title.
descriptionstringOptional long-form description of the document.
mime_typestringMIME type of the document bytes.
size_bytesnumberSize of the document in bytes.
file_hashstringSHA-256 hash of the document bytes, used as a deduplication key.
minio_keystringMinIO object key for the document bytes. Format: cpod-files/{tenantId}/{collectionId}/{docId}/source.{ext}.
languagestringBCP-47 language tag for the document content.
page_countnumberNumber of pages in the document.
extractorstringExtraction pipeline used to process this document.
extraction_statusstringCurrent status of the document extraction pipeline.
extracted_atdate-timeISO 8601 timestamp when extraction completed.
chunk_countnumberCached count of chunks produced from this document.
embedding_modelstringEmbedding model used to generate chunk vectors.
embedding_dimensionsnumberDimensionality of the embedding vectors.
chunk_idsjsonMaterial list of chunk IDs (also derivable from graph edges).
entity_idsjsonIDs of extracted KnowledgeEntity nodes (kne-<uuid>).
effective_fromdate-timeStart of the validity window for policy or contract documents.
effective_todate-timeEnd of the validity window for policy or contract documents.
versionstringDocument's own version string, independent of the record version header.
parent_document_idstringID of the document this one supersedes, if applicable.
acl_scopestringAccess control scope for this document.
acl_allowed_role_idsjsonRole IDs permitted to access this document when aclScope is restricted.
tagsjsonFree-form tags for categorisation and search.
statusenumactive | archived | draftLifecycle status of this document record.

API

client.knowledge.documents.*
Loading manifest…