Curation
Quality, deduplication, and metadata consistency rules for wiki-base updates.
Use this when writing or updating wiki-base data.
Core checks
- Search first (queryNodes, searchContentEmbeddings) before creating nodes.
- Reuse existing dimensions when possible; avoid near-duplicate names.
- Keep node descriptions explicit: what this is + why it matters.
- Prefer updating existing nodes over creating duplicates for the same source URL.
- Add edges only when relationship direction and explanation are clear.
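
The search-first and update-over-duplicate checks above can be sketched roughly as follows. This is a minimal illustration over an in-memory list of node dicts, not the wiki-base API: the helper names (normalize, find_similar_dimension, upsert_node), the source_url field, and the 0.85 similarity threshold are all assumptions made for the example. In practice the lookup would go through queryNodes or searchContentEmbeddings.

```python
from difflib import SequenceMatcher


def normalize(name):
    """Lowercase and collapse separators so 'Paper Club' and 'paper-club' compare equal."""
    return " ".join(name.lower().replace("-", " ").replace("_", " ").split())


def find_similar_dimension(candidate, existing, threshold=0.85):
    """Return the closest existing dimension name, or None if nothing is close enough.

    The threshold is an illustrative assumption, not a documented value.
    """
    target = normalize(candidate)
    best, best_score = None, 0.0
    for name in existing:
        score = SequenceMatcher(None, target, normalize(name)).ratio()
        if score > best_score:
            best, best_score = name, score
    return best if best_score >= threshold else None


def upsert_node(nodes, new_node):
    """Update the node that shares new_node's source URL, else append a new one.

    'source_url' is a hypothetical field name. Returns 'updated' or 'created'.
    """
    url = new_node.get("source_url")
    if url:
        for node in nodes:
            if node.get("source_url") == url:
                node.update(new_node)
                return "updated"
    nodes.append(dict(new_node))
    return "created"
```

Matching on the source URL before appending keeps one node per source, and comparing normalized dimension names catches variants that differ only in case or separators.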
Event data checks
- Scheduled sessions are node_type='event' with metadata.event_status='scheduled'.
- Recordings are paper-club/builders-club nodes and should not be mixed with upcoming events.
- For event updates/cancellations, preserve ownership and status semantics.
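
The event checks above could be expressed as small predicates plus a guarded update. This is a hedged sketch: it assumes paper-club and builders-club are node_type values (the text does not say whether they are types or dimensions), and the owner key and the 'cancelled' status used in the usage note are hypothetical names chosen for illustration.

```python
# Assumption: recordings are identified by node_type; adjust if they are dimensions.
RECORDING_TYPES = {"paper-club", "builders-club"}


def is_scheduled_event(node):
    """True for nodes with node_type='event' and metadata.event_status='scheduled'."""
    return (
        node.get("node_type") == "event"
        and node.get("metadata", {}).get("event_status") == "scheduled"
    )


def is_recording(node):
    """True for paper-club/builders-club nodes, which are kept apart from upcoming events."""
    return node.get("node_type") in RECORDING_TYPES


def apply_event_update(node, metadata_changes, protected_keys=("owner",)):
    """Return a copy of node with metadata changes merged in.

    Keys listed in protected_keys (hypothetical ownership fields) are never
    overwritten, so an update or cancellation cannot silently change the owner.
    """
    updated = dict(node)
    metadata = dict(node.get("metadata", {}))
    for key, value in metadata_changes.items():
        if key in protected_keys:
            continue
        metadata[key] = value
    updated["metadata"] = metadata
    return updated
```

For example, calling apply_event_update(event, {"event_status": "cancelled", "owner": "bob"}) on an event owned by "alice" would change the status while leaving the owner as "alice", and the original dict is not mutated.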