Knowledge Base

A structured collection of information organized for retrieval, reasoning, and reuse — the foundation of any system that needs to "know" something.

A knowledge base is a structured collection of information — facts, concepts, relationships, procedures — organized for retrieval, reasoning, and reuse. The term is used in many fields: customer support (FAQ-style KBs), enterprise IT (internal documentation), AI (training data, RAG indexes), and personal knowledge management (notes, second brains).

Anatomy of a knowledge base

Most KBs share a common structure:

The schema is the key decision. A loose schema (free-form notes) maximizes flexibility but degrades into a search problem. A strict schema (typed fields, controlled vocabularies) makes retrieval reliable but slows down writing.

Personal vs organizational

DimensionPersonal KBOrganizational KB
AuthorOne personMany contributors
SchemaLoose, opinionatedStandardized, controlled
Update cadenceDailyContinuous, often with review
Trust modelSelfRoles, permissions, audit
Search”What did I read about X?""What’s the policy on Y?”

This site is a personal knowledge base. The schema is intentionally flexible (see schema.md for the source of truth), and the user is the only author and consumer.

How LLMs change the equation

The most important shift in KB design in the last few years is the introduction of embeddings and Retrieval-Augmented Generation. A modern KB is often:

  1. Markdown files for the canonical content (human-readable, version-controlled, portable).
  2. Vector index for semantic search (find by meaning, not just keywords).
  3. LLM as the consumption interface (ask questions in natural language; the KB retrieves and synthesizes).

This stack — markdown + embeddings + LLM — is the pattern Karpathy’s LLM Wiki describes, and the one that this site is built on. The KB becomes a knowledge store for an AI, not just for a human.

Common failure modes

Open vs closed

A personal KB is private by default. But many personal KBs are semi-open — published as a digital garden or Obsidian Publish-style site. The benefits:

The costs:

This site is public. See the about page for what is and isn’t shared.

See also