Playbook: Knowledge Base Health¶

What it is¶

Knowledge Base Health is a set of operational procedures and automated checks designed to ensure the repository remains accurate, up-to-date, and discoverable. It combines periodic manual audits with continuous integration (CI) quality gates.

What problem it solves¶

In a rapidly evolving technical environment, documentation quickly becomes stale or fragmented. This playbook prevents "documentation rot" by establishing clear ownership, a structured review cadence, and automated enforcement of formatting standards, ensuring users can always trust the information in the repository.

Where it fits in the stack¶

This playbook belongs to the Governance and Maintenance layer. It provides the "metabolic process" that keeps the KnowledgeOps system healthy and prevents entropy from degrading the quality of the shared knowledge base.

Typical use cases¶

Performing a weekly quality check to identify non-compliant documents.
Managing the intake of new tools and patterns from community sources.
Cleaning up obsolete information and ensuring model names are current.
Maintaining consistency between the filesystem, the mkdocs.yml navigation, and the data/all_tools.json catalog.

Strengths¶

Multi-Layered Enforcement: Combines automated PR gates with deeper, periodic manual/automated audits.
Traceability: Uses Last reviewed and Confidence metadata to provide clear signals to users about data reliability.
Automated Discovery: Includes scripts to detect "drift" between user stars/activity and what is actually documented.

Limitations¶

Maintenance Overhead: Requires dedicated effort to fix the issues identified by the audit scripts.
Script Dependency: The health of the system relies on the accuracy and maintenance of the underlying Python scripts.

When to use it¶

During every pull request to ensure immediate compliance.
Once a week to perform a broader scan of the entire repository.
When onboarding new automated agents to define their quality boundaries.

When not to use it¶

For very small, personal repositories where formal governance processes are overkill.
For temporary "scratchpad" notes that are not intended to be part of the canonical knowledge base.

Objective¶

Maintain content quality, freshness, and discoverability across the knowledge base through regular audits and automated checks.

flowchart TD
    A[Start Audit] --> B[Run audit_docs_quality.py]
    B --> C{Issues Found?}
    C -- No --> D[Check Staleness]
    C -- Yes --> E[Fix Priority Issues]
    E --> F[Update data/all_tools.json]
    F --> G[Verify Nav/Index Consistency]
    G --> D
    D --> H[Update Review Dates]
    H --> I[End Audit]

Pre-requisites¶

Quality audit script (scripts/audit_docs_quality.py)
Docs contract checker (scripts/check_docs_contract.py)
Catalog consistency checker (scripts/check_catalog_consistency.py)
Doc freshness checker (scripts/check_doc_freshness.py)
API pricing summary generator (scripts/update_api_pricing_capability_summary.py)
Model account policy validator (scripts/validate_model_account_pool.py)
Starred repo intake checker (scripts/check_starred_repo_intake.py)
Standards reference

Review cadence¶

Check	Frequency	Owner	How
Intake queue (`new-sources/`)	Daily	Jules (automated)	`daily-jules-maintenance.yml` opens a structured issue
Doc contract CI gate	Every PR	CI	`docs-quality-gates.yml` runs `check_docs_contract.py`
Catalog consistency CI gate	Every PR	CI	`catalog-quality-gates.yml` runs `check_catalog_consistency.py`
Generated pricing summary gate	PRs touching pricing tracker	CI	`generated-content-gates.yml` runs summary sync + freshness checks
API pricing maintenance	Weekly	CI	`api-pricing-maintenance.yml` refreshes capacity summaries and flags stale review metadata
External link health	Weekly + docs PRs	CI	`docs-link-health.yml` (Lychee) checks markdown links
Model-account routing policy gate	PRs touching policy file	CI	`model-account-policy-gates.yml` validates multi-account routing rules
Vikunja bidirectional sync	Disabled	Maintainer	Workflow intentionally removed from CI until the integration is reviewed and re-enabled safely
Full quality audit	Weekly (manual)	Maintainer	`python3 scripts/audit_docs_quality.py`
Starred repo drift check	Weekly (manual/local)	Maintainer	`python3 scripts/check_starred_repo_intake.py --ai-only --min-stars 5000`
Staleness review (docs >90 days old)	Monthly	Maintainer	See "Staleness check" below
Taxonomy alignment	Quarterly	Maintainer	Verify category dirs match `standards.md`

Step-by-step: weekly quality audit¶

Run the audit script:
```
python3 scripts/audit_docs_quality.py
```
Review the output:
- Legacy-format docs: prioritise upgrading high-traffic pages first (tools you reference in playbooks or architecture docs).
- Missing metadata: add Last reviewed / Confidence / Sources blocks.
- Per-category breakdown: identify categories with the lowest compliance rate.
Fix the top 5 issues: focus on the docs that will fail CI the next time they are touched.
Update data/all_tools.json: ensure every page in mkdocs.yml nav has a matching entry.
Verify nav ↔ index consistency: each index.md in docs/tools/*/ should list all sibling tool pages.

Check starred-repo drift:

python3 scripts/check_starred_repo_intake.py --ai-only --min-stars 5000

If repos are missing, add them to today's intake log before documenting them.

Step-by-step: staleness check¶

Find docs not reviewed in 90+ days:

grep -rl "Last reviewed:" docs/ | xargs grep "Last reviewed:" | \
  awk -F': ' '{print $NF, $1}' | sort | head -20

For each stale doc, decide:
- Still accurate — update the Last reviewed date.
- Needs refresh — update content and bump the date.
- Obsolete — remove from mkdocs.yml, data/all_tools.json, and the category index.md.

Quality metrics¶

Track these over time to measure knowledge base health:

Metric	Target	How to measure
Template compliance rate	>90%	`audit_docs_quality.py` → compliant / total
Legacy-format docs remaining	0	`audit_docs_quality.py` → legacy count
Docs with metadata	100%	`audit_docs_quality.py` → missing metadata count
Average doc age (days since last review)	<60 days	`grep "Last reviewed"` across all docs
Catalog consistency (nav ↔ JSON)	100%	`check_catalog_consistency.py` exit code
Intake queue backlog	0 `new` items	`grep "new" docs/new-sources/*.md`

Common failure modes¶

Category index out of sync: a new tool doc is added to mkdocs.yml but not to its index.md. Fix: update the index file whenever adding a nav entry.
Orphaned JSON entries: a tool page is deleted but its all_tools.json entry remains. Fix: always update both when removing a page.
Duplicate pages: two pages document the same tool. Fix: merge into the canonical page and redirect/remove the duplicate.
Stale model references: docs reference old model names (e.g., "Claude 4.6" instead of "Claude 4.7"). Fix: search-and-replace during staleness reviews.
Starred-repo drift: you star new GitHub repos but never stage them into docs/new-sources/. Fix: run check_starred_repo_intake.py locally and append any real candidates to today's daily log.

Sources / references¶

Contribution Metadata¶

Last reviewed: 2026-06-07
Confidence: high