Documentation metadata is infrastructure, not decoration. This repository contains governance artifacts used to define, detect, and enforce edition- and tier-aware metadata in large documentation systems.
The focus is correctness, determinism, and drift prevention—not content generation.
This repo documents a detection-first DocsOps system for enforcing metadata contracts at scale, using Real-Time Customer Data Platform (RTCDP) as the concrete problem space.
It captures the minimum viable governance layer required to:
- Prevent uncontrolled metadata drift
- Support machine consumption of documentation
- Enable automation without breaking trust
- Keep human review where ambiguity exists
This is not a tool. It is a specification and enforcement model.
Large documentation platforms fail in predictable ways:
- Edition-specific content leaks into generic pages
- Tier entitlements are inferred instead of asserted
- Search and AI systems receive contradictory signals
- Metadata becomes editorial opinion instead of contract
Once this happens, automation amplifies the damage.
This repo shows how to lock the taxonomy first, then automate safely.
-
Detection before automation No inference. No ML. Only explicit signals.
-
Contracts over conventions If it isn't written down, it isn't allowed.
-
Additive enforcement only (MVP) Never remove metadata automatically.
-
Confidence-based decisions Automation is allowed only at high confidence.
-
Human review is a feature, not a failure
docsops-metadata-enforcement/
├── taxonomy/
│ ├── taxonomy-contract-rtcdp.md
│ └── feature-badge-mapping.md
│
├── signals/
│ └── hard-signal-dictionary.md
│
├── gold-set/
│ └── gold-list.csv
│
├── models/
│ └── confidence-model.md
│
├── contracts/
│ └── decision-output-schema.json
│
├── scope/
│ └── mvp-scope-guardrails.md
│
├── prompts/
│ └── prompt-a-analysis.md
│ └── prompt-b-dedupe-jira.md
│
└── README.md
I only committed artifacts that define constraints, contracts, or invariants. Execution details stay out of the repo until they stabilize."
-
Taxonomy Contract The authoritative definition of valid editions, tiers, and combinations.
-
Hard-Signal Dictionary A curated list of explicit, contract-faithful signals that may trigger metadata suggestions.
-
Gold Set A small, hand-labeled calibration set used to validate thresholds and reduce noise.
-
Confidence Model Defines when automation is allowed and when humans must intervene.
-
Decision Output Schema The machine-readable contract that downstream systems depend on.
- No machine learning
- No heuristics
- No editorial inference
- No UI rendering logic
- No marketing interpretation
- No roadmap speculation
Those belong elsewhere.
MVP / Case study
- Detection and reporting only
- Additive suggestions only
- Manual execution cadence
- Single reviewer model
Designed to be retired or re-scoped once a formal taxonomy system exists.
This work treats documentation metadata as infrastructure rather than content. By locking a taxonomy contract and enforcing it through explicit, confidence-based detection, we prevent metadata drift, protect search and AI consumers, and enable safe automation without breaking user trust. This approach reduces long-term maintenance cost and avoids scaling editorial inconsistencies into system-level failures.
Documentation is increasingly consumed by:
- Search systems
- AI assistants
- Programmatic integrations
- Internal tooling
If metadata is wrong, everything downstream is wrong faster.
This repository demonstrates how to prevent that—before automation makes it irreversible.