Quality & Trust
Every record in the BantuNomics corpus is validated, certified, and auditable. Our three-line defense system ensures linguistic accuracy from generation through delivery.
Three Lines of Defense
Engine Validation
Pre-Recording
The BTS engine generates every form deterministically from the cartridge. Built-in constraint checks reject impossible morpheme combinations before any data leaves the system.
Acoustic Validation
Post-Recording
Automated pipeline measures F₀ contours, duration, spectral features, and compares against the engine's tonal predictions. Each recording receives a pass/fail per thesis.
Human Review
Peer Verification
Native speaker reviewers verify naturalness, intelligibility, and correctness. Critical records receive multiple independent reviews before certification.
The 7-Dimension Analysis Matrix
Every recording is analyzed across seven independent acoustic dimensions.
Validation Passport
Every record carries a machine-readable certificate documenting exactly which rules were verified.
MRS Audit
~130 stress-test records exercise every boundary case. Independently reviewed by frontier AI.
Gemini 2.5 Flash independently reviewed 132 stress-test records and found zero tonal logic errors.
LDR Certification
Mathematical proof of dataset internal consistency.
How It Works
The Linguistic Delta Report computes the variance (Δ) between MRS stress-test metrics and the same metrics across the full bulk dataset.
If the same rules are applied uniformly, the delta approaches zero. This is the mathematical "check engine light" for the entire corpus.