Files
claudetools/projects/dataforth-dos/PARSING-FIDELITY-REPORT-2026-06-17.txt
Mike Swanson bbcde2be8e dataforth(datasheet): parsing-fidelity validation — all staged originals vs DB
Validated all 11,922 staged original .TXT datasheets against test_records.
0 genuine parse faults across 11,239 comparable records; mismatches all explained
(retests, reused serials, VAS format, legacy out-of-scope units). Adds the
validate-parsing.js tool, raw report, and verdict. Two follow-ups (NOT parse bugs):
608 staged units absent from DB (ingestion completeness), and same-day retests keep
the first run (ON CONFLICT strictly-greater-date).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-06-18 13:02:32 -07:00

35 lines
1.8 KiB
Plaintext

========== PARSING FIDELITY REPORT ==========
Staged .TXT files scanned : 11922
- no SN line (non-standard fmt): 1
- SN found / compared : 11921
- .TXT w/o 5 accuracy rows : 239
Unique SNs looked up in DB : 11811
SNs present in DB : 11239
EXPLAINED (not parsing faults):
Consistent (SN+model+date+5 error% match) : 11226
Retest, DB newer date than .TXT : 35
Retest same-day (stim matches, run differs): 42
VAS/single-point fmt (no 5-row block) : 5
Serial collision (generic SN, diff family): 2
NEEDS REVIEW (potential genuine issues):
Missing from DB (after hex-decode) : 608
Model variant mismatch (same family) : 2
DB OLDER than .TXT (stale DB?) : 1
GENUINE error% fault (stim ALSO differs) : 0
Accuracy-row-count diff : 0
COLLISION (informational) (first 20):
1-1: txt=SCM5B34-02 db=SCMVAS-M300
1-2: txt=SCM5B34-02 db=SCMVAS-M300
MODEL VARIANT MISMATCH (first 20):
A819-1: txt=8B35-01 db=8B36-04
A821-2: txt=8B35-04 db=8B36-01
DB OLDER THAN .TXT (first 20):
A821-1: txt=02-25-2026 db=2026-01-13
MISSING-FROM-DB (first 30): A243-1 (dec 10243-1), A243-2 (dec 10243-2), A244-1 (dec 10244-1), A255-1 (dec 10255-1), A255-2 (dec 10255-2), A276-1 (dec 10276-1), A276-2 (dec 10276-2), A328-1 (dec 10328-1), A328-2 (dec 10328-2), A376-1 (dec 10376-1), A376-2 (dec 10376-2), A376-3 (dec 10376-3), A377-1 (dec 10377-1), A377-2 (dec 10377-2), A377-3 (dec 10377-3), A405-1 (dec 10405-1), A405-2 (dec 10405-2), A405-3 (dec 10405-3), A405-4 (dec 10405-4), A417-1 (dec 10417-1), A417-2 (dec 10417-2), A561-1 (dec 10561-1), A561-2 (dec 10561-2), A561-3 (dec 10561-3), A561-4 (dec 10561-4), A561-5 (dec 10561-5), A561-6 (dec 10561-6), A601-1 (dec 10601-1), A601-2 (dec 10601-2), A602-1 (dec 10602-1)