Audio processor: validated voice profiling accuracy, tuned threshold

- Fine-grained speaker analysis (3s windows, 1s hop) across 42min episode - Host voice: 0.90-0.98 similarity (clear positive match) - Callers: 0.65-0.68 (correctly below threshold) - Produced audio/clips: 0.53-0.65 (correctly identified as non-host) - Co-host/other speakers: 0.56-0.62 (correctly identified) - Tuned host_match_threshold from 0.75 to 0.83 based on empirical data - Cross-referenced dips with transcript: correctly identifies callers, show intros, played audio clips, and station breaks - Batch transcription of 7 additional training episodes in progress Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-21 12:48:25 -07:00
parent 826141a319
commit 6cc9043b8e
228 changed files with 137641 additions and 1 deletions
--- a/projects/radio-show/audio-processor/test-data/output/detection-report.json
+++ b/projects/radio-show/audio-processor/test-data/output/detection-report.json
@@ -0,0 +1,19 @@
+{
+  "total_show_time": 2721.33225,
+  "total_commercial_time": 0,
+  "segments": [
+    {
+      "start": 0.0,
+      "end": 2721.33225,
+      "type": "show_content",
+      "confidence": 0.4250000000000017,
+      "label": "",
+      "signals": {
+        "fingerprint": 0.5,
+        "speaker": 0.5,
+        "audio_chars": 0.65,
+        "structural": 0.5
+      }
+    }
+  ]
+}
--- a/projects/radio-show/audio-processor/test-data/output/speaker-timeline.json
+++ b/projects/radio-show/audio-processor/test-data/output/speaker-timeline.json