Audio processor: validated voice profiling accuracy, tuned threshold

- Fine-grained speaker analysis (3s windows, 1s hop) across 42min episode
- Host voice: 0.90-0.98 similarity (clear positive match)
- Callers: 0.65-0.68 (correctly below threshold)
- Produced audio/clips: 0.53-0.65 (correctly identified as non-host)
- Co-host/other speakers: 0.56-0.62 (correctly identified)
- Tuned host_match_threshold from 0.75 to 0.83 based on empirical data
- Cross-referenced dips with transcript: correctly identifies callers,
  show intros, played audio clips, and station breaks
- Batch transcription of 7 additional training episodes in progress

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
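
For reference, the 3s window / 1s hop scheme mentioned in the message can be sketched roughly as below. This is an illustration only: the function name `sliding_windows` is hypothetical, and dropping a trailing partial window is an assumption, since the commit does not say how the audio processor handles the end of the episode.

```python
def sliding_windows(duration_s, window_s=3.0, hop_s=1.0):
    """Return (start, end) times in seconds for every full analysis window.

    Assumption: a trailing window that would run past the end of the
    audio is dropped rather than padded.
    """
    if window_s <= 0 or hop_s <= 0:
        raise ValueError("window_s and hop_s must be positive")
    windows = []
    index = 0
    # Multiply by the index instead of accumulating the hop, so that
    # floating-point error does not build up over a long episode.
    while index * hop_s + window_s <= duration_s:
        start = index * hop_s
        windows.append((start, start + window_s))
        index += 1
    return windows


if __name__ == "__main__":
    # A 42-minute episode, taken as exactly 42 * 60 seconds for illustration.
    episode = sliding_windows(42 * 60)
    print(len(episode), episode[0], episode[-1])
```

With a 1s hop, consecutive windows overlap by 2s, so each second of audio away from the episode edges is scored by three windows.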
@@ -39,7 +39,7 @@ diarization:
   min_speakers: 1
   max_speakers: 6
   voice_profiles_dir: "voice-profiles/"
-  host_match_threshold: 0.75
+  host_match_threshold: 0.83
 
 llm:
   model: "qwen3:14b"
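
As a rough sanity check on the new value, the sketch below tests whether a threshold falls in the gap between the reported host scores and every other reported category. The similarity ranges are copied from the commit message. Everything else is an assumption made for illustration: the helper names are hypothetical, cosine similarity is assumed as the metric (the commit does not name one), and a score equal to the threshold is assumed to count as a match.

```python
import math

HOST_MATCH_THRESHOLD = 0.83  # value set in this commit

# Similarity ranges quoted in the commit message, as (low, high).
REPORTED_RANGES = {
    "host": (0.90, 0.98),
    "callers": (0.65, 0.68),
    "produced audio/clips": (0.53, 0.65),
    "co-host/other speakers": (0.56, 0.62),
}


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (assumed metric)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / norm


def is_host(similarity, threshold=HOST_MATCH_THRESHOLD):
    """Assumption: a score equal to the threshold counts as a host match."""
    return similarity >= threshold


def threshold_separates(ranges, threshold):
    """True if the threshold sits above every non-host range and at or
    below the lowest reported host score."""
    host_low, _ = ranges["host"]
    others_high = max(
        high for name, (_, high) in ranges.items() if name != "host"
    )
    return others_high < threshold <= host_low


if __name__ == "__main__":
    print(threshold_separates(REPORTED_RANGES, HOST_MATCH_THRESHOLD))
    print(is_host(cosine_similarity([1.0, 0.0], [1.0, 0.0])))
```

On the reported numbers, 0.83 sits 0.15 above the highest non-host score (0.68) and 0.07 below the lowest host score (0.90).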