- Fine-grained speaker analysis (3s windows, 1s hop) across 42min episode - Host voice: 0.90-0.98 similarity (clear positive match) - Callers: 0.65-0.68 (correctly below threshold) - Produced audio/clips: 0.53-0.65 (correctly identified as non-host) - Co-host/other speakers: 0.56-0.62 (correctly identified) - Tuned host_match_threshold from 0.75 to 0.83 based on empirical data - Cross-referenced dips with transcript: correctly identifies callers, show intros, played audio clips, and station breaks - Batch transcription of 7 additional training episodes in progress Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
26 lines
561 B
JSON
26 lines
561 B
JSON
{
|
|
"Mike Swanson": {
|
|
"role": "host",
|
|
"num_samples": 180,
|
|
"source_episodes": [
|
|
"2010-10-02-hr1.mp3",
|
|
"2011-06-04-hr1.mp3",
|
|
"2011-09-10-hr1.mp3",
|
|
"2014-s6e05.mp3",
|
|
"2015-s7e30.mp3",
|
|
"2016-s8e42.mp3",
|
|
"2017-s9e26.mp3",
|
|
"2018-s10e17.mp3",
|
|
"2018-s10e21.mp3",
|
|
"2010-10-02-hr1.mp3",
|
|
"2011-06-04-hr1.mp3",
|
|
"2011-09-10-hr1.mp3",
|
|
"2014-s6e05.mp3",
|
|
"2015-s7e30.mp3",
|
|
"2016-s8e42.mp3",
|
|
"2017-s9e26.mp3",
|
|
"2018-s10e17.mp3",
|
|
"2018-s10e21.mp3"
|
|
]
|
|
}
|
|
} |