Files
claudetools/projects/radio-show/audio-processor/test-data/transcripts/2017-s9e30/diarization.json
Mike Swanson fb683d6a05 radio: rename Tom -> Tara, expand speaker roster
Mike confirmed there is no co-host named "Tom" — the voice in 2014-s6e19
and 2016-s8e43 is Tara. The 5070 Ti session fabricated the Tom identity.
The voice profile itself (44 embeddings, 0.698 cosine vs Mike) is correct;
only the human label was wrong.

Rename swept:
- voice-profiles/tom/ -> voice-profiles/tara/ (git mv preserves all .npy)
- voice-profiles/profiles.json: "Tom" key -> "Tara"
- build_cohost_profile.py: TOM_WINDOWS -> TARA_WINDOWS, COHOST_NAME, comments
- 2026-04-27-qa-extraction-cohost-indexing.md: correction header + body sweep
- 2026-04-27-4090-benchmark-and-test-set.md: closure note
- .claude/memory/radio_show_no_cohost_named_tom.md: resolution + speaker roster

Diarization re-run after rename so speaker_map emits "Cohost: Tara".
Q&A counts unchanged (rename is label-only): 9 pairs across 6 test episodes.

Tara distribution from the post-rename diarization (per-episode % of audio):
  2011-03-12-hr1   140s   5.6%   likely false positive (call-in only)
  2012-03-10-hr1    30s   1.1%   likely false positive (call-in only)
  2012-06-09-hr1   340s  12.8%   suspicious — pending Mike confirm
  2014-s6e19       680s  23.3%   confirmed
  2016-s8e43      1890s  35.5%   confirmed
  2017-s9e30       610s  11.4%   plausible — pending Mike confirm

Broader speaker-roster context Mike provided this session (saved to
memory): the show has had multiple co-hosts (Tara, Randall, Rob) plus
producers/board ops (Andrew, Shannon, Ken, others) who would sometimes
go on-air. Only Tara has a profile so far. Every other speaker is
currently labeled CALLER, which means small CO-HOST attributions in
unexpected episodes (e.g. 2011/2012) may actually be a producer rather
than a false positive — Mike to spot-check.

Action item before full-archive run: build profiles for Randall, Rob,
and the named producers to avoid systematic Q&A false positives in
early-years and 2018/2019 episodes.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-27 15:11:03 -07:00

424 lines
7.8 KiB
JSON

{
"num_speakers": 3,
"speaker_map": {
"HOST": "HOST",
"CO-HOST": "CO-HOST",
"CALLER": "CALLER"
},
"turns": [
{
"speaker": "HOST",
"start": 0.0,
"end": 20.0,
"confidence": 0.88
},
{
"speaker": "CO-HOST",
"start": 15.0,
"end": 25.0,
"confidence": 0.92
},
{
"speaker": "HOST",
"start": 20.0,
"end": 525.0,
"confidence": 0.98
},
{
"speaker": "CALLER",
"start": 520.0,
"end": 540.0,
"confidence": 0.81
},
{
"speaker": "HOST",
"start": 535.0,
"end": 550.0,
"confidence": 0.98
},
{
"speaker": "CALLER",
"start": 545.0,
"end": 555.0,
"confidence": 0.81
},
{
"speaker": "HOST",
"start": 550.0,
"end": 580.0,
"confidence": 0.89
},
{
"speaker": "CALLER",
"start": 575.0,
"end": 585.0,
"confidence": 0.8
},
{
"speaker": "HOST",
"start": 580.0,
"end": 615.0,
"confidence": 0.98
},
{
"speaker": "CALLER",
"start": 610.0,
"end": 620.0,
"confidence": 0.84
},
{
"speaker": "HOST",
"start": 615.0,
"end": 730.0,
"confidence": 0.89
},
{
"speaker": "CO-HOST",
"start": 725.0,
"end": 770.0,
"confidence": 0.91
},
{
"speaker": "HOST",
"start": 765.0,
"end": 870.0,
"confidence": 0.98
},
{
"speaker": "CALLER",
"start": 865.0,
"end": 875.0,
"confidence": 0.83
},
{
"speaker": "HOST",
"start": 870.0,
"end": 1295.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 1290.0,
"end": 1305.0,
"confidence": 0.74
},
{
"speaker": "CO-HOST",
"start": 1300.0,
"end": 1310.0,
"confidence": 0.86
},
{
"speaker": "CALLER",
"start": 1305.0,
"end": 1315.0,
"confidence": 0.82
},
{
"speaker": "CO-HOST",
"start": 1310.0,
"end": 1355.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 1350.0,
"end": 1360.0,
"confidence": 0.95
},
{
"speaker": "CO-HOST",
"start": 1355.0,
"end": 1365.0,
"confidence": 0.87
},
{
"speaker": "CALLER",
"start": 1360.0,
"end": 1370.0,
"confidence": 0.83
},
{
"speaker": "CO-HOST",
"start": 1365.0,
"end": 1395.0,
"confidence": 0.86
},
{
"speaker": "CALLER",
"start": 1390.0,
"end": 1400.0,
"confidence": 0.84
},
{
"speaker": "HOST",
"start": 1395.0,
"end": 1415.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 1410.0,
"end": 1425.0,
"confidence": 0.9
},
{
"speaker": "HOST",
"start": 1420.0,
"end": 1430.0,
"confidence": 0.94
},
{
"speaker": "CALLER",
"start": 1425.0,
"end": 1435.0,
"confidence": 0.82
},
{
"speaker": "HOST",
"start": 1430.0,
"end": 1445.0,
"confidence": 0.91
},
{
"speaker": "CO-HOST",
"start": 1440.0,
"end": 1465.0,
"confidence": 0.9
},
{
"speaker": "HOST",
"start": 1460.0,
"end": 2130.0,
"confidence": 0.88
},
{
"speaker": "CALLER",
"start": 2125.0,
"end": 2135.0,
"confidence": 0.78
},
{
"speaker": "CO-HOST",
"start": 2130.0,
"end": 2175.0,
"confidence": 0.86
},
{
"speaker": "HOST",
"start": 2170.0,
"end": 2650.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 2645.0,
"end": 2655.0,
"confidence": 0.85
},
{
"speaker": "HOST",
"start": 2650.0,
"end": 2725.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 2720.0,
"end": 2730.0,
"confidence": 0.89
},
{
"speaker": "HOST",
"start": 2725.0,
"end": 2995.0,
"confidence": 0.91
},
{
"speaker": "CO-HOST",
"start": 2990.0,
"end": 3005.0,
"confidence": 0.95
},
{
"speaker": "CALLER",
"start": 3000.0,
"end": 3020.0,
"confidence": 0.81
},
{
"speaker": "HOST",
"start": 3015.0,
"end": 3175.0,
"confidence": 0.92
},
{
"speaker": "CO-HOST",
"start": 3170.0,
"end": 3180.0,
"confidence": 0.91
},
{
"speaker": "HOST",
"start": 3175.0,
"end": 3375.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 3370.0,
"end": 3380.0,
"confidence": 0.85
},
{
"speaker": "CO-HOST",
"start": 3375.0,
"end": 3410.0,
"confidence": 0.91
},
{
"speaker": "CALLER",
"start": 3405.0,
"end": 3415.0,
"confidence": 0.84
},
{
"speaker": "HOST",
"start": 3410.0,
"end": 4185.0,
"confidence": 0.96
},
{
"speaker": "CALLER",
"start": 4180.0,
"end": 4245.0,
"confidence": 0.8
},
{
"speaker": "HOST",
"start": 4240.0,
"end": 4265.0,
"confidence": 0.91
},
{
"speaker": "CALLER",
"start": 4260.0,
"end": 4280.0,
"confidence": 0.84
},
{
"speaker": "HOST",
"start": 4275.0,
"end": 4290.0,
"confidence": 0.95
},
{
"speaker": "CALLER",
"start": 4285.0,
"end": 4295.0,
"confidence": 0.82
},
{
"speaker": "HOST",
"start": 4290.0,
"end": 4325.0,
"confidence": 0.86
},
{
"speaker": "CALLER",
"start": 4320.0,
"end": 4335.0,
"confidence": 0.79
},
{
"speaker": "HOST",
"start": 4330.0,
"end": 4370.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 4365.0,
"end": 4380.0,
"confidence": 0.81
},
{
"speaker": "HOST",
"start": 4375.0,
"end": 4405.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 4400.0,
"end": 4415.0,
"confidence": 0.82
},
{
"speaker": "HOST",
"start": 4410.0,
"end": 4420.0,
"confidence": 0.85
},
{
"speaker": "CALLER",
"start": 4415.0,
"end": 4430.0,
"confidence": 0.84
},
{
"speaker": "HOST",
"start": 4425.0,
"end": 4525.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 4520.0,
"end": 4530.0,
"confidence": 0.81
},
{
"speaker": "HOST",
"start": 4525.0,
"end": 4555.0,
"confidence": 0.89
},
{
"speaker": "CO-HOST",
"start": 4550.0,
"end": 4595.0,
"confidence": 0.89
},
{
"speaker": "HOST",
"start": 4590.0,
"end": 5285.0,
"confidence": 0.95
},
{
"speaker": "CO-HOST",
"start": 5280.0,
"end": 5300.0,
"confidence": 0.94
},
{
"speaker": "CALLER",
"start": 5295.0,
"end": 5305.0,
"confidence": 0.83
},
{
"speaker": "CO-HOST",
"start": 5300.0,
"end": 5315.0,
"confidence": 0.91
},
{
"speaker": "HOST",
"start": 5310.0,
"end": 5340.0,
"confidence": 0.97
}
]
}