Files
claudetools/projects/radio-show/audio-processor/test-data/transcripts/2016-s8e43/diarization.json
Mike Swanson a4f527f31e radio: per-year test set (one episode per year, 2010-2018)
Added 2010, 2015, 2018 test episodes to round out the test set to one
per available year:
- 2010-05-08-hr1 (May 2010, earliest available; pre-Tara era)
- 2015-s7e19 (Jan 2015, avoids training's s7e30)
- 2018-s10e18 (only 3 non-training 2018 episodes exist)

Archive has no 2019 directory — Rob's "2018/2019 appearances" are
constrained to the 5 available 2018 episodes only.

Per-year diarization summary (Tara presence, post-rename):
  2010-05-08    30s   1.2%   likely false positive (pre-Tara)
  2011-03-12   140s   5.6%   likely false positive (call-in only)
  2012-03-10    30s   1.1%   likely false positive (call-in only)
  2012-06-09   340s  12.8%   suspicious — Mike to confirm
  2014-s6e19   680s  23.3%   confirmed
  2015-s7e19   280s   9.9%   plausible — Mike to confirm
  2016-s8e43  1890s  35.5%   confirmed
  2017-s9e30   610s  11.4%   plausible
  2018-s10e18  880s  17.1%   COULD BE ROB — Mike flagged Rob for
                              2018/2019 appearances; cosine threshold may
                              be hitting on Rob being acoustically similar
                              to Tara

Total Tara across 9 episodes: 1h 21m / 8h 52m audio (15.3%).

Q&A counts (still suspect — every voice that isn't Mike-or-Tara is
labeled CALLER, so Randall/Rob/producers inflate the bucket):
  2010=4, 2011=1, 2012a=2, 2012b=0, 2014=0, 2015=1, 2016=2, 2017=4, 2018=3
  Total: 17 pairs across 9 episodes

4090 perf on the expanded set:
- Diarization: 31928s in 121.5s = 262.7x realtime (vs 209.7x on 5070 Ti, +25.3%)
- Transcription (3 new episodes only): 10554s in 112.4s = 93.9x

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-27 15:20:09 -07:00

814 lines
15 KiB
JSON

{
"num_speakers": 3,
"speaker_map": {
"CALLER": "CALLER",
"HOST": "HOST",
"CO-HOST": "CO-HOST"
},
"turns": [
{
"speaker": "CO-HOST",
"start": 0.0,
"end": 40.0,
"confidence": 0.96
},
{
"speaker": "HOST",
"start": 35.0,
"end": 55.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 50.0,
"end": 60.0,
"confidence": 0.96
},
{
"speaker": "HOST",
"start": 55.0,
"end": 100.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 95.0,
"end": 120.0,
"confidence": 0.9
},
{
"speaker": "HOST",
"start": 115.0,
"end": 140.0,
"confidence": 0.94
},
{
"speaker": "CO-HOST",
"start": 135.0,
"end": 160.0,
"confidence": 0.97
},
{
"speaker": "HOST",
"start": 155.0,
"end": 175.0,
"confidence": 0.96
},
{
"speaker": "CALLER",
"start": 170.0,
"end": 185.0,
"confidence": 0.77
},
{
"speaker": "HOST",
"start": 180.0,
"end": 275.0,
"confidence": 0.87
},
{
"speaker": "CALLER",
"start": 270.0,
"end": 295.0,
"confidence": 0.82
},
{
"speaker": "HOST",
"start": 290.0,
"end": 345.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 340.0,
"end": 355.0,
"confidence": 0.81
},
{
"speaker": "HOST",
"start": 350.0,
"end": 370.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 365.0,
"end": 380.0,
"confidence": 0.94
},
{
"speaker": "HOST",
"start": 375.0,
"end": 525.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 520.0,
"end": 535.0,
"confidence": 0.99
},
{
"speaker": "HOST",
"start": 530.0,
"end": 550.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 545.0,
"end": 555.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 550.0,
"end": 580.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 575.0,
"end": 600.0,
"confidence": 0.96
},
{
"speaker": "CALLER",
"start": 595.0,
"end": 605.0,
"confidence": 0.84
},
{
"speaker": "HOST",
"start": 600.0,
"end": 1055.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 1050.0,
"end": 1060.0,
"confidence": 0.93
},
{
"speaker": "HOST",
"start": 1055.0,
"end": 1190.0,
"confidence": 0.99
},
{
"speaker": "CO-HOST",
"start": 1185.0,
"end": 1215.0,
"confidence": 0.98
},
{
"speaker": "CALLER",
"start": 1210.0,
"end": 1220.0,
"confidence": 0.8
},
{
"speaker": "CO-HOST",
"start": 1215.0,
"end": 1235.0,
"confidence": 0.86
},
{
"speaker": "HOST",
"start": 1230.0,
"end": 1295.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 1290.0,
"end": 1300.0,
"confidence": 0.95
},
{
"speaker": "HOST",
"start": 1295.0,
"end": 1335.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 1330.0,
"end": 1345.0,
"confidence": 0.94
},
{
"speaker": "HOST",
"start": 1340.0,
"end": 1380.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 1375.0,
"end": 1395.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 1390.0,
"end": 1410.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 1405.0,
"end": 1415.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 1410.0,
"end": 1795.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 1790.0,
"end": 1875.0,
"confidence": 0.99
},
{
"speaker": "HOST",
"start": 1870.0,
"end": 1950.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 1945.0,
"end": 1960.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 1955.0,
"end": 2025.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 2020.0,
"end": 2055.0,
"confidence": 0.92
},
{
"speaker": "HOST",
"start": 2050.0,
"end": 2105.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 2100.0,
"end": 2115.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 2110.0,
"end": 2145.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 2140.0,
"end": 2150.0,
"confidence": 0.97
},
{
"speaker": "HOST",
"start": 2145.0,
"end": 2235.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 2230.0,
"end": 2240.0,
"confidence": 0.93
},
{
"speaker": "HOST",
"start": 2235.0,
"end": 2270.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 2265.0,
"end": 2280.0,
"confidence": 0.95
},
{
"speaker": "HOST",
"start": 2275.0,
"end": 2285.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 2280.0,
"end": 2290.0,
"confidence": 0.92
},
{
"speaker": "HOST",
"start": 2285.0,
"end": 2340.0,
"confidence": 0.89
},
{
"speaker": "CO-HOST",
"start": 2335.0,
"end": 2360.0,
"confidence": 0.94
},
{
"speaker": "HOST",
"start": 2355.0,
"end": 2375.0,
"confidence": 0.94
},
{
"speaker": "CO-HOST",
"start": 2370.0,
"end": 2410.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 2405.0,
"end": 2415.0,
"confidence": 0.93
},
{
"speaker": "CO-HOST",
"start": 2410.0,
"end": 2425.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 2420.0,
"end": 2545.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 2540.0,
"end": 2550.0,
"confidence": 0.97
},
{
"speaker": "HOST",
"start": 2545.0,
"end": 2610.0,
"confidence": 0.93
},
{
"speaker": "CO-HOST",
"start": 2605.0,
"end": 2620.0,
"confidence": 0.93
},
{
"speaker": "HOST",
"start": 2615.0,
"end": 2680.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 2675.0,
"end": 2685.0,
"confidence": 0.9
},
{
"speaker": "HOST",
"start": 2680.0,
"end": 2690.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 2685.0,
"end": 2770.0,
"confidence": 0.95
},
{
"speaker": "HOST",
"start": 2765.0,
"end": 2795.0,
"confidence": 0.9
},
{
"speaker": "CO-HOST",
"start": 2790.0,
"end": 2820.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 2815.0,
"end": 2855.0,
"confidence": 0.91
},
{
"speaker": "CO-HOST",
"start": 2850.0,
"end": 2860.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 2855.0,
"end": 2865.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 2860.0,
"end": 2885.0,
"confidence": 0.91
},
{
"speaker": "HOST",
"start": 2880.0,
"end": 2905.0,
"confidence": 0.95
},
{
"speaker": "CO-HOST",
"start": 2900.0,
"end": 2935.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 2930.0,
"end": 2995.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 2990.0,
"end": 3010.0,
"confidence": 0.97
},
{
"speaker": "HOST",
"start": 3005.0,
"end": 3035.0,
"confidence": 0.95
},
{
"speaker": "CO-HOST",
"start": 3030.0,
"end": 3060.0,
"confidence": 0.97
},
{
"speaker": "HOST",
"start": 3055.0,
"end": 3140.0,
"confidence": 0.95
},
{
"speaker": "CO-HOST",
"start": 3135.0,
"end": 3150.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 3145.0,
"end": 3205.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 3200.0,
"end": 3210.0,
"confidence": 0.89
},
{
"speaker": "HOST",
"start": 3205.0,
"end": 3220.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 3215.0,
"end": 3225.0,
"confidence": 0.96
},
{
"speaker": "HOST",
"start": 3220.0,
"end": 3230.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 3225.0,
"end": 3260.0,
"confidence": 0.91
},
{
"speaker": "HOST",
"start": 3255.0,
"end": 3270.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 3265.0,
"end": 3275.0,
"confidence": 0.97
},
{
"speaker": "HOST",
"start": 3270.0,
"end": 3350.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 3345.0,
"end": 3375.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 3370.0,
"end": 3395.0,
"confidence": 0.94
},
{
"speaker": "CO-HOST",
"start": 3390.0,
"end": 3435.0,
"confidence": 0.85
},
{
"speaker": "HOST",
"start": 3430.0,
"end": 3970.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 3965.0,
"end": 3980.0,
"confidence": 0.96
},
{
"speaker": "HOST",
"start": 3975.0,
"end": 3990.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 3985.0,
"end": 4025.0,
"confidence": 0.86
},
{
"speaker": "CALLER",
"start": 4020.0,
"end": 4030.0,
"confidence": 0.85
},
{
"speaker": "HOST",
"start": 4025.0,
"end": 4050.0,
"confidence": 0.92
},
{
"speaker": "CO-HOST",
"start": 4045.0,
"end": 4055.0,
"confidence": 0.93
},
{
"speaker": "HOST",
"start": 4050.0,
"end": 4095.0,
"confidence": 0.87
},
{
"speaker": "CO-HOST",
"start": 4090.0,
"end": 4100.0,
"confidence": 0.92
},
{
"speaker": "HOST",
"start": 4095.0,
"end": 4190.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 4185.0,
"end": 4200.0,
"confidence": 0.93
},
{
"speaker": "HOST",
"start": 4195.0,
"end": 4215.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 4210.0,
"end": 4225.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 4220.0,
"end": 4240.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 4235.0,
"end": 4250.0,
"confidence": 0.99
},
{
"speaker": "HOST",
"start": 4245.0,
"end": 4385.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 4380.0,
"end": 4400.0,
"confidence": 0.99
},
{
"speaker": "HOST",
"start": 4395.0,
"end": 4425.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 4420.0,
"end": 4435.0,
"confidence": 0.99
},
{
"speaker": "HOST",
"start": 4430.0,
"end": 4440.0,
"confidence": 0.95
},
{
"speaker": "CO-HOST",
"start": 4435.0,
"end": 4445.0,
"confidence": 0.91
},
{
"speaker": "HOST",
"start": 4440.0,
"end": 4460.0,
"confidence": 0.98
},
{
"speaker": "CO-HOST",
"start": 4455.0,
"end": 4465.0,
"confidence": 0.94
},
{
"speaker": "HOST",
"start": 4460.0,
"end": 4515.0,
"confidence": 0.94
},
{
"speaker": "CO-HOST",
"start": 4510.0,
"end": 4525.0,
"confidence": 0.97
},
{
"speaker": "HOST",
"start": 4520.0,
"end": 4570.0,
"confidence": 0.96
},
{
"speaker": "CO-HOST",
"start": 4565.0,
"end": 4580.0,
"confidence": 0.94
},
{
"speaker": "HOST",
"start": 4575.0,
"end": 4680.0,
"confidence": 0.97
},
{
"speaker": "CO-HOST",
"start": 4675.0,
"end": 4715.0,
"confidence": 0.92
},
{
"speaker": "HOST",
"start": 4710.0,
"end": 4790.0,
"confidence": 0.97
},
{
"speaker": "CALLER",
"start": 4785.0,
"end": 4795.0,
"confidence": 0.83
},
{
"speaker": "HOST",
"start": 4790.0,
"end": 4805.0,
"confidence": 0.89
},
{
"speaker": "CALLER",
"start": 4800.0,
"end": 4815.0,
"confidence": 0.79
},
{
"speaker": "HOST",
"start": 4810.0,
"end": 4935.0,
"confidence": 0.92
},
{
"speaker": "CALLER",
"start": 4930.0,
"end": 4940.0,
"confidence": 0.82
},
{
"speaker": "HOST",
"start": 4935.0,
"end": 4945.0,
"confidence": 0.86
},
{
"speaker": "CALLER",
"start": 4940.0,
"end": 4950.0,
"confidence": 0.82
},
{
"speaker": "HOST",
"start": 4945.0,
"end": 5080.0,
"confidence": 0.86
},
{
"speaker": "CALLER",
"start": 5075.0,
"end": 5085.0,
"confidence": 0.85
},
{
"speaker": "HOST",
"start": 5080.0,
"end": 5220.0,
"confidence": 0.86
},
{
"speaker": "CO-HOST",
"start": 5215.0,
"end": 5225.0,
"confidence": 0.98
},
{
"speaker": "HOST",
"start": 5220.0,
"end": 5325.0,
"confidence": 0.95
}
]
}