Added 2010, 2015, 2018 test episodes to round out the test set to one
per available year:
- 2010-05-08-hr1 (May 2010, earliest available; pre-Tara era)
- 2015-s7e19 (Jan 2015, avoids training's s7e30)
- 2018-s10e18 (only 3 non-training 2018 episodes exist)
Archive has no 2019 directory — Rob's "2018/2019 appearances" are
constrained to the 5 available 2018 episodes only.
Per-year diarization summary (Tara presence, post-rename):
  Episode        Tara   Share  Assessment
  2010-05-08      30s    1.2%  likely false positive (pre-Tara)
  2011-03-12     140s    5.6%  likely false positive (call-in only)
  2012-03-10      30s    1.1%  likely false positive (call-in only)
  2012-06-09     340s   12.8%  suspicious; Mike to confirm
  2014-s6e19     680s   23.3%  confirmed
  2015-s7e19     280s    9.9%  plausible; Mike to confirm
  2016-s8e43    1890s   35.5%  confirmed
  2017-s9e30     610s   11.4%  plausible
  2018-s10e18    880s   17.1%  COULD BE ROB: Mike flagged Rob for 2018/2019
                               appearances; cosine threshold may be hitting
                               on Rob being acoustically similar to Tara
Total Tara across 9 episodes: 1h 21m / 8h 52m audio (15.3%).
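The totals above can be re-derived from the per-episode figures. The sketch below is a minimal check, not the pipeline's own code: `talk_time` and `fmt_hm` are hypothetical helper names, and the turn fields (`speaker`, `start`, `end`) are assumed to match the JSON export attached to this commit. Overlapping turns are counted in full, so sums on real files may slightly overcount.

```python
from collections import defaultdict

def talk_time(turns):
    """Sum turn durations per speaker label, in seconds.

    Overlapping turns are counted in full for each speaker.
    """
    totals = defaultdict(float)
    for turn in turns:
        totals[turn["speaker"]] += turn["end"] - turn["start"]
    return dict(totals)

def fmt_hm(seconds):
    """Format seconds as 'Xh YYm', truncating leftover seconds."""
    hours, rem = divmod(int(seconds), 3600)
    return f"{hours}h {rem // 60:02d}m"

# Per-episode Tara seconds copied from the summary table above.
tara_seconds = [30, 140, 30, 340, 680, 280, 1890, 610, 880]
total_audio = 31928  # seconds, the diarization figure quoted in this message

total_tara = sum(tara_seconds)
print(total_tara, fmt_hm(total_tara), f"{total_tara / total_audio:.1%}")
# prints: 4880 1h 21m 15.3%

# Toy turns (hypothetical, not from any episode) for the per-file calculation.
toy = [
    {"speaker": "HOST", "start": 0.0, "end": 60.0},
    {"speaker": "CO-HOST", "start": 60.0, "end": 90.0},
    {"speaker": "HOST", "start": 90.0, "end": 100.0},
]
print(talk_time(toy))
```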
Q&A counts (still suspect — every voice that isn't Mike-or-Tara is
labeled CALLER, so Randall/Rob/producers inflate the bucket):
2010=4, 2011=1, 2012a=2, 2012b=0, 2014=0, 2015=1, 2016=2, 2017=4, 2018=3
Total: 17 pairs across 9 episodes
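To illustrate why the CALLER bucket inflates the counts, here is a small sketch. Everything in it is an assumption for illustration: the `KNOWN` mapping, the voice names in the toy sequence, and above all the adjacency rule in `count_qa_pairs`, which is not claimed to be the pipeline's actual pairing logic.

```python
# Assumed mapping: only the two known voices get their own label.
KNOWN = {"Mike": "HOST", "Tara": "CO-HOST"}

def label(voice):
    """Anything that is not Mike or Tara falls into the CALLER bucket."""
    return KNOWN.get(voice, "CALLER")

def count_qa_pairs(labels):
    """Count CALLER turns immediately followed by a HOST or CO-HOST turn.

    Illustrative adjacency rule only (an assumption).
    """
    return sum(
        1
        for cur, nxt in zip(labels, labels[1:])
        if cur == "CALLER" and nxt in ("HOST", "CO-HOST")
    )

# Hypothetical turn order: one real caller, plus Rob speaking in-studio twice.
voices = ["Mike", "caller-1", "Mike", "Rob", "Tara", "Rob", "Mike"]
labels = [label(v) for v in voices]
print(labels)
print(count_qa_pairs(labels))  # 3, although only one turn is a real caller

# Per-episode counts as listed above; they sum to the reported total.
counts = {"2010": 4, "2011": 1, "2012a": 2, "2012b": 0, "2014": 0,
          "2015": 1, "2016": 2, "2017": 4, "2018": 3}
print(sum(counts.values()))  # 17
```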
4090 perf on the expanded set:
- Diarization: 31928s in 121.5s = 262.7x realtime (vs 209.7x on 5070 Ti, +25.3%)
- Transcription (3 new episodes only): 10554s in 112.4s = 93.9x
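The realtime factors are audio seconds divided by wall-clock seconds. A minimal sketch of the arithmetic follows; `realtime_factor` is a hypothetical helper name. The inputs are the rounded figures quoted above, so the diarization result can differ from the reported 262.7x in the last digit, presumably because the original used unrounded timings.

```python
def realtime_factor(audio_seconds, wall_seconds):
    """How many seconds of audio are processed per second of wall time."""
    return audio_seconds / wall_seconds

diarization = realtime_factor(31928, 121.5)
transcription = realtime_factor(10554, 112.4)

# Relative speedup, computed from the two reported factors.
speedup = 262.7 / 209.7 - 1

print(f"diarization:   {diarization:.2f}x")
print(f"transcription: {transcription:.1f}x")
print(f"speedup vs 5070 Ti: {speedup:+.1%}")
```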
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
424 lines
7.8 KiB
JSON
{
  "num_speakers": 3,
  "speaker_map": {
    "CALLER": "CALLER",
    "HOST": "HOST",
    "CO-HOST": "CO-HOST"
  },
  "turns": [
    {
      "speaker": "HOST",
      "start": 0.0,
      "end": 20.0,
      "confidence": 0.88
    },
    {
      "speaker": "CO-HOST",
      "start": 15.0,
      "end": 25.0,
      "confidence": 0.92
    },
    {
      "speaker": "HOST",
      "start": 20.0,
      "end": 525.0,
      "confidence": 0.98
    },
    {
      "speaker": "CALLER",
      "start": 520.0,
      "end": 540.0,
      "confidence": 0.81
    },
    {
      "speaker": "HOST",
      "start": 535.0,
      "end": 550.0,
      "confidence": 0.98
    },
    {
      "speaker": "CALLER",
      "start": 545.0,
      "end": 555.0,
      "confidence": 0.81
    },
    {
      "speaker": "HOST",
      "start": 550.0,
      "end": 580.0,
      "confidence": 0.89
    },
    {
      "speaker": "CALLER",
      "start": 575.0,
      "end": 585.0,
      "confidence": 0.8
    },
    {
      "speaker": "HOST",
      "start": 580.0,
      "end": 615.0,
      "confidence": 0.98
    },
    {
      "speaker": "CALLER",
      "start": 610.0,
      "end": 620.0,
      "confidence": 0.84
    },
    {
      "speaker": "HOST",
      "start": 615.0,
      "end": 730.0,
      "confidence": 0.89
    },
    {
      "speaker": "CO-HOST",
      "start": 725.0,
      "end": 770.0,
      "confidence": 0.91
    },
    {
      "speaker": "HOST",
      "start": 765.0,
      "end": 870.0,
      "confidence": 0.98
    },
    {
      "speaker": "CALLER",
      "start": 865.0,
      "end": 875.0,
      "confidence": 0.83
    },
    {
      "speaker": "HOST",
      "start": 870.0,
      "end": 1295.0,
      "confidence": 0.97
    },
    {
      "speaker": "CALLER",
      "start": 1290.0,
      "end": 1305.0,
      "confidence": 0.74
    },
    {
      "speaker": "CO-HOST",
      "start": 1300.0,
      "end": 1310.0,
      "confidence": 0.86
    },
    {
      "speaker": "CALLER",
      "start": 1305.0,
      "end": 1315.0,
      "confidence": 0.82
    },
    {
      "speaker": "CO-HOST",
      "start": 1310.0,
      "end": 1355.0,
      "confidence": 0.98
    },
    {
      "speaker": "HOST",
      "start": 1350.0,
      "end": 1360.0,
      "confidence": 0.95
    },
    {
      "speaker": "CO-HOST",
      "start": 1355.0,
      "end": 1365.0,
      "confidence": 0.87
    },
    {
      "speaker": "CALLER",
      "start": 1360.0,
      "end": 1370.0,
      "confidence": 0.83
    },
    {
      "speaker": "CO-HOST",
      "start": 1365.0,
      "end": 1395.0,
      "confidence": 0.86
    },
    {
      "speaker": "CALLER",
      "start": 1390.0,
      "end": 1400.0,
      "confidence": 0.84
    },
    {
      "speaker": "HOST",
      "start": 1395.0,
      "end": 1415.0,
      "confidence": 0.96
    },
    {
      "speaker": "CO-HOST",
      "start": 1410.0,
      "end": 1425.0,
      "confidence": 0.9
    },
    {
      "speaker": "HOST",
      "start": 1420.0,
      "end": 1430.0,
      "confidence": 0.94
    },
    {
      "speaker": "CALLER",
      "start": 1425.0,
      "end": 1435.0,
      "confidence": 0.82
    },
    {
      "speaker": "HOST",
      "start": 1430.0,
      "end": 1445.0,
      "confidence": 0.91
    },
    {
      "speaker": "CO-HOST",
      "start": 1440.0,
      "end": 1465.0,
      "confidence": 0.9
    },
    {
      "speaker": "HOST",
      "start": 1460.0,
      "end": 2130.0,
      "confidence": 0.88
    },
    {
      "speaker": "CALLER",
      "start": 2125.0,
      "end": 2135.0,
      "confidence": 0.78
    },
    {
      "speaker": "CO-HOST",
      "start": 2130.0,
      "end": 2175.0,
      "confidence": 0.86
    },
    {
      "speaker": "HOST",
      "start": 2170.0,
      "end": 2650.0,
      "confidence": 0.97
    },
    {
      "speaker": "CALLER",
      "start": 2645.0,
      "end": 2655.0,
      "confidence": 0.85
    },
    {
      "speaker": "HOST",
      "start": 2650.0,
      "end": 2725.0,
      "confidence": 0.97
    },
    {
      "speaker": "CO-HOST",
      "start": 2720.0,
      "end": 2730.0,
      "confidence": 0.89
    },
    {
      "speaker": "HOST",
      "start": 2725.0,
      "end": 2995.0,
      "confidence": 0.91
    },
    {
      "speaker": "CO-HOST",
      "start": 2990.0,
      "end": 3005.0,
      "confidence": 0.95
    },
    {
      "speaker": "CALLER",
      "start": 3000.0,
      "end": 3020.0,
      "confidence": 0.81
    },
    {
      "speaker": "HOST",
      "start": 3015.0,
      "end": 3175.0,
      "confidence": 0.92
    },
    {
      "speaker": "CO-HOST",
      "start": 3170.0,
      "end": 3180.0,
      "confidence": 0.91
    },
    {
      "speaker": "HOST",
      "start": 3175.0,
      "end": 3375.0,
      "confidence": 0.97
    },
    {
      "speaker": "CALLER",
      "start": 3370.0,
      "end": 3380.0,
      "confidence": 0.85
    },
    {
      "speaker": "CO-HOST",
      "start": 3375.0,
      "end": 3410.0,
      "confidence": 0.91
    },
    {
      "speaker": "CALLER",
      "start": 3405.0,
      "end": 3415.0,
      "confidence": 0.84
    },
    {
      "speaker": "HOST",
      "start": 3410.0,
      "end": 4185.0,
      "confidence": 0.96
    },
    {
      "speaker": "CALLER",
      "start": 4180.0,
      "end": 4245.0,
      "confidence": 0.8
    },
    {
      "speaker": "HOST",
      "start": 4240.0,
      "end": 4265.0,
      "confidence": 0.91
    },
    {
      "speaker": "CALLER",
      "start": 4260.0,
      "end": 4280.0,
      "confidence": 0.84
    },
    {
      "speaker": "HOST",
      "start": 4275.0,
      "end": 4290.0,
      "confidence": 0.95
    },
    {
      "speaker": "CALLER",
      "start": 4285.0,
      "end": 4295.0,
      "confidence": 0.82
    },
    {
      "speaker": "HOST",
      "start": 4290.0,
      "end": 4325.0,
      "confidence": 0.86
    },
    {
      "speaker": "CALLER",
      "start": 4320.0,
      "end": 4335.0,
      "confidence": 0.79
    },
    {
      "speaker": "HOST",
      "start": 4330.0,
      "end": 4370.0,
      "confidence": 0.97
    },
    {
      "speaker": "CALLER",
      "start": 4365.0,
      "end": 4380.0,
      "confidence": 0.81
    },
    {
      "speaker": "HOST",
      "start": 4375.0,
      "end": 4405.0,
      "confidence": 0.97
    },
    {
      "speaker": "CALLER",
      "start": 4400.0,
      "end": 4415.0,
      "confidence": 0.82
    },
    {
      "speaker": "HOST",
      "start": 4410.0,
      "end": 4420.0,
      "confidence": 0.85
    },
    {
      "speaker": "CALLER",
      "start": 4415.0,
      "end": 4430.0,
      "confidence": 0.84
    },
    {
      "speaker": "HOST",
      "start": 4425.0,
      "end": 4525.0,
      "confidence": 0.97
    },
    {
      "speaker": "CALLER",
      "start": 4520.0,
      "end": 4530.0,
      "confidence": 0.81
    },
    {
      "speaker": "HOST",
      "start": 4525.0,
      "end": 4555.0,
      "confidence": 0.89
    },
    {
      "speaker": "CO-HOST",
      "start": 4550.0,
      "end": 4595.0,
      "confidence": 0.89
    },
    {
      "speaker": "HOST",
      "start": 4590.0,
      "end": 5285.0,
      "confidence": 0.95
    },
    {
      "speaker": "CO-HOST",
      "start": 5280.0,
      "end": 5300.0,
      "confidence": 0.94
    },
    {
      "speaker": "CALLER",
      "start": 5295.0,
      "end": 5305.0,
      "confidence": 0.83
    },
    {
      "speaker": "CO-HOST",
      "start": 5300.0,
      "end": 5315.0,
      "confidence": 0.91
    },
    {
      "speaker": "HOST",
      "start": 5310.0,
      "end": 5340.0,
      "confidence": 0.97
    }
  ]
}