claudetools

Files

Mike Swanson 826141a319 Audio processor: working voice profiler with WavLM speaker embeddings

- Voice profiler using microsoft/wavlm-base-sv (512-dim x-vector embeddings)
- Bootstrap from archive: 180 embeddings from 9 episodes across 2010-2018
- Host identification accuracy: 0.87-0.98 similarity for live speech,
  0.60-0.64 for non-host audio (produced intros, co-host)
- Dropped speechbrain dependency (requires torchaudio, CUDA version conflicts)
- Patched torchaudio CUDA 12.8/13.1 version check (warning instead of error)
- Profile stored in voice-profiles/mike-swanson/ with per-chunk embeddings

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-21 12:19:13 -07:00

audio-processor

Audio processor: working voice profiler with WavLM speaker embeddings

2026-03-21 12:19:13 -07:00

episodes

Session log: repo reorganization, GrepAI test, radio show prep

2026-03-20 18:34:11 -07:00

website

Reorganize repo: compartmentalize scripts by client/project

2026-03-20 17:15:07 -07:00

post-show-workflow.md

Add radio show audio processor and post-show workflow

2026-03-21 11:51:59 -07:00