Files
claudetools/projects/radio-show/audio-processor/config.yaml
Mike Swanson 6cc9043b8e Audio processor: validated voice profiling accuracy, tuned threshold
- Fine-grained speaker analysis (3s windows, 1s hop) across 42min episode
- Host voice: 0.90-0.98 similarity (clear positive match)
- Callers: 0.65-0.68 (correctly below threshold)
- Produced audio/clips: 0.53-0.65 (correctly identified as non-host)
- Co-host/other speakers: 0.56-0.62 (correctly identified)
- Tuned host_match_threshold from 0.75 to 0.83 based on empirical data
- Cross-referenced dips with transcript: correctly identifies callers,
  show intros, played audio clips, and station breaks
- Batch transcription of 7 additional training episodes in progress

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-21 12:48:25 -07:00

58 lines
1.3 KiB
YAML

show:
name: "The Computer Guru Show"
host: "Mike Swanson"
typical_duration_minutes: 120
segment_count: 6
has_commercials: true
audio:
whisper_model: "large-v3"
whisper_language: "en"
output_format: "mp3"
output_bitrate: "192k"
normalize: true
crossfade_ms: 500
segment_detection:
fingerprint_db: "element-library/fingerprints.db"
fingerprint_match_threshold: 0.85
discover_unknown_elements: true
min_element_duration_s: 1.0
max_element_duration_s: 30.0
cluster_similarity_threshold: 0.90
min_cluster_occurrences: 3
min_break_duration_s: 30
max_break_duration_s: 300
silence_threshold_db: -40
confidence_threshold: 0.70
weights:
fingerprint_match: 0.30
speaker_identity: 0.25
audio_characteristics: 0.20
break_pattern: 0.15
structural_heuristic: 0.10
diarization:
min_speakers: 1
max_speakers: 6
voice_profiles_dir: "voice-profiles/"
host_match_threshold: 0.83
llm:
model: "qwen3:14b"
ollama_host: "http://localhost:11434"
paths:
episodes_dir: "episodes/"
voice_profiles: "voice-profiles/"
element_library: "element-library/"
output_dir: "processed/"
archive:
server: "172.16.3.10"
path: "/home/gurushow/public_html/archive/"
elements_path: "/home/gurushow/public_html/archive/Radio/Elements/"