Description
Buzz currently only supports standard Faster-Whisper models (tiny/small/medium/large + .eng variants), limiting language-specific accuracy for non-English audio.
I'd like to request the ability to:
- Download/import custom Whisper models directly from Hugging Face repositories
- Use fine-tuned models like
yuriyvnv/whisper-small-mixed-cv-nl for superior Dutch transcription (WER ~13% vs ~18% multilingual small) .
Use case
- Located in Netherlands (Roosendaal), need accurate Dutch speech-to-text
- Recent HF models (2025) outperform multilingual baselines significantly on accents/bruit
- Current workflow: CLI
faster-whisper → SRT import in Buzz (inefficient)
Proposed implementation
- Settings > Models > "Download from Hugging Face" field (repo ID)
- Auto-conversion to Faster-Whisper format with CUDA/NVIDIA support
- Model cache in
~/.local/share/buzz/models/
Current limitations
- No custom model support found in docs/FAQ
- No related feature requests in Issues/Discussions (searched 2023-2026)
Screenshots (if applicable)
[Attach Buzz model selection screen]
Environment
- OS: Fedora 43 KDE Plasma 6.5 + NVIDIA (RPM Fusion)
- Buzz: [version from flatpak list]
- GPU acceleration working correctly with standard models
Description
Buzz currently only supports standard Faster-Whisper models (tiny/small/medium/large + .eng variants), limiting language-specific accuracy for non-English audio.
I'd like to request the ability to:
yuriyvnv/whisper-small-mixed-cv-nlfor superior Dutch transcription (WER ~13% vs ~18% multilingual small) .Use case
faster-whisper→ SRT import in Buzz (inefficient)Proposed implementation
~/.local/share/buzz/models/Current limitations
Screenshots (if applicable)
[Attach Buzz model selection screen]
Environment