Feature Request: Auto convert custom whisper models to Faster whisper format when downloading #1360

@Cherkah

Description

Buzz currently supports only the standard Faster-Whisper models (tiny/small/medium/large plus the .en variants), which limits accuracy for non-English audio where language-specific fine-tuned models exist.

I'd like to request the ability to:

  1. Download/import custom Whisper models directly from Hugging Face repositories
  2. Use fine-tuned models such as yuriyvnv/whisper-small-mixed-cv-nl for better Dutch transcription (WER ~13% vs ~18% for the multilingual small model).

Use case

  • Located in the Netherlands (Roosendaal), need accurate Dutch speech-to-text
  • Recent HF models (2025) significantly outperform multilingual baselines on accented and noisy audio
  • Current workflow: transcribe with the faster-whisper CLI → import the resulting SRT into Buzz (inefficient)
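
For context, the workaround above can be sketched roughly as follows. This is only an illustration, assuming the `faster-whisper` Python package and a model directory that has already been converted; the helper names `srt_timestamp`, `to_srt` and `transcribe_to_srt` are hypothetical and not part of Buzz or faster-whisper.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Build SRT text from an iterable of (start, end, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, 1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}\n"
        )
    return "\n".join(blocks)


def transcribe_to_srt(audio_path: str, model_dir: str, language: str = "nl") -> str:
    """Transcribe with faster-whisper and return SRT text (hypothetical helper)."""
    # Imported lazily so the formatting helpers work without the dependency.
    from faster_whisper import WhisperModel

    # device/compute_type are assumptions for an NVIDIA GPU setup.
    model = WhisperModel(model_dir, device="cuda", compute_type="float16")
    segments, _info = model.transcribe(audio_path, language=language)
    return to_srt((seg.start, seg.end, seg.text) for seg in segments)
```

The SRT file produced this way then has to be imported into Buzz by hand, which is the step this request would remove.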

Proposed implementation

  • Settings > Models > "Download from Hugging Face" field (repo ID)
  • Auto-conversion to Faster-Whisper format with CUDA/NVIDIA support
  • Model cache in ~/.local/share/buzz/models/
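
A minimal sketch of what the download-and-convert step could look like, assuming the `ctranslate2` package (with `transformers` and `torch` installed) and its `TransformersConverter`. The cache layout, the `cache_dir_for` and `convert_to_faster_whisper` helpers, and the list of copied files are assumptions for illustration, not Buzz's actual implementation.

```python
from pathlib import Path

# Assumed cache root, taken from the proposal above; Buzz's real path may differ.
DEFAULT_CACHE_ROOT = Path.home() / ".local" / "share" / "buzz" / "models"


def cache_dir_for(repo_id: str, root: Path = DEFAULT_CACHE_ROOT) -> Path:
    """Map a Hugging Face repo ID to a local directory (hypothetical helper)."""
    owner, sep, name = repo_id.strip().partition("/")
    if not sep or not owner or not name or "/" in name:
        raise ValueError(f"expected a repo ID like 'owner/name', got {repo_id!r}")
    return root / f"{owner}--{name}"


def convert_to_faster_whisper(
    repo_id: str,
    root: Path = DEFAULT_CACHE_ROOT,
    quantization: str = "float16",
) -> Path:
    """Convert a Transformers Whisper checkpoint to CTranslate2 format, once."""
    out_dir = cache_dir_for(repo_id, root)
    if (out_dir / "model.bin").exists():
        return out_dir  # already converted, reuse the cached copy

    # Imported lazily so this module loads without the optional dependencies.
    from ctranslate2.converters import TransformersConverter

    # Assumption: the repo ships these files; some fine-tuned repos may not.
    converter = TransformersConverter(
        repo_id,
        copy_files=["tokenizer.json", "preprocessor_config.json"],
    )
    out_dir.parent.mkdir(parents=True, exist_ok=True)
    converter.convert(str(out_dir), quantization=quantization, force=True)
    return out_dir
```

The returned directory could then be loaded with `faster_whisper.WhisperModel(str(path), device="cuda", compute_type="float16")`. The same conversion is also available from the command line through `ct2-transformers-converter`, so the in-app feature would mostly be a wrapper around an existing tool.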

Current limitations

  • No custom model support found in docs/FAQ
  • No related feature requests in Issues/Discussions (searched 2023-2026)

Screenshots (if applicable)
[Attach Buzz model selection screen]

Environment

  • OS: Fedora 43 KDE Plasma 6.5 + NVIDIA (RPM Fusion)
  • Buzz: [version from flatpak list]
  • GPU acceleration working correctly with standard models

Metadata

Assignees

No one assigned

    Labels

    enhancement (New feature or request)
