A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation
deep-learning pytorch speech-synthesis codec vector-quantization wavlm vocos focal-modulation neural-speech-coding
-
Updated
Nov 30, 2025 - Jupyter Notebook