A graphical interface for voice-to-text transcription using Python and the Deepgram API. This application allows you to convert speech to text in real-time and automatically types the transcribed text at your cursor position.
Alternative to Mac Whisper, Voice Access, and other voice typing tools.
- Real-time speech-to-text transcription
- Automatic text insertion at cursor position
- Keyboard shortcut support (F2)
- Transcription logging
- User-friendly GUI interface
- Python 3.7 or higher
- Deepgram API key (get one at https://deepgram.com)
- Operating System: Windows, macOS, or Linux
- Install the required dependencies:
pip install -r requirements.txt- Run the application:
python main.py-
Add Deepgram API key to the application.
-
Enjoy!
- Click the "Start Recording" button or press F2 to begin recording
- Click again or press F2 to stop recording
- The transcribed text will appear in the window and be typed at your cursor position
- All transcriptions are logged in transcribe.log
If you find this tool helpful, you can support the development by:
- Buying me a coffee at https://ko-fi.com/perrypixel
- UPI to kevinp@apl