A beautiful, modern voice recording application with real-time transcription using OpenAI's Whisper AI model. Features a minimalist UI with dark/light mode, recording history, and multiple model support.
- High-Quality Audio Recording - Record, pause, resume, and stop with intuitive controls
- AI-Powered Transcription - Automatic speech-to-text using OpenAI Whisper
- Multiple Model Support - Choose from different Whisper models (tiny, base, small, medium)
| Model | Size | Speed | Accuracy |
|---|---|---|---|
| tiny.en | ~79 MB | Fastest | Basic |
| small | ~544 MB | Fast | Good |
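
The size/speed trade-off in the table can be turned into a simple selection rule. The following is a minimal sketch, not code from this project: `MODEL_SIZES_MB` and `pick_model` are hypothetical names, and the sizes are the approximate figures listed above for the two models shown.

```python
# Approximate download sizes (MB), copied from the table above.
# Only the two models listed there are included; this is an illustration.
MODEL_SIZES_MB = {"tiny.en": 79, "small": 544}


def pick_model(max_size_mb: float) -> str:
    """Return the largest listed model whose size fits within max_size_mb."""
    candidates = [
        (size, name) for name, size in MODEL_SIZES_MB.items() if size <= max_size_mb
    ]
    if not candidates:
        raise ValueError(f"No listed model fits within {max_size_mb} MB")
    # Larger models are generally more accurate, so prefer the biggest that fits.
    return max(candidates)[1]
```

For example, `pick_model(100)` selects `tiny.en`, while a budget of 1000 MB allows `small`.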
Clone the repository:

    git clone https://github.com/SimpleCyber/OpenAI-Whisper-Ollama-LLMs.git
    cd OpenAI-Whisper-Ollama-LLMs

Install the dependencies:

    pip install git+https://github.com/openai/whisper.git
    pip install torch
    pip install flask

Edit app.py and update the model cache path if needed:

    model_cache = r"D:\whisper_models"  # Change to your preferred location

Run the application:

    python app.py

The server will start at http://127.0.0.1:5000
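
If you would rather not edit app.py on every machine, the cache path could be read from an environment variable instead. This is a sketch under assumptions, not how app.py is known to work: the variable name `WHISPER_MODEL_CACHE` and the helpers `resolve_model_cache` and `load_model` are hypothetical. The only Whisper call used is `whisper.load_model(name, download_root=...)`, which downloads the model into that directory on first use.

```python
import os
from pathlib import Path


def resolve_model_cache(env=None, default=r"D:\whisper_models") -> Path:
    """Return the model cache directory.

    WHISPER_MODEL_CACHE is a hypothetical variable name chosen for this
    example; the default mirrors the path shown in the setup step.
    """
    env = os.environ if env is None else env
    return Path(env.get("WHISPER_MODEL_CACHE", default)).expanduser()


def load_model(name: str, cache_dir: Path):
    """Load a Whisper model, storing downloaded weights in cache_dir."""
    import whisper  # imported lazily so the path helper works without it

    cache_dir.mkdir(parents=True, exist_ok=True)
    return whisper.load_model(name, download_root=str(cache_dir))
```

With this in place, `load_model("small", resolve_model_cache())` would use the overridden location when the variable is set and fall back to the default otherwise.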
Navigate to http://localhost:5000 in your web browser.
- Click the model dropdown in the top-right corner
- Choose your preferred Whisper model
- First-time model usage will download it automatically
- Start Recording - Click the microphone button
- Pause/Resume - Use pause/resume buttons during recording
- Stop - Click stop when finished
- Wait for Transcription - The AI will process and display the text
- Click the hamburger menu (☰) to open the sidebar
- See all your past recordings with timestamps
- Click any recording to view full details
- Download audio files from the detail modal
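
The stop, transcribe, and history steps above could be modeled on the server side roughly as follows. This is an illustrative sketch rather than the project's actual code: the `Recording` class and `transcribe_recording` function are hypothetical, and the history is kept in a plain list. It relies only on the fact that a loaded Whisper model exposes `transcribe(path)`, which returns a dict containing a `"text"` key.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass
class Recording:
    """One history entry: the saved audio file, its transcript, and a timestamp."""
    audio_path: str
    text: str
    created_at: datetime


def transcribe_recording(model, audio_path: str, history: list) -> Recording:
    """Transcribe a finished recording and add it to the history, newest first.

    `model` is any object with a Whisper-style transcribe(path) method.
    """
    result = model.transcribe(audio_path)
    entry = Recording(
        audio_path=audio_path,
        text=result["text"].strip(),
        created_at=datetime.now(timezone.utc),
    )
    history.insert(0, entry)  # newest entry at the top of the sidebar list
    return entry
```

Passing the model in as an argument keeps the function easy to test with a stand-in object, and lets the caller decide which Whisper model is currently selected.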
Contributions are welcome! Please feel free to submit a Pull Request.
This project is open source and available under the MIT License.
- OpenAI Whisper - AI transcription model
- Flask - Web framework
- Tailwind CSS - UI styling
- Font Awesome - Icons
- Pattern Craft - Background
Made with ❤️ by SimpleCyber