dictate-google

Real-time voice dictation using Google Cloud Speech-to-Text streaming API. Text appears as you speak, directly typed into any application via xdotool.

Features

Real-time streaming: Text appears progressively as you speak
Smart display updates: Handles interim results with minimal flicker
Toggle mode: Run once to start, run again to stop
Continuous dictation: Stream stays open for natural pauses

Requirements

Linux with X11 (uses xdotool for typing)
Python 3.8+
Google Cloud account with Speech-to-Text API enabled

Installation

# Install dependencies
pip install google-cloud-speech pyaudio

# On Debian/Ubuntu, you may also need:
sudo apt install python3-pyaudio xdotool portaudio19-dev

# Copy the script to your PATH
cp dictate-google ~/.local/bin/
chmod +x ~/.local/bin/dictate-google

Setup

Create a Google Cloud project and enable the Speech-to-Text API
Create a service account and download the JSON credentials
Place credentials at ~/.config/stt-credentials.json or set GOOGLE_APPLICATION_CREDENTIALS

# Option 1: Default location
cp your-credentials.json ~/.config/stt-credentials.json

# Option 2: Environment variable
export GOOGLE_APPLICATION_CREDENTIALS=/path/to/credentials.json

Usage

# Start dictation (English)
dictate-google

# Start dictation (German)
dictate-google --lang=de-DE

# Stop dictation (run again or Ctrl+C)
dictate-google

Keyboard Shortcut

Bind dictate-google to a key (e.g., Super+D) in your desktop environment for quick toggle.

How It Works

Opens microphone stream and sends audio to Google Cloud Speech-to-Text
Receives interim results (may change) and final results (committed)
Types text into the focused application using xdotool
Tracks what's typed to handle corrections without flickering

Limitations

Streams have a 5-minute maximum duration (Google API limit)
Requires X11 (Wayland users need XWayland or alternative input method)
Microphone must be accessible to the script

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
LICENSE		LICENSE
README.md		README.md
dictate-google		dictate-google

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dictate-google

Features

Requirements

Installation

Setup

Usage

Keyboard Shortcut

How It Works

Limitations

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

dictate-google

Features

Requirements

Installation

Setup

Usage

Keyboard Shortcut

How It Works

Limitations

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages