Turn your coding agent into a screen-aware, voice-aware, context-rich collaborator.
Record your screen, microphone, and system audio in real time, then search what happened in natural language.
Works with Claude Code, Cursor, Codex, and other skill-compatible agents.
Pair Programmer is an agentic skill that gives your AI coding assistant real-time perception.
It captures:
- Screen for visual context like terminals, editors, browser tabs, errors, and UI state
- Microphone for your spoken intent, ideas, and debugging notes
- System audio for tutorials, meetings, demos, and anything else your computer is playing
Once captured, that context becomes searchable.
So instead of re-explaining what was on screen, copy-pasting logs, or summarizing a 20-minute debugging session, you can ask:
- What was I doing when the auth flow broke?
- What did I say about the database migration?
- Show me what was on screen when the test failed
- What happened in the last 10 minutes?
This is the missing perception layer for coding agents.
PP 2 is a major upgrade.
- Now an agentic skill: not limited to Claude Code anymore
- Works across tools: use it with Claude Code, Cursor, Codex, and other compatible agent environments
- Install with `npx`: faster setup, less friction
- Cleaner UI: improved picker and recording widget for a smoother experience
If you have an older version installed, remove it before upgrading.
`npx skills add video-db/pair-programmer`
`/plugin marketplace add video-db/pair-programmer`
`/plugin install pair-programmer`
Get a free VideoDB API key from console.videodb.io.
No credit card required.
Set your API key:
`export VIDEO_DB_API_KEY=your-key`
Or add it to a `.env` file in your project root.
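As a quick sanity check before running setup, the sketch below reports where a key can be found. It is illustrative only: the helper name `api_key_source` is hypothetical, and it assumes the `.env` file uses plain `KEY=value` lines. How Pair Programmer itself resolves the key, including which source takes precedence, is not covered here.

```shell
#!/bin/sh
# Hypothetical helper, not part of Pair Programmer.
# Prints where VIDEO_DB_API_KEY was found: "environment", "dotenv", or "missing".
# Assumes .env holds plain KEY=value lines (an assumption, not documented behavior).
api_key_source() {
  env_file="${1:-.env}"
  if [ -n "${VIDEO_DB_API_KEY:-}" ]; then
    echo "environment"
  elif [ -f "$env_file" ] && grep -q '^VIDEO_DB_API_KEY=.' "$env_file"; then
    echo "dotenv"
  else
    echo "missing"
    return 1
  fi
}

# Check the .env file in the current directory and print a hint if no key is found.
api_key_source .env || echo "Set VIDEO_DB_API_KEY or add it to .env" >&2
```

Run it from the project root so that the relative `.env` path points at the right file.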
Then run:
`/pair-programmer setup`
Start recording your screen, mic, and system audio:
`/pair-programmer record`
A source picker will open so you can choose what to capture. Once recording starts, a lightweight overlay shows recording status, active channels, and elapsed time.
Search your session in natural language:
`/pair-programmer search "what was I working on when I mentioned the auth bug?"`
`/pair-programmer search "what did I say in the last 5 minutes?"`
`/pair-programmer search "show me what was on screen when the test failed"`
Get a summary of recent activity:
`/pair-programmer what-happened`
Stop recording when you're done:
`/pair-programmer stop`
Most coding agents can write code.
Very few can stay grounded in the same context as you.
Pair Programmer helps your agent stay on the same page by giving it access to what you saw, what you said, and what your machine was playing. That means less manual explanation, fewer broken handoffs, and a much more natural way to work.
Use it for:
- debugging sessions
- tutorial-driven development
- bug reproduction
- meeting follow-ups
- architecture walkthroughs
- voice-first coding workflows
| Command | Description |
|---|---|
| `/pair-programmer record` | Start recording and open the source picker |
| `/pair-programmer stop` | Stop the active recording |
| `/pair-programmer search "<query>"` | Search screen, mic, and audio context using natural language |
| `/pair-programmer what-happened` | Summarize recent activity |
| `/pair-programmer setup` | Install dependencies and complete local setup |
| `/pair-programmer config` | Update indexing and recording settings |
- Node.js 18+
- macOS 12+ (Windows support is currently in beta)
- VideoDB API key (get one at console.videodb.io)
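The version requirements above can be checked from a terminal before installing. The following is a minimal sketch, not an official installer step: the helper names `major_version` and `version_at_least` are hypothetical, and it relies only on `node --version` and, on macOS, `sw_vers -productVersion`.

```shell
#!/bin/sh
# Hypothetical preflight helpers, not part of Pair Programmer.

# Print the major version from a string such as "v18.17.0" or "14.4.1".
major_version() {
  v="${1#v}"          # drop a leading "v" if present
  echo "${v%%.*}"     # keep everything before the first dot
}

# Succeed when the version string's major component is at least the minimum.
version_at_least() {
  [ "$(major_version "$1")" -ge "$2" ]
}

if command -v node >/dev/null 2>&1; then
  if version_at_least "$(node --version)" 18; then
    echo "Node.js: OK"
  else
    echo "Node.js: 18+ required"
  fi
else
  echo "Node.js: not found"
fi

# sw_vers exists only on macOS, so this check is skipped elsewhere.
if command -v sw_vers >/dev/null 2>&1; then
  if version_at_least "$(sw_vers -productVersion)" 12; then
    echo "macOS: OK"
  else
    echo "macOS: 12+ required"
  fi
fi
```

Only the major version is compared, which is enough for the "18+" and "12+" thresholds listed here.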
Pair Programmer is open source and designed to be adapted for your own workflows and agent use cases.
- Issues: GitHub Issues
- Docs: docs.videodb.io
- Discord: Join community
VideoDB is the perception, memory, and action layer for AI agents working with video and audio.
Pair Programmer is one example of what becomes possible when agents can understand continuous media in real time.
Learn more at videodb.io
Made with ❤️ by the VideoDB team