EasySpeak¶
Voice control for Linux desktops. Fully local, no cloud, Wayland-native.
Say "Hey Jarvis" and control your desktop with your voice.
Early development
This project works but is not polished. Expect bugs, incomplete docs, and changes without notice.
Why EasySpeak?¶
Linux desktop voice control is a gap. Talon exists but has a steep learning curve and costs money for the full version. Most other tools are X11-only, abandoned, or cloud-dependent.
EasySpeak is:
- Free and open source — GPL-3.0 licensed, no paywalls
- Fully local — No cloud, no accounts, no data leaving your machine
- Wayland-native — Works on modern GNOME desktops where X11 tools fail
- Simple — Say "Hey Jarvis, open downloads" and it works
- Extensible — Drop a Python file in
plugins/to add commands
Built for people with RSI, accessibility needs, hands-busy workflows, or anyone who wants to talk to their computer.
Features¶
Current and in active development:
- Wake word activation — Hands-free with "Hey Jarvis"
- Mouse grid — Navigate anywhere on screen with voice ("grid", "3 7 5", "click")
- Head tracking — Control cursor with head movement (experimental)
- Browser control — Qutebrowser integration with link hints, tabs, scrolling
- Dictation — Voice-to-text in any text field with punctuation commands
- App launcher — Open and close applications by name
- Media control — Play, pause, skip via MPRIS
- System controls — Volume, brightness, do not disturb
- Fully local — OpenWakeWord + Whisper + Piper, no cloud services
- Plugin architecture — Easy to extend
Demo¶
Where to next?¶
- Installation — system packages, Python, Piper TTS
- Usage — running the daemon and the wake word
- Commands — the full command reference
- Writing Plugins — extend EasySpeak with a Python file
- Troubleshooting — common problems and fixes
- How It Works — the architecture
- API Reference — generated from the source docstrings
- License
Acknowledgments¶
- OpenWakeWord — Wake word detection
- faster-whisper — Speech recognition
- Piper — Text-to-speech (we use the last standalone binary from the original rhasspy/piper repo)
- Talon — Inspiration for voice control concepts
