Linux software to perform different operations using voice control:
- open folders and websites
- search google
- control music and video playback: pause, mute, etc.
- type text
Install the following packages if you want to have the corresponding functionality:
- python3 (must-have, usually already present)
- python3-pyaudio (also must-have)
- xautomation (to control media and type text with your voice)
- python3-espeak (to receive computer voice notifications)
To install all requirements on Ubuntu:
sudo apt-get install xautomation python3 python3-pyaudio python3-espeak
No installation is required. Just download uspeak and it's ready.
Get to program directory and run
./uspeak -h
to get the list of supported parameters. By now the following languages are supported for commands:
- en (English)
- ru (Russian)
Usually you'll want to run the tool two ways:
Execute single voice command and exit
./uspeak.py --lang=en
Useful to set system hotkey for this command. You'll have a couple of second to start speaking otherwise it will exit.
Wait for loud sound to trigger the voice input
./uspeak.py --lang=en --continuous
Almost the same, but the tool will run continuously and be triggered each time you produce some load noise: say something, clap your hands, tap the mic, etc. After that it will wait a couple of seconds for your command, execute it and continue waiting for next trigger event.
You'd better setup your microphone volume to comfortable level then. If the tool is triggered too often, decrease the volume. And vice a versa.
Open ~/Documents: 'open documents'
Open ~/Downloads: 'open downloads'
Open ~/Dropbox: 'open dropbox'
Open ~/Music: 'open music'
Open ~/Videos: 'open videos'
Open website: 'open' or 'go (to)' <site address or site name>, e.g. 'open youtube'
Search Google: 'search' or 'google' <text to search> or just ask any question
Insrease volume: 'louder' or 'volume up'
Decrease volume: 'volume down'
Mute all sounds: 'mute'
Unmute all sounds: 'unmute'
Play next song/video in queue: 'next'
Play previous song/video in queue: 'latest' or 'previous'
Pause/unpause: 'pause' or 'play'
Type text: 'type' <text>
For complete list of commands and their variations look at dictionary files, e.g. for English: dictionary/data/main_en.dic (you should be familiar with regular expressions)
For help on commands in Russian run:
./uspeak.py --lang=ru --list-commands
- Based on LiSpeak