A Hackers AI Voice Assistant

Build your own voice AI. This repo accompanies my YouTube video series on building an AI voice assistant with PyTorch.

TODO:

wake word model and engine
pre-trained wake word model to use for fine-tuning on your own wake word
speech recognition model, pretrained model, and engine
natural language understanding model, pretrained model, and engine
speech synthesis model, pretrained model, and engine

Running on native machine

dependencies

python3
portaudio (for pyaudio to work)

If you're on macOS, you can install portaudio using Homebrew: brew install portaudio

using virtualenv (recommended)

virtualenv voiceassistant.venv
source voiceassistant.venv/bin/activate

pip packages

pip install -r requirements.txt

Running with Docker

setup

If you are running with just the CPU: docker build -f cpu.Dockerfile -t voiceassistant .

If you are running on a CUDA-enabled machine: docker build -f Dockerfile -t voiceassistant .

Wake word

scripts

For more details, open each script to see its arguments and description.

neuralnet/train.py is used to train the model

neuralnet/optimize_graph.py is used to create a production ready graph that can be used in engine.py

engine.py is used to demo the wakeword model

collect_wakeword_audio.py - used to collect wakeword and environment data

split_audio_into_chunks.py - used to split audio into n second chunks

split_commonvoice.py - if you download the common voice dataset, use this script to split it into n second chunks

create_wakeword_json.py - used to create the wakeword json for training
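
To make the chunking step concrete, here is a minimal sketch of splitting a WAV file into n-second chunks using only Python's standard `wave` module. This is not the code in split_audio_into_chunks.py; the function name `split_wav` and the output naming scheme are assumptions made for illustration, and the final chunk may be shorter than n seconds.

```python
import wave
from pathlib import Path


def split_wav(src, out_dir, seconds):
    """Split a WAV file into consecutive chunks of `seconds` length.

    Hypothetical helper for illustration only. The last chunk may be
    shorter than `seconds`. Returns the list of chunk paths written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(src), "rb") as reader:
        params = reader.getparams()
        frames_per_chunk = int(reader.getframerate() * seconds)
        index = 0
        while True:
            frames = reader.readframes(frames_per_chunk)
            if not frames:
                break
            # Assumed naming scheme: <original stem>_<chunk index>.wav
            chunk_path = out_dir / f"{Path(src).stem}_{index}.wav"
            with wave.open(str(chunk_path), "wb") as writer:
                # Copy channels, sample width, and rate; the frame count
                # in the header is corrected when the file is closed.
                writer.setparams(params)
                writer.writeframes(frames)
            written.append(chunk_path)
            index += 1
    return written
```

For example, `split_wav("recording.wav", "chunks", 2)` would write `chunks/recording_0.wav`, `chunks/recording_1.wav`, and so on, each two seconds long except possibly the last.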

Steps to train and demo your wakeword model

For more details, open each script mentioned below to see its arguments and description.

collect data
1. environment and wakeword data can be collected using python collect_wakeword_audio.py
2. be sure to collect other speech data, such as Common Voice. split the data into n-second chunks with split_audio_into_chunks.py.
3. put the data into two separate directories named 0 and 1: 0 for non-wakeword, 1 for wakeword. use create_wakeword_json.py to create the train and test json files
4. the train and test json files should be in this format...
```
// make sure each sample is on a separate line
{"key": "/path/to/audio/sample", "label": 0}
{"key": "/path/to/audio/sample", "label": 1}
```
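
As a rough illustration of that format, the sketch below builds train and test files from the 0 and 1 directories, writing one JSON object per line. It is not the repo's create_wakeword_json.py; the function name `build_manifest`, the `*.wav` glob, the 10% test split, and the fixed shuffle seed are all assumptions.

```python
import json
import random
from pathlib import Path


def build_manifest(data_dir, train_path, test_path, test_fraction=0.1, seed=0):
    """Write train/test JSON-lines files from `data_dir/0` and `data_dir/1`.

    Hypothetical helper for illustration only. Returns a tuple of
    (number of train samples, number of test samples).
    """
    samples = []
    for label in (0, 1):  # 0 = non-wakeword, 1 = wakeword
        for wav in sorted(Path(data_dir, str(label)).glob("*.wav")):
            samples.append({"key": str(wav), "label": label})

    # Shuffle with a fixed seed so the split is reproducible.
    random.Random(seed).shuffle(samples)
    n_test = int(len(samples) * test_fraction)
    splits = {test_path: samples[:n_test], train_path: samples[n_test:]}

    for path, rows in splits.items():
        with open(path, "w") as f:
            for row in rows:
                # One JSON object per line, matching the format above.
                f.write(json.dumps(row) + "\n")
    return len(samples) - n_test, n_test
```

Calling `build_manifest("data", "train.json", "test.json")` assumes a layout of `data/0/*.wav` and `data/1/*.wav`; adjust the glob if your clips use another extension.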
train model
1. use train.py to train model
2. after model training, use optimize_graph.py to create an optimized pytorch model
test
1. test using the engine.py script

Raspberry Pi

documentation to get this running on a Raspberry Pi is in progress...
