simon
A speech model basically consists of two parts:
- Language model: Describes all existing words and which sentences are grammatically correct. It is created from your active scenario.
- Acoustic model: Describes how words sound. It is built either solely from your voice recordings (training) or with the help of a base model.
The SPHINX backend uses:
- PocketSphinx recognizer
- SphinxTrain model generation backend
models:
- US English HUB4 WSJ SPHINX model. This model is part of pocketsphinx-extra, where it is called "hub4_wsj_sc_3s_8k.cd_semi_5000".
install
sudo apt install simon
sudo apt install libsphinxbase3 python-sphinxbase sphinxbase-utils pocketsphinx sphinxtrain
edit
- /usr/lib/sphinxtrain/etc/sphinx_train.cfg
$CFG_CD_TRAIN = 'no';
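The same edit can be scripted instead of made by hand. The following is a minimal sketch, not part of SphinxTrain: the function name `set_cd_train` is hypothetical, and it assumes the config contains an uncommented line of the form `$CFG_CD_TRAIN = '...';` at the start of a line.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with SphinxTrain):
# set $CFG_CD_TRAIN in a SphinxTrain config file.
# Usage: set_cd_train <config-file> <yes|no>
set_cd_train() {
    cfg=$1
    value=$2
    # Rewrite only an uncommented assignment at the start of a line;
    # anything after the quoted value (";" or a trailing comment) is kept.
    sed "s/^\\\$CFG_CD_TRAIN[[:space:]]*=[[:space:]]*'[^']*'/\$CFG_CD_TRAIN = '$value'/" \
        "$cfg" > "$cfg.tmp" && mv "$cfg.tmp" "$cfg"
}

# Example call (path taken from this page):
#   set_cd_train /usr/lib/sphinxtrain/etc/sphinx_train.cfg no
```

Editing the system-wide file under /usr/lib normally requires root privileges. Commented-out assignments are left untouched.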
model
This warning is caused by the fact that you disabled context-dependent training while using a semi-continuous model. You can ignore it until you have more data and can use context-dependent training.
If you train on a small amount of data without context-dependent models, use a continuous model instead of a semi-continuous one and the warning will go away.
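Switching the model type can be sketched as a small shell helper. This page does not name the setting, so the following rests on an assumption: that the config selects the model type through a `$CFG_HMM_TYPE` variable with values such as `'.semi.'` and `'.cont.'`, as in stock SphinxTrain templates. Check your own sphinx_train.cfg before relying on it. The function name `use_continuous_model` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: select a continuous acoustic model.
# ASSUMPTION: the config uses a line like  $CFG_HMM_TYPE = '.semi.';
# Usage: use_continuous_model <config-file>
use_continuous_model() {
    cfg=$1
    # Replace the value of the uncommented $CFG_HMM_TYPE assignment only;
    # commented-out alternatives (lines starting with "#") are left alone.
    sed "s/^\\\$CFG_HMM_TYPE[[:space:]]*=[[:space:]]*'[^']*'/\$CFG_HMM_TYPE = '.cont.'/" \
        "$cfg" > "$cfg.tmp" && mv "$cfg.tmp" "$cfg"
}

# Example call inside a training directory:
#   use_continuous_model etc/sphinx_train.cfg
```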
training
mkdir /tmp/test
cd /tmp/test
sphinxtrain -t test setup
Now we have:
.
└── etc
    ├── feat.params
    ├── sphinx_train.cfg
    ├── test.dic
    └── test.filler
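To confirm that setup produced the layout listed above, a quick check could look like the sketch below. The function name `check_setup` is hypothetical, and the expected file list is taken from the listing on this page, not from the SphinxTrain documentation.

```shell
#!/bin/sh
# Hypothetical helper: report files missing after `sphinxtrain -t <name> setup`.
# Usage: check_setup <training-dir> <task-name>
check_setup() {
    dir=$1
    name=$2
    missing=0
    # File list as shown on this page for the task "test".
    for f in feat.params sphinx_train.cfg "$name.dic" "$name.filler"; do
        if [ ! -f "$dir/etc/$f" ]; then
            echo "missing: etc/$f"
            missing=1
        fi
    done
    # Exit status 0 if everything is present, 1 otherwise.
    return $missing
}

# Example call for the directory created above:
#   check_setup /tmp/test test
```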
adjust sphinx_train.cfg TODO