simon

A speech model basically consists of two parts:

Language model: Describes all existing words and which sentences are grammatically correct. It is created from your active scenario.
Acoustic model: Describes how words sound. It is built either solely from your voice recordings (training) or with the help of a base model.

The SPHINX backend uses:

PocketSphinx recognizer
SphinxTrain model generation backend

Models:

US English HUB4 WSJ SPHINX model. This model is part of pocketsphinx-extra, where it is called "hub4_wsj_sc_3s_8k.cd_semi_5000".

sudo apt install simon
 
sudo apt install libsphinxbase3 python-sphinxbase sphinxbase-utils pocketsphinx sphinxtrain

Edit:

/usr/lib/sphinxtrain/etc/sphinx_train.cfg

$CFG_CD_TRAIN = 'no';
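
The same change can be scripted. The following is a minimal sketch, assuming the file contains a line of the form `$CFG_CD_TRAIN = '...';` at the start of a line. It works on a throwaway sample file rather than the real config, and the starting value `yes` in the sample is an assumption for illustration.

```shell
# Sketch: switch $CFG_CD_TRAIN to 'no' with sed.
# Works on a sample file in a temp dir; to apply it for real, point cfg at
# your sphinx_train.cfg (root permissions may be needed under /usr/lib).
workdir=$(mktemp -d)
cfg="$workdir/sphinx_train.cfg"

# Sample content; the starting value 'yes' is assumed for illustration.
printf '%s\n' "\$CFG_CD_TRAIN = 'yes';" > "$cfg"

# Replace whatever value is currently set with 'no'; keep a .bak backup.
sed -i.bak "s/^\([\$]CFG_CD_TRAIN[[:space:]]*=[[:space:]]*\)'[^']*';/\1'no';/" "$cfg"

# Show the resulting line.
grep 'CFG_CD_TRAIN' "$cfg"
```

The backup copy (`sphinx_train.cfg.bak`) makes it easy to restore the previous value once there is enough data for context-dependent training.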

If you have only a small amount of training data and train without context-dependent models, use a continuous model instead of a semi-continuous one, and this error will be solved.

This warning is caused by the fact that you disabled context-dependent training while using a semi-continuous model. You can ignore it until you have more data and can use context-dependent training.
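
The combination described above can be checked before training starts. This is a minimal sketch, assuming the model type is set through a `$CFG_HMM_TYPE` line with values such as `.cont.` or `.semi.` (verify this against your own sphinx_train.cfg). The helper name `check_cfg` is hypothetical.

```shell
# Sketch: warn when a semi-continuous model is combined with disabled
# context-dependent training. check_cfg is a hypothetical helper name.
check_cfg() {
    cfg=$1
    # Read the active (uncommented) values. Both variables are assumed to
    # start a line in the form: $NAME = 'value';
    hmm=$(sed -n "s/^[\$]CFG_HMM_TYPE[[:space:]]*=[[:space:]]*'\([^']*\)'.*/\1/p" "$cfg" | tail -n 1)
    cd_train=$(sed -n "s/^[\$]CFG_CD_TRAIN[[:space:]]*=[[:space:]]*'\([^']*\)'.*/\1/p" "$cfg" | tail -n 1)

    if [ "$hmm" = ".semi." ] && [ "$cd_train" = "no" ]; then
        echo "warning: semi-continuous model without context-dependent training"
        return 1
    fi
    echo "ok: CFG_HMM_TYPE=$hmm CFG_CD_TRAIN=$cd_train"
}
```

It would be called as `check_cfg etc/sphinx_train.cfg` from the training directory; a non-zero exit status signals the combination that triggers the warning.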

Training

mkdir /tmp/test
cd /tmp/test
 
sphinxtrain -t test setup

Now we have:

.
└── etc
    ├── feat.params
    ├── sphinx_train.cfg
    ├── test.dic
    └── test.filler
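
To confirm that the setup step produced the files listed above, a small check like the following can be used. It is only a sketch: `check_setup` is a hypothetical helper, and the expected file list is taken from the tree above.

```shell
# Sketch: report which of the expected setup files are missing.
# check_setup is a hypothetical helper; usage: check_setup DIR TASKNAME
check_setup() {
    dir=$1
    task=$2
    missing=0
    for f in feat.params sphinx_train.cfg "$task.dic" "$task.filler"; do
        if [ ! -f "$dir/etc/$f" ]; then
            echo "missing: etc/$f"
            missing=1
        fi
    done
    # Exit status 0 when everything is present, 1 otherwise.
    return "$missing"
}
```

For the example above it would be called as `check_setup /tmp/test test`.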

TODO: adjust sphinx_train.cfg