TY - GEN
T1 - HMM-based speech recognition using decision trees instead of GMMs
AU - Teunen, Remco
AU - Akamine, Masami
PY - 2007
Y1 - 2007
AB - In this paper, we experiment with decision trees as replacements for Gaussian mixture models to compute the observation likelihoods for a given HMM state in a speech recognition system. Decision trees have several advantageous properties: they impose no restrictions on the number or types of features, and they perform feature selection automatically. In fact, because decision tree evaluation is conditional, the subset of features actually used during recognition depends on the input signal. Automatic state-tying can also be incorporated directly into the acoustic model, where it too becomes a function of the input signal. Experimental results on the Aurora 2 speech database show that a system using decision trees offers state-of-the-art performance, even without taking advantage of its full potential.
KW - Acoustic modeling
KW - Decision trees
KW - Likelihood computation
KW - Probability estimation
KW - Speech recognition
UR - http://www.scopus.com/inward/record.url?scp=56149112746&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=56149112746&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:56149112746
SN - 9781605603162
T3 - International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007
SP - 617
EP - 620
BT - International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007
PB - Unavailable
T2 - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007
Y2 - 27 August 2007 through 31 August 2007
ER -