Abstract
This paper presents recent developments at our site toward speech recognition using decision tree based acoustic models. Previously, robust decision trees have been shown to achieve better performance compared to standard Gaussian mixture model (GMM) acoustic models. This was achieved by converting hard questions (decisions) of a standard tree into soft questions using sigmoid function. In this paper, we report our work where soft-decision trees are trained from scratch. These soft-decision trees are shown to yield better speech recognition accuracy compared to standard GMM acoustic models on Aurora digit recognition task.
Original language | English |
---|---|
Pages (from-to) | 940-943 |
Number of pages | 4 |
Journal | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Publication status | Published - 2008 |
Event | INTERSPEECH 2008 - 9th Annual Conference of the International Speech Communication Association - Brisbane, QLD, Australia Duration: 2008 Sept 22 → 2008 Sept 26 |
Keywords
- Decision trees
- Gaussian mixture model (GMM)
- Hidden Markov model (HMM)
- Speech recognition