Learning lexicons from spoken utterances based on statistical model selection

Ryo Taguchi, Naoto Iwahashi, Takashi Nose, Kotaro Funakoshi, Mikio Nakano

Research output: Contribution to journalConference articlepeer-review

9 Citations (Scopus)

Abstract

This paper proposes a method for the unsupervised learning of lexicons from pairs of a spoken utterance and an object as its meaning without any a priori linguistic knowledge other than a phoneme acoustic model. In order to obtain a lexicon, a statistical model of the joint probability of a spoken utterance and an object is learned based on the minimum description length principle. This model consists of a list of word phoneme sequences and three statistical models: the phoneme acoustic model, a word-bigram model, and a word meaning model. Experimental results show that the method can acquire acoustically, grammatically and semantically appropriate words with about 85% phoneme accuracy.

Original languageEnglish
Pages (from-to)2731-2734
Number of pages4
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication statusPublished - 2009
Event10th Annual Conference of the International Speech Communication Association, INTERSPEECH 2009 - Brighton, United Kingdom
Duration: 2009 Sept 62009 Sept 10

Keywords

  • Language acquisition
  • Lexical learning
  • Model selection

Fingerprint

Dive into the research topics of 'Learning lexicons from spoken utterances based on statistical model selection'. Together they form a unique fingerprint.

Cite this