Automatic evaluation system of english prosody for Japanese Learner's Speech

Motoyuki Suzuki, Tatsuki Konno, Akinori Ito, Shozo Makino

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Prosody plays an important role in speech communication between humans. Several computer-assisted language learning (CALL) systems with utterance evaluation have been developed so far; however, accuracy of their prosody evaluation is still poor. In this paper, we develop new methods to evaluate rhythm and intonation of English sentence uttered by Japanese learners. The new points of our work are that (1) new prosodic features are added to traditional features, and (2) word importance factors are introduced in the calculation of intonation score. The word importance score is automatically estimated using the ordinary least squares method, and optimized based on word clusters generated by a decision tree. The rhythm evaluator uses two acoustic features, time duration ratio of each word and normalized log-power. From the experiments, correlation coefficient (±1.0 denotes the best correlation) between the rhythm score given by native speakers and the system was -0.55. On the other hand, a conventional feature (pause insertion error rate) gave only -0.11. The intonation evaluator uses four acoustic features, pitch, normalized log-power, and first-order regression coefficients of those two features. Prom the experiments, correlation coefficient between the intonation score given by native speakers and the system was 0.45.

Original languageEnglish
Title of host publicationIMSCI 2007 - International Multi-Conference on Society, Cybernetics and Informatics, Proceedings
PublisherInternational Institute of Informatics and Systemics, IIIS
Pages48-53
Number of pages6
ISBN (Print)1934272116, 9781934272114
Publication statusPublished - 2007
EventInternational Multi-Conference on Society, Cybernetics and Informatics, IMSCI 2007 - Orlando, FL, United States
Duration: 2007 Jul 122007 Jul 15

Publication series

NameIMSCI 2007 - International Multi-Conference on Society, Cybernetics and Informatics, Proceedings
Volume1

Conference

ConferenceInternational Multi-Conference on Society, Cybernetics and Informatics, IMSCI 2007
Country/TerritoryUnited States
CityOrlando, FL
Period07/7/1207/7/15

Keywords

  • Computer assisted language learning system
  • Decision tree
  • Intonation
  • Prosody evaluation
  • Rhythm

Fingerprint

Dive into the research topics of 'Automatic evaluation system of english prosody for Japanese Learner's Speech'. Together they form a unique fingerprint.

Cite this