Effects of intermodal timing difference and speed difference on intelligibility of auditory-visual speech in younger and older adults

Akihiro Tanaka, Shuichi Sakamoto, Komi Tsumura, Yôiti Suzuki

Research output: Contribution to conferencePaperpeer-review

2 Citations (Scopus)

Abstract

Previous studies have revealed a temporal window during which human observers perceive physically desynchronized auditory and visual signals as synchronous. This study investigated effects of intermodal timing differences and speed differences on intelligibility of auditory-visual speech. We used 20 minimal pairs of Japanese four-mora words such as “mi-zu-a-ge” (catch landing) versus “mi-zu-a-me” (starch syrup) and administered intelligibility tests. Words were presented under visual-only, auditory-only, and auditory-visual (AV) conditions. Two types of AV conditions were used: asynchronous and expansion conditions. In asynchronous (i.e. timing difference) conditions, the audio lag was 0-400 ms. In expansion (i.e. speed difference) conditions, the auditory signal was time-expanded while the visual signal was kept at the original speed. The amount of expansion was 0-400 ms. Results showed that the word intelligibility declined as the timing difference and speed difference increased. Results of AV benefit (i.e. the superiority of AV performance over auditory-only performance) revealed that the AV benefit at the end of words declined as the speed difference increased, although it did not decline as timing difference increased. These results suggest that intermodal lag recalibration requires a constant timing difference between auditory and visual signals. Older adults recalibrated neither the timing difference nor the speed difference. These results might be useful for design of a multimodal speech-rate conversion system.

Original languageEnglish
Publication statusPublished - 2007
Event2007 International Conference on Auditory-Visual Speech Processing, AVSP 2007 - Hilvarenbeek, Netherlands
Duration: 2007 Aug 312007 Sept 3

Conference

Conference2007 International Conference on Auditory-Visual Speech Processing, AVSP 2007
Country/TerritoryNetherlands
CityHilvarenbeek
Period07/8/3107/9/3

Keywords

  • asynchrony
  • intelligibility
  • older listeners
  • speech-rate conversion

ASJC Scopus subject areas

  • Language and Linguistics
  • Speech and Hearing
  • Otorhinolaryngology

Fingerprint

Dive into the research topics of 'Effects of intermodal timing difference and speed difference on intelligibility of auditory-visual speech in younger and older adults'. Together they form a unique fingerprint.

Cite this