TY - GEN
T1 - Directional Early-to-Late Energy Ratios to Quantify Clarity
T2 - 2021 Immersive and 3D Audio: From Architecture to Automotive, I3DA 2021
AU - Campos, Alexis
AU - Sakamoto, Shuichi
AU - Salvador Castaneda, Cesar Daniel
N1 - Funding Information:
This study was partly supported by Universidad Peruana Unión (UPeU), Lima, Perú, and JSPS KAKENHI Grant Number 19H04145, Japan.
Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Early-to-late energy ratios (ELER) are used to quantify speech intelligibility and music clarity in acoustic spaces from measurements of omnidirectional room impulse responses (RIR). Nowadays, the capture of directional RIRs is possible with spherical microphone arrays and the spherical Fourier transform. These tools are thus motivating the enhancement of omnidirectional metrics and the search for new metrics to quantify directional features of sound. This research explores a directional metric of intelligibility and clarity based on ELERs of directional RIRs. The early-to-late transition times are chosen according to the content: 50 ms for speech and 80 ms for music. The proposed metrics can therefore be interpreted as directional versions of the standard clarity indexes of speech (C50) and music (C80). Directional RIRs were captured at many seats in a large auditorium using a first-order ambisonics microphone. Supporting acoustic simulations of a cuboid room with a second-order ambisonics microphone were also used. Directional ELERs were calculated in the octave bands within the operation range of the microphones. Three directional ELER patterns were identified: an omnidirectional pattern, a dipole pointing forward and backward, and a beam pointing towards the source.
AB - Early-to-late energy ratios (ELER) are used to quantify speech intelligibility and music clarity in acoustic spaces from measurements of omnidirectional room impulse responses (RIR). Nowadays, the capture of directional RIRs is possible with spherical microphone arrays and the spherical Fourier transform. These tools are thus motivating the enhancement of omnidirectional metrics and the search for new metrics to quantify directional features of sound. This research explores a directional metric of intelligibility and clarity based on ELERs of directional RIRs. The early-to-late transition times are chosen according to the content: 50 ms for speech and 80 ms for music. The proposed metrics can therefore be interpreted as directional versions of the standard clarity indexes of speech (C50) and music (C80). Directional RIRs were captured at many seats in a large auditorium using a first-order ambisonics microphone. Supporting acoustic simulations of a cuboid room with a second-order ambisonics microphone were also used. Directional ELERs were calculated in the octave bands within the operation range of the microphones. Three directional ELER patterns were identified: an omnidirectional pattern, a dipole pointing forward and backward, and a beam pointing towards the source.
KW - Ambisonics
KW - Clarity
KW - Early-to-late energy ratio
KW - Room impulse response
KW - Spherical Fourier transform
UR - http://www.scopus.com/inward/record.url?scp=85123179816&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123179816&partnerID=8YFLogxK
U2 - 10.1109/I3DA48870.2021.9610935
DO - 10.1109/I3DA48870.2021.9610935
M3 - Conference contribution
AN - SCOPUS:85123179816
T3 - 2021 Immersive and 3D Audio: From Architecture to Automotive, I3DA 2021
BT - 2021 Immersive and 3D Audio
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 8 September 2021 through 10 September 2021
ER -