TY - JOUR

T1 - Vector quantization of speech signals using principal component analysis

AU - Kohata, Minoru

AU - Sone, Hideaki

AU - Echigo, Hiroshi

AU - Takagis, Tasuku

PY - 1987

Y1 - 1987

N2 - A new method of designing a vector quantizer is presented for bandwidth compression of speech signals, and some experimental results are shown. In this method, spectral parameters are extracted first from the DFT spectrum of input speech signals using a psychological frequency scale‐the so‐called Mel scale. This parameter is called the Mel‐scaled spectrum. The number of Mel‐scaled spectrum is reduced and the cepstrum of this parameter is calculated. Then this Mel‐scaled cepstrum is vector quantized and the codebook‐vector of the vector quantizer is determined by the algorithm using principal component analysis. With this algorithm, codebook‐vectors can be designed considering the statistical characteristics of the Mel‐scaled cepstrum. Also, the reduction of parameters by Mel‐scale can decrease the size of the codebook memory without greatly degrading the synthesized speech quality. Using the forementioned method two codebooks are designed: one contains 256 vectors, and the other contains 2048 vectors. The quantization error is compared with those designed by the well‐known LBG algorithm. The simulation results show that the codebooks designed by the proposed method present less quantization error and degradation of synthesized speech quality than those designed by LBG algorithm.

AB - A new method of designing a vector quantizer is presented for bandwidth compression of speech signals, and some experimental results are shown. In this method, spectral parameters are extracted first from the DFT spectrum of input speech signals using a psychological frequency scale‐the so‐called Mel scale. This parameter is called the Mel‐scaled spectrum. The number of Mel‐scaled spectrum is reduced and the cepstrum of this parameter is calculated. Then this Mel‐scaled cepstrum is vector quantized and the codebook‐vector of the vector quantizer is determined by the algorithm using principal component analysis. With this algorithm, codebook‐vectors can be designed considering the statistical characteristics of the Mel‐scaled cepstrum. Also, the reduction of parameters by Mel‐scale can decrease the size of the codebook memory without greatly degrading the synthesized speech quality. Using the forementioned method two codebooks are designed: one contains 256 vectors, and the other contains 2048 vectors. The quantization error is compared with those designed by the well‐known LBG algorithm. The simulation results show that the codebooks designed by the proposed method present less quantization error and degradation of synthesized speech quality than those designed by LBG algorithm.

UR - http://www.scopus.com/inward/record.url?scp=0023345528&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0023345528&partnerID=8YFLogxK

U2 - 10.1002/ecja.4410700502

DO - 10.1002/ecja.4410700502

M3 - Article

AN - SCOPUS:0023345528

SN - 8756-6621

VL - 70

SP - 16

EP - 26

JO - Electronics and Communications in Japan, Part I: Communications (English translation of Denshi Tsushin Gakkai Ronbunshi)

JF - Electronics and Communications in Japan, Part I: Communications (English translation of Denshi Tsushin Gakkai Ronbunshi)

IS - 5

ER -