TY - JOUR
T1 - Vector quantization of speech signals using principal component analysis
AU - Kohata, Minoru
AU - Sone, Hideaki
AU - Echigo, Hiroshi
AU - Takagi, Tasuku
PY - 1987
Y1 - 1987
N2 - A new method of designing a vector quantizer for bandwidth compression of speech signals is presented, together with experimental results. In this method, spectral parameters are first extracted from the DFT spectrum of the input speech signal using a psychological frequency scale, the so‐called Mel scale. These parameters are called the Mel‐scaled spectrum. The number of Mel‐scaled spectral parameters is reduced, and their cepstrum is calculated. This Mel‐scaled cepstrum is then vector quantized, and the codebook vectors of the vector quantizer are determined by an algorithm based on principal component analysis. With this algorithm, codebook vectors can be designed taking into account the statistical characteristics of the Mel‐scaled cepstrum. In addition, the reduction of parameters by the Mel scale can decrease the size of the codebook memory without greatly degrading the synthesized speech quality. Using the aforementioned method, two codebooks are designed: one contains 256 vectors, and the other contains 2048 vectors. Their quantization error is compared with that of codebooks designed by the well‐known LBG algorithm. The simulation results show that the codebooks designed by the proposed method yield lower quantization error and less degradation of synthesized speech quality than those designed by the LBG algorithm.
AB - A new method of designing a vector quantizer for bandwidth compression of speech signals is presented, together with experimental results. In this method, spectral parameters are first extracted from the DFT spectrum of the input speech signal using a psychological frequency scale, the so‐called Mel scale. These parameters are called the Mel‐scaled spectrum. The number of Mel‐scaled spectral parameters is reduced, and their cepstrum is calculated. This Mel‐scaled cepstrum is then vector quantized, and the codebook vectors of the vector quantizer are determined by an algorithm based on principal component analysis. With this algorithm, codebook vectors can be designed taking into account the statistical characteristics of the Mel‐scaled cepstrum. In addition, the reduction of parameters by the Mel scale can decrease the size of the codebook memory without greatly degrading the synthesized speech quality. Using the aforementioned method, two codebooks are designed: one contains 256 vectors, and the other contains 2048 vectors. Their quantization error is compared with that of codebooks designed by the well‐known LBG algorithm. The simulation results show that the codebooks designed by the proposed method yield lower quantization error and less degradation of synthesized speech quality than those designed by the LBG algorithm.
UR - http://www.scopus.com/inward/record.url?scp=0023345528&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0023345528&partnerID=8YFLogxK
U2 - 10.1002/ecja.4410700502
DO - 10.1002/ecja.4410700502
M3 - Article
AN - SCOPUS:0023345528
SN - 8756-6621
VL - 70
SP - 16
EP - 26
JO - Electronics and Communications in Japan, Part I: Communications (English translation of Denshi Tsushin Gakkai Ronbunshi)
JF - Electronics and Communications in Japan, Part I: Communications (English translation of Denshi Tsushin Gakkai Ronbunshi)
IS - 5
ER -