Wideband-speech APVQ coding from 16 to 32 KBPS
Document typeConference report
Rights accessOpen Access
This paper describes a coding scheme for broadband speech (sampling frequency 16KHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization and Adaptive Prediction as it is represented in Fig.1. Speech signal is split in 16 subbands by means of a QMF filter bank and so every subband is 500Hz wide. This APVQ encoder can be seen either as a vectorial extension of a conventional ADPCM encoder or as a scalar Subband AVPC encoder ,. In this scheme, signal vector is formed with one sample of the normalized prediction error signal coming from different subbands and then it is vector quantized. Prediction error signal is normalized by its gain and normalized prediction error signal is the input of the VQ and therefore an adaptive Gain-Shape VQ is considered. This APVQ Encoder combines the advantages of Scalar Prediction and those of Vector Quantization. We evaluate wideband speech coding in the range from 1 to 2 bits/sample.
CitationSalavedra, J. Wideband-speech APVQ coding from 16 to 32 KBPS. A: European Conference on Speech Communication and Technology. "EUROSPEECH'97: 5th European Conference on Speech Communication and Technology: Rhodes, Greece: 22-25 September 1997". Rhodes: 1997, p. 1511-1514.
|Wideband-speech APVQ coding from 16 to 32 KBPS.pdf||405.4Kb||View/