Codificación APVQ de voz en banda ancha para velocidades entre 16 y 32 KBPS
Document typeConference report
Rights accessOpen Access
This paper describes a coding scheme for broadband speech (sampling frequency 16KHz). We present a wideband speech encoder called APVQ (Adaptive Predictive Vector Quantization). It combines Subband Coding, Vector Quantization and Adaptive Prediction as it is represented in Fig. I. Speech signal is split in 16 subbands by means of a QMF filter bank and so every subband is 500Hz wide. This APVQ encoder can be seen as a vectorial extension of a conventional ADPCM encoder. In this scheme, signal vector is formed with one sample of the normalized prediction error signal coming from different subbands and then it is vector quantized. Prediction error signal is normalized by its gain and normalized prediction error signal is the input of the VQ and therefore an adaptive Gain-Shape VQ is considered. This APVQ Encoder combines the advantages of Scalar Prediction and those of Vector Quantization. We evaluate wideband speech coding in the range from 1 to 2 bits/sample, that leads to a coding rate from 16 to 32 kbps.
CitationSalavedra, J., Masgrau, E. Codificación APVQ de voz en banda ancha para velocidades entre 16 y 32 KBPS. A: Simposium Nacional de la Unión Científica Internacional de Radio. "URSI 1996: XI Simposium Nacional de la Unión Científica Internacional de Radio: Madrid: 18 al 20 septiembre de 1996". Madrid: 1996, p. II-482-II-485.