On the decorrelation of filter-bank energies in speech recognition

Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Gorricho Moreno, Monica

Document type: Conference paper

Publication date: 1995

Access conditions: Open access

Attribution-NonCommercial-NoDerivs 3.0 Spain

Except where otherwise noted, the content of this work is licensed under a Creative Commons license: Attribution-NonCommercial-NoDerivs 3.0 Spain

Abstract

Cepstral coefficients are widely used in speech recognition. In this paper, we claim that they are not the best way of representing the spectral envelope, at least for some usual speech recognition systems. In fact, cepstrum has several disadvantages: poor physical meaning, need of transformation, and low capacity of adaptation to some recognition systems. In this paper, we propose a new representation that significantly outperforms both mel-cepstrum and LPC-cepstrum techniques in both recognition rate and computational cost. It consists of filtering the frequency sequence of filter-bank energies with an extremely simple filter that equalizes the variance of the cepstral coefficients. Excellent results of the new technique using a continuous observation density HMM recognition system and two very different recognition tasks, connected digits and phone recognition, are presented.
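
The idea of filtering the frequency sequence of filter-bank energies can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state which filter is used, so the tap values (corresponding to H(z) = z - z^{-1}), the function name `frequency_filter`, the zero padding at the band edges, and the sample energies are all assumptions. The filter runs along the band index within one frame, not along time. Because convolving along the band index corresponds to weighting the cepstral coefficients, a suitable choice of filter can reshape their variance.

```python
import math


def frequency_filter(log_energies, taps=(1.0, 0.0, -1.0)):
    """Filter one frame of log filter-bank energies along the band axis.

    `taps` weight bands k+1, k and k-1 respectively. The default is an
    assumed, illustrative filter H(z) = z - z^{-1}; the source abstract
    does not specify the filter. Bands beyond the edges count as zero.
    """
    n = len(log_energies)
    # Zero-pad one band on each side so the edge bands have neighbours.
    padded = [0.0] + list(log_energies) + [0.0]
    ahead, centre, behind = taps
    return [
        ahead * padded[k + 2] + centre * padded[k + 1] + behind * padded[k]
        for k in range(n)
    ]


# Hypothetical energies of a 6-band filter bank for a single frame.
frame = [math.log(e) for e in (4.0, 8.0, 16.0, 8.0, 4.0, 2.0)]
filtered = frequency_filter(frame)
```

With the default taps, each output value is the difference between the log energies of the two neighbouring bands, so the result has the same length as the input frame and needs no further transformation before being passed to a recognizer.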

Citation: Nadeu, C., Hernando, J., Gorricho, M. On the decorrelation of filter-bank energies in speech recognition. In: European Conference on Speech Communication and Technology. "EUROSPEECH '95: 4th European Conference on Speech Communication and Technology: Madrid, Spain: 18-21 September 1995". Madrid: 1995, p. 1381-1384.

URI: http://hdl.handle.net/2117/88731

ISBN: 1018-4074
