Feature decorrelation methods in speech recognition. A comparative study

Batlle Mont, Eloi; Nadeu Camprubí, Climent; Rodríguez Fonollosa, José Adrián

i98_0473.pdf (152,8Kb)

Document type: Conference paper

Publication date: 1998

Publisher: International Speech Communication Association (ISCA)

Access conditions: Open access

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication or transformation is prohibited without the authorization of the rights holder.

Abstract

In this paper we study several decorrelation methods for the features used in speech recognition and compare their performance through tests on a speech database. First, we study Principal Component Analysis (PCA). PCA extracts the dimensions along which the data vary the most, and thus allows us to reduce the dimensionality of the data without significant loss of performance. The second transform we study is the Discrete Cosine Transform (DCT). As will be shown, it is an approximation of PCA. By applying this transform to the filter-bank energy (FBE) parameters we obtain the MFCC coefficients. A further step is taken with Linear Discriminant Analysis (LDA), which not only reduces the dimensionality of the problem but also discriminates among classes to reduce the confusion error. The last method we study is Frequency Filtering (FF). This method consists of a linear filtering of the frequency sequence of the log FBE that both decorrelates the coefficients and equalizes their variance.
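As a rough illustration of the two fixed transforms named in the abstract, the sketch below applies a DCT and a frequency filter to one frame of log FBE values. It is not taken from the paper: the function names, the example frame, the unnormalised DCT-II form, and the filter H(z) = z - z^-1 with zero-padded ends are all assumptions made for illustration. PCA and LDA are left out because they require statistics estimated from training data.

```python
import math


def dct_ii(log_fbe, n_coeffs):
    """Unnormalised DCT-II of one frame of log filter-bank energies.

    Returns the first n_coeffs cepstral-like coefficients.
    """
    n_bands = len(log_fbe)
    return [
        sum(x * math.cos(math.pi * k * (i + 0.5) / n_bands)
            for i, x in enumerate(log_fbe))
        for k in range(n_coeffs)
    ]


def frequency_filter(log_fbe):
    """Filter the band sequence with the assumed H(z) = z - z^-1.

    Each output is the difference between the next and the previous band,
    with zeros assumed beyond both ends of the sequence.
    """
    padded = [0.0] + list(log_fbe) + [0.0]
    return [padded[i + 2] - padded[i] for i in range(len(log_fbe))]


# Hypothetical log FBE values for a single frame (six bands).
frame = [2.0, 2.5, 3.0, 3.5, 4.0, 4.5]

cepstrum = dct_ii(frame, 4)         # keeps 4 of 6 coefficients
filtered = frequency_filter(frame)  # same length as the input frame
```

The DCT output can be truncated, which reduces dimensionality, whereas the filtered sequence keeps one value per band and stays in the frequency domain.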

Citation: Batlle, E., Nadeu, C., Fonollosa, J. A. R. Feature decorrelation methods in speech recognition. A comparative study. In: International Conference on Spoken Language Processing. "ICSLP 98: the 5th International Conference on Spoken Language Processing; incorporating the 7th Australian International Speech Science and Technology Conference; Sydney Convention Centre, Sydney, Australia, 30th November-4th December 1998". Baixas: International Speech Communication Association (ISCA), 1998.

URI: http://hdl.handle.net/2117/117385

ISBN: 1-876346-17-5

Publisher version: https://www.isca-speech.org/archive/archive_papers/icslp_1998/i98_0473.pdf
