Improving the robustness of the usual fbe-based asr front-end

Nadeu Camprubí, Climent; Macho, D; Hernando Pericás, Francisco Javier

Visualitza/Obre

tecn_habla.pdf (65,70Kb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Nadeu Camprubí, Climent

Macho, D

Hernando Pericás, Francisco Javier

Tipus de documentText en actes de congrés

Data publicació2000

EditorMergablum

Condicions d'accésAccés obert

Attribution-NonCommercial-NoDerivs 3.0 Spain

Llevat que s'hi indiqui el contrari, els continguts d'aquesta obra estan subjectes a la llicència de Creative Commons : Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya

Abstract

All speech recognition systems require some form of signal representation that parametrically models the temporal evolution of the spectral envelope. Current parameterizations involve, either explicitly or implicitly, a set of energies from frequency bands which are often distributed in a mel scale. The computation of those filterbank energies (FBE) always includes smoothing of basic spectral measurements and non-linear amplitude compression. A variety of linear transformations are typically applied to this time-frequency representation prior to the Hidden Markov Model (HMM) pattern-matching stage of recognition. In the paper, we will discuss some robustness issues involved in both the computation of the FBEs and the posterior linear transformations, presenting alternative techniques that can improve robustness in additive noise conditions. In particular, the root non-linearity, a voicing-dependent FBE computation technique and a time&frequency filtering (tiffing) technique will be considered. Recognition results for the Aurora database will be shown to illustrate the potential application of these alternatives techniques for enhancing the robustness of speech recognition systems.

CitacióNadeu, C., Macho, D., Hernando, J. Improving the robustness of the usual fbe-based asr front-end. A: Jornadas en Tecnología del Habla. "Las tecnologías del Habla". Sevilla: Mergablum, 2000, p. 1-20.

URIhttp://hdl.handle.net/2117/103813

ISBN84-95118-58-0

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
tecn_habla.pdf		65,70Kb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Improving the robustness of the usual fbe-based asr front-end

Visualitza/Obre

Explora