Improving the robustness of the usual fbe-based asr front-end
Visualitza/Obre
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/103813
Tipus de documentText en actes de congrés
Data publicació2000
EditorMergablum
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
All speech recognition systems require some form of signal representation that parametrically models the
temporal evolution of the spectral envelope. Current parameterizations involve, either explicitly or implicitly, a
set of energies from frequency bands which are often distributed in a mel scale. The computation of those filterbank
energies (FBE) always includes smoothing of basic spectral measurements and non-linear amplitude
compression. A variety of linear transformations are typically applied to this time-frequency representation prior
to the Hidden Markov Model (HMM) pattern-matching stage of recognition. In the paper, we will discuss some
robustness issues involved in both the computation of the FBEs and the posterior linear transformations,
presenting alternative techniques that can improve robustness in additive noise conditions. In particular, the root
non-linearity, a voicing-dependent FBE computation technique and a time&frequency filtering (tiffing)
technique will be considered. Recognition results for the Aurora database will be shown to illustrate the potential
application of these alternatives techniques for enhancing the robustness of speech recognition systems.
CitacióNadeu, C., Macho, D., Hernando, J. Improving the robustness of the usual fbe-based asr front-end. A: Jornadas en Tecnología del Habla. "Las tecnologías del Habla". Sevilla: Mergablum, 2000, p. 1-20.
ISBN84-95118-58-0
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
tecn_habla.pdf | 65,70Kb | Visualitza/Obre |