Discriminative weighting of dynamic feautres in continuous-density hidden Markov models for word recognition
Document typeConference report
Rights accessOpen Access
Speech dynamic features, which provide smoothed estimates of the derivatives of the spectral parameter trajectories in the current frame, are routinely used in current speech recognition systems in combination with short-term (static) spectral features. The aim of this paper is to propose a method to automatically estimate the optimum ponderation of static and dynamic features in a speech recognition system. The recognition system considered in this paper is based on Continuous-Density Hidden Markov Modelling (CDHMM), widely used in speech recognition. Our approach consists basically in 1) adding two new parameters for each state of each model that weight both kinds of speech features, and 2) estimating those parameters by means of a discriminative training algorithm that minimizes the recognition error using the recently proposed Generalized Probabilistic Descent (GPO) method. Experimental results in speaker independent digit recognition show an important increase of recognition accuracy.
CitationHernando, J. Discriminative weighting of dynamic feautres in continuous-density hidden Markov models for word recognition. A: Spanish Symposium on Pattern recognition and Image Analysis. "VI Spanish Symposium on Pattern Recognition and Image Anlalysis: Córdoba: 3-7 April 1995". Córdoba: 1995, p. 293-300.