Study of subword units for Spanish speech recognition
Document type: Conference proceedings paper
Publication date: 1995
Publisher: ESCA - J.M. PARDO, E. ENRIQUEZ, J. ORTEGA, J. FERREIROS GTM-UPM
Access conditions: Open access
This paper studies different sets of subword speech units for recognizing Spanish. In particular, it compares context-dependent phones, syllables and demisyllables. It shows that context-dependent units can reduce the error by 15% with respect to context-independent phones. The benefit of merging similar contexts when there is not enough training data is also validated. The paper also studies the behavior of syllable-based units: syllables give performance similar to triphones, whereas demisyllables give performance similar to right (or left) context-dependent phones. However, when different types of units are used, context-dependent phones give the best results. Results achieved with these sets of units exceed 70% in acoustic-phonetic decoding of Spanish speech.
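As a rough illustration of the units compared in the abstract, the following Python sketch expands a phone sequence into right-context-dependent phones and triphones, and backs off rarely seen triphones to their right-context form. The `left-phone+right` labels, the `sil` boundary symbol, the function names and the count threshold are all assumptions made for this example; the abstract does not specify the paper's notation or its criterion for merging contexts.

```python
from collections import Counter

BOUNDARY = "sil"  # assumed word-boundary symbol, not taken from the paper


def to_right_context(phones):
    """Label each phone with its right neighbour: 'phone+right'."""
    padded = list(phones) + [BOUNDARY]
    return [f"{padded[i]}+{padded[i + 1]}" for i in range(len(phones))]


def to_triphones(phones):
    """Label each phone with both neighbours: 'left-phone+right'."""
    padded = [BOUNDARY] + list(phones) + [BOUNDARY]
    return [
        f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
        for i in range(1, len(padded) - 1)
    ]


def merge_rare(triphones, counts, min_count):
    """Hypothetical merging rule: a triphone seen fewer than min_count
    times in training is replaced by its right-context unit, so all of
    its left contexts share one model."""
    merged = []
    for unit in triphones:
        if counts[unit] >= min_count:
            merged.append(unit)
        else:
            # Drop the left context: 'k-a+s' becomes 'a+s'.
            merged.append(unit.split("-", 1)[1])
    return merged


# Illustrative phone sequence for the Spanish word "casa".
phones = ["k", "a", "s", "a"]
triphones = to_triphones(phones)

# Made-up training counts, used only to exercise the back-off rule.
counts = Counter({"sil-k+a": 5, "k-a+s": 1, "a-s+a": 5})
print(to_right_context(phones))
print(triphones)
print(merge_rare(triphones, counts, min_count=2))
```

Backing off to a less specific unit is only one way to merge similar contexts; it is shown here because it needs nothing beyond occurrence counts.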
Citation: Bonafonte, A., Estany, R., Vives, E. Study of subword units for Spanish speech recognition. In: 4th European Conference on Speech Communication and Technology. "Proceedings of the 4th EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY". Madrid: ESCA - J.M. PARDO, E. ENRIQUEZ, J. ORTEGA, J. FERREIROS GTM-UPM, 1995, p. 1607-1610.
Publisher's version: http://www.isca-speech.org/archive/eurospeech_1995/e95_1607.html