The demiphone:an efficient subword unit for Continuous Speech Recognition
Visualitza/Obre
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/103116
Tipus de documentText en actes de congrés
Data publicació1997
EditorEditors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Editorial: WCL, University of Patras, Grece
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
In this paper we introduce the demiphone as a contextual phonetic unit for continuous speech recognition. A phone is divided into two parts: a left demiphone that accounts for the left side coarticulation and a right demiphone that copes with the right side context. This new unit discards the dependence between the effects of both side contexts, but provides a better training of the transition between phones. The demiphone can be seen as a heuristic clustering of states that allows a more smoothed training of hidden Markov models and additionally supplies a simple way to create unseen triphones. We report experimental evidence that demiphones outperform the usual combination of triphones, right-side and left-side biphones and monophones.
CitacióMariño, J.B., Nogueiras, A., Bonafonte, A. The demiphone:an efficient subword unit for Continuous Speech Recognition. A: 5th European Conference on Speech Communication and Technology (EUROSPEECH '97). "EUROSPEECH 1997: 5th European Conference on Speech Communication and Technology: Rhodes, Greece: September 22-25, 1997". Rhodes: Editors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Editorial: WCL, University of Patras, Grece, 1997, p. 1215-1218.
ISBN1018-4074
Versió de l'editorhttp://www.isca-speech.org/archive/eurospeech_1997/e97_1215.html
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
e97_1215.pdf | 377,7Kb | Visualitza/Obre |