Minimum confusibility training of context dependent demiphones
Visualitza/Obre
Estadístiques de LA Referencia / Recolecta
Inclou dades d'ús des de 2022
Cita com:
hdl:2117/103637
Tipus de documentText en actes de congrés
Data publicació1999
EditorG. Olaszy, G. Németh, K. Erdohegyi
Condicions d'accésAccés obert
Llevat que s'hi indiqui el contrari, els
continguts d'aquesta obra estan subjectes a la llicència de Creative Commons
:
Reconeixement-NoComercial-SenseObraDerivada 3.0 Espanya
Abstract
During the last years two different approaches have been widely used in order to improve the acoustic modeling in continuous speech recognition systems: discriminative training algorithms and context dependent subword units. However, while the use of each of these techniques leads to much better results than standard maximum likelihood trained phone models, their combination, i.e. discriminative training of context dependent units, has revealed to be a much more dificult task. In this paper we deal with minimum confusibility training of demiphones using TIMIT database. By applying this approach recently introduced by the authors, the string error rate in the recognition of TIDIGITS using demiphones is reduced some 24% with respect to maximum likelihood training. This improvement is added to the 8% reduction already provided by demiphones with respect to minimum confusibility trained phones.
CitacióNogueiras, A., Mariño, J.B. Minimum confusibility training of context dependent demiphones. A: European Conference on Speech Communication and Technology. "Sixth European Conference on Speech Communication and Technology (EUROSPEECH'99), Budapest, Hungary, September 5-9, 1999". Budapest: G. Olaszy, G. Németh, K. Erdohegyi, 1999, p. 2741-2744.
ISBN1018-4074
Fitxers | Descripció | Mida | Format | Visualitza |
---|---|---|---|---|
e99_2741.pdf | 155,4Kb | Visualitza/Obre |