Use this identifier to cite or link to this item:
http://hdl.handle.net/2117/16051
|
Item not available in open access due to publisher policy
| File | Description | Size | Format |
| 4jth_162.pdf | Main article | 233.98 kB | Adobe PDF |
|
| Citation: | Nogueiras, A. [et al.]. First experiments on an HMM based double layer framework for automatic continuous speech recognition. In: Jornadas en Tecnologia del Habla. "Actas de las IV Jornadas en Tecnologia del Habla". Zaragoza: 2006, p. 225-229. |
| Title: | First experiments on an HMM based double layer framework for automatic continuous speech recognition |
| Author: | Nogueiras Rodríguez, Albino; Casar López, Marta; Rodríguez Fonollosa, José Adrián; Caballero Galeote, Mónica |
| Date: | 2006 |
| Document type: | Conference lecture |
| Abstract: | The usual approach to automatic continuous speech recognition is what can be called the acoustic-phonetic modelling approach. In this approach, voice is considered to hold two different kinds of information: acoustic and phonetic. Acoustic information is represented by some kind of feature extraction from the voice signal, and phonetic information is extracted from the vocabulary of the task by means of a lexicon or some other procedure. The main assumption in this approach is that models can be constructed that capture the correlation between the two kinds of information.
The main limitation of acoustic-phonetic modelling in speech recognition is its poor treatment of the variability present at both the phonetic and the acoustic levels. In this paper, we propose the use of a slightly modified framework in which the usual acoustic-phonetic modelling is divided into two different layers: one closer to the voice signal, and the other closer to the phonetics of the sentence. By doing so, we expect an improvement in modelling accuracy, as well as better management of acoustic and phonetic variability. Experiments carried out so far, using a very simplified version of the proposed framework, show a significant improvement in recognition on a large-vocabulary continuous speech task, and represent a promising starting point for future research. |
| ISBN: | 84-96214-82-6 |
| URI: | http://hdl.handle.net/2117/16051 |
| Publisher's version: | http://jth2006.unizar.es/finals/4jth_162.pdf |
| Appears in collections: | Departament de Teoria del Senyal i Comunicacions. Conference papers and presentations; VEU - Grup de Tractament de la Parla. Conference papers and presentations; Others. Submissions from DRAC |
Reproduction, transformation, distribution, and public communication of this work are prohibited. Reproduction for private use is permitted in any case, provided that the copy made is not put to collective or profit-making use (art. 31.2 of Royal Legislative Decree 1/1996 of 12 April, approving the Consolidated Text of the Intellectual Property Law, http://bibliotecnica.upc.es/sepi/legislacio.asp).
For any use other than those permitted, contact: sepi@upc.edu