Browse by author "Nogueiras Rodríguez, Albino"
Showing items 31-43 of 43
Joint training of codebooks and acoustic models in automatic speech recognition using semi-continuous HMMs
Nogueiras Rodríguez, Albino; Caballero Galeote, Mónica; Mariño Acebal, José Bernardo (Eduardo Lleida Solano, 2006)
Conference lecture
Restricted access - publisher's policy
In this paper, three different techniques for building semicontinuous HMM-based speech recognisers are compared: the classical one, using Euclidean generated codebooks and independently trained acoustic models; jointly ...
Maximum likelihood based discriminative training of acoustic models
Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (European Speech Communication Association (ESCA), 1995)
Conference report
Open access
In this paper, a framework for discriminative training of acoustic models based on Generalised Probabilistic Descent (GPD) method is presented. The key feature of our proposal, Maximum Likelihood based Discriminative ...
Minimum confusibility training of context dependent demiphones
Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (G. Olaszy, G. Németh, K. Erdohegyi, 1999)
Conference report
Open access
During the last years two different approaches have been widely used in order to improve the acoustic modeling in continuous speech recognition systems: discriminative training algorithms and context dependent subword ...
Multi-dialectal Spanish speech recognition
Nogueiras Rodríguez, Albino; Caballero Galeote, Mónica; Moreno Bilbao, M. Asunción (Institute of Electrical and Electronics Engineers (IEEE), 2002)
Conference report
Restricted access - publisher's policy
Spanish is a global language, spoken in a big number of different countries with a big dialectal variability. This paper deals with the suitability of using a single multi-dialectal acoustic modeling for all the Spanish ...
Multidialectal acoustic modeling: a comparative study
Caballero Galeote, Mónica; Moreno Bilbao, M. Asunción; Nogueiras Rodríguez, Albino (2006)
Conference lecture
Restricted access - publisher's policy
In this paper, multidialectal acoustic modeling based on sharing data across dialects is addressed. A comparative study of different methods of combining data based on decision tree clustering algorithms is presented. ...
NaniBD: a set of tools for transcribing and validating speech databases
Nogueiras Rodríguez, Albino; Moreno Bilbao, M. Asunción (European Language Resources Association (ELRA), 1998)
Conference report
Open access
This paper describes NaniBD, a set of tools designed for transcribing and validating speech databases, developed at the Signal Processing Group (GPS) of the Department of Signal Theory and Communications of the Polytechnic ...
Ponderación ML de parámetros en un sistema de reconocimiento de palabras basado en CDHMM
Valverde Amador, Antonio Javier; Hernando Pericás, Francisco Javier; Nogueiras Rodríguez, Albino (1996)
Conference lecture
Open access
Speech dynamic features are routinely used in current speech recognition systems in combination with short-term (static) spectral features. The aim of this paper is to propose a method to automatically estimate the optimum ...
SETHOS: the UPC speech understanding system
Bonafonte Cávez, Antonio; Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino (H. Timothy Brummell, William Idsardi, Citation Delaware, New Castle, Delaware, 1996)
Conference lecture
Restricted access - publisher's policy
In EuroSpeech'95, the authors presented the first version of Sethos, the speech understanding system which has been developed at the UPC. In this paper some improvements are incorporated at different levels of Sethos: ...
Speech emotion recognition using hidden Markov models
Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción (2001)
Conference lecture
Restricted access - publisher's policy
This paper introduces a first approach to emotion recognition using RAMSES, the UPC's speech recognition system. The approach is based on standard speech recognition technology using hidden semi-continuous Markov models. ...
Task adaptation of sub-lexical unit models using the minimum confusibility criterion on task independent databases
Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (Robert H. Mannel and Jordi Robert-Ribes, 1998)
Conference lecture
Restricted access - publisher's policy
Discriminative training is a powerful tool in acoustic modeling for automatic speech recognition. Its strength is based on the direct minimisation of the number of errors committed by the system at recognition time. This ...
Task independent minimum confusability training for continuous speech recognition
Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (1998)
Conference lecture
Restricted access - publisher's policy
In this paper, a task independent discriminative training framework for subword units based continuous speech recognition is presented. Instead of aiming at the optimisation of any task independent figure, say the phone ...
The demiphone: An efficient contextual subword unit for continuous speech recognition
Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Pachès Leal, Pau; Bonafonte Cávez, Antonio (2000-09)
Article
Restricted access - publisher's policy
In this paper, we introduce the demiphone as a context-dependent phonetic unit for continuous speech recognition. A phoneme is divided into two parts: a left demiphone that accounts for the left coarticulation and a right ...
The demiphone: an efficient subword unit for Continuous Speech Recognition
Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Bonafonte Cávez, Antonio (Editors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Publisher: WCL, University of Patras, Greece, 1997)
Conference report
Open access
In this paper we introduce the demiphone as a contextual phonetic unit for continuous speech recognition. A phone is divided into two parts: a left demiphone that accounts for the left side coarticulation and a right ...