Now showing items 31-43 of 43

    • Joint training of codebooks and acoustic models in automatic speech recognition using semi-continuous HMMs 

      Nogueiras Rodríguez, Albino; Caballero Galeote, Mónica; Mariño Acebal, José Bernardo (Eduardo Lleida Solano, 2006)
      Conference paper
      Restricted access (publisher's policy)
      In this paper, three different techniques for building semicontinuous HMM-based speech recognisers are compared: the classical one, using Euclidean generated codebooks and independently trained acoustic models; jointly ...
    • Maximum likelihood based discriminative training of acoustic models 

      Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (European Speech Communication Association (ESCA), 1995)
      Conference proceedings paper
      Open access
      In this paper, a framework for discriminative training of acoustic models based on Generalised Probabilistic Descent (GPD) method is presented. The key feature of our proposal, Maximum Likelihood based Discriminative ...
    • Minimum confusibility training of context dependent demiphones 

      Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (G. Olaszy, G. Németh, K. Erdohegyi, 1999)
      Conference proceedings paper
      Open access
      During the last years two different approaches have been widely used in order to improve the acoustic modeling in continuous speech recognition systems: discriminative training algorithms and context dependent subword ...
    • Multi-dialectal Spanish speech recognition 

      Nogueiras Rodríguez, Albino; Caballero Galeote, Mónica; Moreno Bilbao, M. Asunción (Institute of Electrical and Electronics Engineers (IEEE), 2002)
      Conference proceedings paper
      Restricted access (publisher's policy)
      Spanish is a global language, spoken in a large number of different countries with great dialectal variability. This paper deals with the suitability of using a single multi-dialectal acoustic modeling for all the Spanish ...
    • Multidialectal acoustic modeling: a comparative study 

      Caballero Galeote, Mónica; Moreno Bilbao, M. Asunción; Nogueiras Rodríguez, Albino (2006)
      Conference paper
      Restricted access (publisher's policy)
      In this paper, multidialectal acoustic modeling based on sharing data across dialects is addressed. A comparative study of different methods of combining data based on decision tree clustering algorithms is presented. ...
    • NaniBD: a set of tools for transcribing and validating speech databases 

      Nogueiras Rodríguez, Albino; Moreno Bilbao, M. Asunción (European Language Resources Association (ELRA), 1998)
      Conference proceedings paper
      Open access
      This paper describes NaniBD, a set of tools designed for transcribing and validating speech databases, developed at the Signal Processing Group (GPS) of the Department of Signal Theory and Communications of the Polytechnic ...
    • Ponderación ML de parámetros en un sistema de reconocimiento de palabras basado en CDHMM 

      Valverde Amador, Antonio Javier; Hernando Pericás, Francisco Javier; Nogueiras Rodríguez, Albino (1996)
      Conference paper
      Open access
      Speech dynamic features are routinely used in current speech recognition systems in combination with short-term (static) spectral features. The aim of this paper is to propose a method to automatically estimate the optimum ...
    • SETHOS: the UPC speech understanding system 

      Bonafonte Cávez, Antonio; Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
      Conference paper
      Restricted access (publisher's policy)
      In EuroSpeech'95, the authors presented the first version of Sethos, the speech understanding system which has been developed at the UPC. In this paper some improvements are incorporated at different levels of Sethos: ...
    • Speech emotion recognition using hidden Markov models 

      Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción (2001)
      Conference paper
      Restricted access (publisher's policy)
      This paper introduces a first approach to emotion recognition using RAMSES, the UPC’s speech recognition system. The approach is based on standard speech recognition technology using hidden semi-continuous Markov models. ...
    • Task adaptation of sub-lexical unit models using the minimum confusibility criterion on task independent databases 

      Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (Robert H. Mannel and Jordi Robert-Ribes, 1998)
      Conference paper
      Restricted access (publisher's policy)
      Discriminative training is a powerful tool in acoustic modeling for automatic speech recognition. Its strength is based on the direct minimisation of the number of errors committed by the system at recognition time. This ...
    • Task independent minimum confusability training for continuous speech recognition 

      Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo (1998)
      Conference paper
      Restricted access (publisher's policy)
      In this paper, a task independent discriminative training framework for subword units based continuous speech recognition is presented. Instead of aiming at the optimisation of any task independent figure, say the phone ...
    • The demiphone: An efficient contextual subword unit for continuous speech recognition 

      Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Pachès Leal, Pau; Bonafonte Cávez, Antonio (2000-09)
      Article
      Restricted access (publisher's policy)
      In this paper, we introduce the demiphone as a context-dependent phonetic unit for continuous speech recognition. A phoneme is divided into two parts: a left demiphone that accounts for the left coarticulation and a right ...
    • The demiphone: an efficient subword unit for Continuous Speech Recognition 

      Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Bonafonte Cávez, Antonio (Editors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Publisher: WCL, University of Patras, Greece, 1997)
      Conference proceedings paper
      Open access
      In this paper we introduce the demiphone as a contextual phonetic unit for continuous speech recognition. A phone is divided into two parts: a left demiphone that accounts for the left side coarticulation and a right ...