Ara es mostren els items 41-52 de 52

    • Study of subword units for spanish speech recognition 

      Bonafonte Cávez, Antonio; Estany, Rafael; Vives, Eugenio (ESCA - J.M. PARDO, E. ENRIQUEZ, J. ORTEGA, J. FERREIROS GTM-UPM, 1995)
      Text en actes de congrés
      Accés obert
      This paper studies different sets of subword speech units to be used for recognizing Spanish. In particular it compares context dependent phones, syllables and demisyllables. It shows how context dependent units can ...
    • Synthesis of filled pauses based on a disfluent speech model 

      Adell Roig, Jordi; Bonafonte Cávez, Antonio; Escudero Mancebo, David (2010)
      Comunicació de congrés
      Accés restringit per política de l'editorial
    • Synthesis using speaker adaptation from speech recognition DB 

      Oller Moreno, Sergio; Moreno Bilbao, M. Asunción; Bonafonte Cávez, Antonio (Universidad de Vigo, 2010)
      Comunicació de congrés
      Accés obert
      This paper deals with the creation of multiple voices from a Hidden Markov Model based speech synthesis system (HTS). More than 150 Catalan synthetic voices were built using Hidden Markov Models (HMM) and speaker adaptation ...
    • Tecnologías del habla : conversión de texto a voz 

      Bonafonte Cávez, Antonio (Escola Tècnica Superior d'Enginyers de Telecomunicació de Barcelona, 1997)
      Article
      Accés obert
    • The demiphone: An efficient contextual subword unit for continuous speech recognition 

      Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Pachès Leal, Pau; Bonafonte Cávez, Antonio (2000-09)
      Article
      Accés restringit per política de l'editorial
      In this paper, we introduce the demiphone as a context-dependent phonetic unit for continuous speech recognition. A phoneme is divided into two parts: a left demiphone that accounts for the left coarticulation and a right ...
    • The demiphone:an efficient subword unit for Continuous Speech Recognition 

      Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Bonafonte Cávez, Antonio (Editors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Editorial: WCL, University of Patras, Grece, 1997)
      Text en actes de congrés
      Accés obert
      In this paper we introduce the demiphone as a contextual phonetic unit for continuous speech recognition. A phone is divided into two parts: a left demiphone that accounts for the left side coarticulation and a right ...
    • The UPC Text-to-Speech System for Spanish and Catalan 

      Bonafonte Cávez, Antonio; Esquerra Llucià, Ignasi; Febrer, A; Rodríguez Fonollosa, José Adrián; Vallverdú Bayés, Sisco (ISCA, 1998)
      Text en actes de congrés
      Accés obert
      This paper summarizes the text-to-speech system that has been developed in the Speech Group of the Universitat Politècnica de Catalunya (UPC). The system is composed of a core and different interfaces so that it is compatible ...
    • Time-domain speech enhancement using generative adversarial networks 

      Pascual de la Puente, Santiago; Serra, Joan; Bonafonte Cávez, Antonio (2019-11-01)
      Article
      Accés obert
      Speech enhancement improves recorded voice utterances to eliminate noise that might be impeding their intelligibility or compromising their quality. Typical speech enhancement systems are based on regression approaches ...
    • TTS evaluation campaign with a common spanish database 

      Sainz, Iñaki; Navas, Eva; Hernáez, Inma; Bonafonte Cávez, Antonio; Campillo, Francisco (2010)
      Text en actes de congrés
      Accés obert
      This paper describes the first TTS evaluation campaign designed for Spanish. Seven research institutions took part in the evaluation campaign and developed a voice from a common speech database provided by the organisation. ...
    • Using x-gram for efficient speech recognition 

      Bonafonte Cávez, Antonio; Mariño Acebal, José Bernardo (Robert H. Mannel and Jordi Robert-Ribes, 1998)
      Text en actes de congrés
      Accés obert
      X-grams are a generalization of the n-grams, where the number of previous conditioning words is different for each case and decided from the training data. X-grams reduce perplexity with respect to trigrams and need less ...
    • Visualizing punctuation restoration in speech transcripts with prosograph 

      Oktem, A.; Farrús, M.; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2018)
      Text en actes de congrés
      Accés obert
      We have developed a neural architecture that tests the effect of lexical, morphosyntactic and prosodic features in restoring punctuation in speech transcriptions. Having outperformed a baseline model in terms of precision ...
    • Work in progress - Cooperative and competitive projects for engaging students in advanced ICT subjects 

      Pardàs Feliu, Montse; Bonafonte Cávez, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2011)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      In this paper we present a specific kind of projects that can be used for project-based learning in engineering subjects. The subjects must combine lectures with projects, in order to provide the technical competences ...