Ara es mostren els items 28-47 de 52

    • Nativization of English words in Spanish using analogy 

      Polyakova, Tatyana; Bonafonte Cávez, Antonio (2010)
      Text en actes de congrés
      Accés obert
      Nowadays modern speech technologies need to be flexible and adaptable to any framework. Mass media globalization introduces the challenge of multilingualism into most popular speech applications such as text-to-speech ...
    • Out-of-vocabulary word modelling and rejection for keyword spotting 

      Lleida Solano, Eduardo; Mariño, José B.; Salavedra Molí, Josep; Bonafonte Cávez, Antonio; Monte Moreno, Enrique (International Speech Communication Association (ISCA), 1993)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      This paper presents a combination of out-of-vocabulary (OOV) word modeling and rejection techniques in an attempt to accept utterances embedding a keyword and reject utterances with nonkeywords. The goal of this research ...
    • Parametric modeling of PDF using a convolution of one-sided exponentials: application to HMM 

      Vidal Manzano, José; Bonafonte Cávez, Antonio; Rodríguez Fonollosa, José Adrián (European Association for Signal Processing (EURASIP), 1994)
      Text en actes de congrés
      Accés obert
    • Prosodic and spectral iVectors for expressive speech synthesis 

      Jauk, Igor; Bonafonte Cávez, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Comunicació de congrés
      Accés obert
      This work presents a study on the suitability of prosodic andacoustic features, with a special focus on i-vectors, in expressivespeech analysis and synthesis. For each utterance of two dif-ferent databases, a laboratory ...
    • Prosodic break prediction with RNNs 

      Pascual de la Puente, Santiago; Bonafonte Cávez, Antonio (Springer, 2016)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      Prosodic breaks prediction from text is a fundamental task to obtain naturalness in text to speech applications. In this work we build a data-driven break predictor out of linguistic features like the Part of Speech (POS) ...
    • Rational characteristic functions and markov chains 

      Vidal Manzano, José; Bonafonte Cávez, Antonio; Losada, N; Rodríguez Fonollosa, José Adrián; Rodríguez Fonollosa, Javier (. S.N., 1995)
      Text en actes de congrés
      Accés obert
      Abstract 1 We investigate in this paper how to estimate the density function of a random variable using a parametric ARMA model for its characteristic function. The choice of this model is motivated by the fact that this ...
    • Recent work on the FESTCAT database for speech synthesis 

      Bonafonte Cávez, Antonio; Esquerra Llucià, Ignasi; Aguilar, Lourdes; Oller Moreno, Sergio; Moreno Bilbao, M. Asunción (2009)
      Text en actes de congrés
      Accés obert
      This paper presents our work around the FESTCAT project, whose main goal was the development of voices for the Festival suite in Catalan. In the first year, we produced the corpus and the speech data needed for build ...
    • Recognition of numbers by using demisyllables and hidden Markov models 

      Mariño Acebal, José Bernardo; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción; Lleida Solano, Eduardo; Nadeu Camprubí, Climent; Monte Moreno, Enrique (Elsevier, 1990)
      Text en actes de congrés
      Accés obert
    • Reconocimiento del habla continua mediante modelos ocultos de Markov utilizando la técnica de búsqueda en haz 

      Lleida Solano, Eduardo; Mariño Acebal, José Bernardo; Bonafonte Cávez, Antonio (Universidad de Málaga, 1992)
      Text en actes de congrés
      Accés obert
    • Search engine for multilingual audiovisual contents 

      Pérez, José David; Bonafonte Cávez, Antonio; Ruiz Costa-Jussà, Marta; Cardenal, Antonio; Rodríguez Fonollosa, José Adrián; Moreno Bilbao, M. Asunción; Navas, Eva; Rodríguez Banga, Eduardo (2012)
      Comunicació de congrés
      Accés obert
      This paper describes the BUCEADOR search engine, a web server that allows retrieving. multimedia documents (text, audio, video) in different languages. All the documents are translated into the user language and are ...
    • SETHOS: the UPC speech understanding system 

      Bonafonte Cávez, Antonio; Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      In EuroSpeech'95, the authors presented the first version of Sethos, the speech understanding system which has been developed at the UPC. In this paper some improvements are incorporated at different levels of Sethos: ...
    • Spanish statistical parametric speech synthesis using a neural vocoder 

      Bonafonte Cávez, Antonio; Pascual de la Puente, Santiago; Dorca, G. (International Speech Communication Association (ISCA), 2018)
      Text en actes de congrés
      Accés obert
      During the 2000s decade, unit-selection based text-to-speech was the dominant commercial technology. Meanwhile, the TTS research community has made a big effort to push statistical-parametric speech synthesis to get similar ...
    • Speech emotion recognition using hidden Markov models 

      Nogueiras Rodríguez, Albino; Mariño Acebal, José Bernardo; Bonafonte Cávez, Antonio; Moreno Bilbao, M. Asunción (2001)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      This paper introduces a first approach to emotion recognition using RAMSES, the UPC’s speech recognition system. The approach is based on standard speech recognition technology using hidden semi-continuous Markov models. ...
    • Study of subword units for spanish speech recognition 

      Bonafonte Cávez, Antonio; Estany, Rafael; Vives, Eugenio (ESCA - J.M. PARDO, E. ENRIQUEZ, J. ORTEGA, J. FERREIROS GTM-UPM, 1995)
      Text en actes de congrés
      Accés obert
      This paper studies different sets of subword speech units to be used for recognizing Spanish. In particular it compares context dependent phones, syllables and demisyllables. It shows how context dependent units can ...
    • Synthesis of filled pauses based on a disfluent speech model 

      Adell Roig, Jordi; Bonafonte Cávez, Antonio; Escudero Mancebo, David (2010)
      Comunicació de congrés
      Accés restringit per política de l'editorial
    • Synthesis using speaker adaptation from speech recognition DB 

      Oller Moreno, Sergio; Moreno Bilbao, M. Asunción; Bonafonte Cávez, Antonio (Universidad de Vigo, 2010)
      Comunicació de congrés
      Accés obert
      This paper deals with the creation of multiple voices from a Hidden Markov Model based speech synthesis system (HTS). More than 150 Catalan synthetic voices were built using Hidden Markov Models (HMM) and speaker adaptation ...
    • Tecnologías del habla : conversión de texto a voz 

      Bonafonte Cávez, Antonio (Escola Tècnica Superior d'Enginyers de Telecomunicació de Barcelona, 1997)
      Article
      Accés obert
    • The demiphone: An efficient contextual subword unit for continuous speech recognition 

      Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Pachès Leal, Pau; Bonafonte Cávez, Antonio (2000-09)
      Article
      Accés restringit per política de l'editorial
      In this paper, we introduce the demiphone as a context-dependent phonetic unit for continuous speech recognition. A phoneme is divided into two parts: a left demiphone that accounts for the left coarticulation and a right ...
    • The demiphone:an efficient subword unit for Continuous Speech Recognition 

      Mariño Acebal, José Bernardo; Nogueiras Rodríguez, Albino; Bonafonte Cávez, Antonio (Editors: G. Kokkinakis, N. Fakotakis, E. Dermatas; Editorial: WCL, University of Patras, Grece, 1997)
      Text en actes de congrés
      Accés obert
      In this paper we introduce the demiphone as a contextual phonetic unit for continuous speech recognition. A phone is divided into two parts: a left demiphone that accounts for the left side coarticulation and a right ...
    • The UPC Text-to-Speech System for Spanish and Catalan 

      Bonafonte Cávez, Antonio; Esquerra Llucià, Ignasi; Febrer, A; Rodríguez Fonollosa, José Adrián; Vallverdú Bayés, Sisco (ISCA, 1998)
      Text en actes de congrés
      Accés obert
      This paper summarizes the text-to-speech system that has been developed in the Speech Group of the Universitat Politècnica de Catalunya (UPC). The system is composed of a core and different interfaces so that it is compatible ...