Now showing items 12-31 of 52

    • Corpus for cyberbullying prevention 

      Moreno Bilbao, M. Asunción; Bonafonte Cávez, Antonio; Jauk, Igor; Tarrés, Laia; Pereira, Victor (International Speech Communication Association (ISCA), 2018)
      Conference report
      Open access
      Cyberbullying is the use of digital media to harass a person or group of people, through personal attacks, disclosure of confidential or false information, among other means. That is to say, it ...
    • Creating expressive synthetic voices by unsupervised clustering of audiobooks 

      Jauk, Igor; Bonafonte Cávez, Antonio; López Otero, Paula; Docio Fernández, Laura (International Speech Communication Association (ISCA), 2015)
      Conference lecture
      Restricted access (publisher's policy)
      In this work we design an approach for automatic feature selection and voice creation for expressive synthesis. Our approach is guided by two main goals: (1) increasing the flexibility of expressive voice creation and (2) ...
    • Deep neural networks for i-vector language identification of short utterances in cars 

      Ghahabi Esfahani, Omid; Bonafonte Cávez, Antonio; Hernando Pericás, Francisco Javier; Moreno Bilbao, M. Asunción (International Speech Communication Association (ISCA), 2016)
      Conference report
      Restricted access (publisher's policy)
      This paper is focused on the application of the Language Identification (LID) technology for intelligent vehicles. We cope with short sentences or words spoken in moving cars in four languages: English, Spanish, German, ...
    • Defining analogy for non-native inclusions in Spanish utterances 

      Polyakova, Tatyana; Bonafonte Cávez, Antonio (2010)
      Conference report
      Open access
      Mass media globalization introduces the challenge of multilingualism into most popular speech applications such as text-to-speech synthesis and automatic speech recognition. In Spain as well as in the other countries, the ...
    • Direct expressive voice training based on semantic selection 

      Jauk, Igor; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2016)
      Conference report
      Restricted access (publisher's policy)
      This work aims at creating expressive voices from audiobooks using semantic selection. First, for each utterance of the audiobook an acoustic feature vector is extracted, including iVectors built on MFCC and on F0 ...
    • Duration modeling with expanded HMM applied to speech recognition 

      Bonafonte Cávez, Antonio; Vidal Manzano, José; Nogueiras Rodríguez, Albino (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
      Conference lecture
      Open access
      The occupancy of the HMM states is modeled by means of a Markov chain. A linear estimator is introduced to compute the probabilities of the Markov chain. The distribution function (DF) represents accurately the observed ...
    • Final Exam

      Bonafonte Cávez, Antonio; Marqués Acosta, Fernando; Ventura Royo, Carles (Universitat Politècnica de Catalunya, 2012-06-10)
      Exam
      Access restricted to the UPC community
    • Explicit segmentation of speech using gaussian models 

      Bonafonte Cávez, Antonio; Nogueiras Rodríguez, Albino; Rodriguez-Garrido, A (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
      Conference report
      Restricted access (publisher's policy)
      The authors investigate an automatic method to segment labeled speech. The method needs an initial estimation of the segmentation which is provided by an alignment based on HMM. Afterwards, the boundaries are refined moving ...
    • Exploring efficient neural architectures for linguistic-acoustic mapping in text-to-speech 

      Pascual de la Puente, Santiago; Serra, Joan; Bonafonte Cávez, Antonio (Multidisciplinary Digital Publishing Institute, 2019-08-17)
      Article
      Open access
      Conversion from text to speech relies on the accurate mapping from linguistic to acoustic symbol sequences, for which current practice employs recurrent statistical models such as recurrent neural networks. Despite the ...
    • Expressive speech synthesis using sentiment embeddings 

      Jauk, Igor; Lorenzo Trueba, J.; Yamagishi, J.; Bonafonte Cávez, Antonio (International Speech Communication Association (ISCA), 2018)
      Conference report
      Open access
      In this paper we present a DNN based speech synthesis system trained on an audiobook including sentiment features predicted by the Stanford sentiment parser. The baseline system uses DNN to predict acoustic parameters based ...
    • Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan 

      Garrido, Juan Maria; Escudero, David; Aguilar, Lourdes; Cardeñoso Payo, V.; Rodero, Emma; de-la-Mota, Carme; González, César; Vivaracho, C. E.; Rustullet, Sílvia; Larrea, Olatz; Laplaza, Yesika; Vizcaíno, Francisco; Estebas, Eva; Cabrera, Mercedes; Bonafonte Cávez, Antonio (2013-12)
      Article
      Restricted access (publisher's policy)
      Literature review on prosody reveals the lack of corpora for prosodic studies in Catalan and Spanish. In this paper, we present a corpus intended to fill this gap. The corpus comprises two distinct data-sets, a news subcorpus ...
    • Introducing nativization to Spanish TTS systems 

      Polyakova, Tatyana; Bonafonte Cávez, Antonio (2011-06)
      Article
      Restricted access (publisher's policy)
      In the modern world, speech technologies must be flexible and adaptable to any framework. Mass media globalization introduces multilingualism as a challenge for the most popular speech applications such as text-to-speech ...
    • Language and noise transfer in speech enhancement generative adversarial network 

      Pascual de la Puente, Santiago; Park, Maruchan; Serra, Joan; Bonafonte Cávez, Antonio; Ahn, Kang-hun (Institute of Electrical and Electronics Engineers (IEEE), 2018)
      Conference report
      Restricted access (publisher's policy)
      Speech enhancement deep learning systems usually require large amounts of training data to operate in broad conditions or real applications. This makes the adaptability of those systems into new, low resource environments ...
    • Language modeling using X-grams 

      Bonafonte Cávez, Antonio; Mariño Acebal, José Bernardo (H. TIMOTHY BRUMMELL, WILLIAM IDSARDI CITATION DELAWARE, NEW CASTLE, DELAWARE, 1996)
      Conference report
      Open access
      In this paper, an extension of n-grams, called x-grams, is proposed. In this extension, the memory of the model (n) is not fixed a priori. Instead, large memories are accepted first, and merging criteria are then applied ...
    • Multi-output RNN-LSTM for multiple speaker speech synthesis and adaptation 

      Pascual, Santiago; Bonafonte Cávez, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Conference report
      Restricted access (publisher's policy)
      Deep Learning has been applied successfully to speech processing. In this paper we propose an architecture for speech synthesis using multiple speakers. Some hidden layers are shared by all the speakers, while there is a ...
    • Multi-output RNN-LSTM for multiple speaker speech synthesis with α-interpolation model

      Pascual, Santiago; Bonafonte Cávez, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Conference report
      Open access
      Deep Learning has been applied successfully to speech processing. In this paper we propose an architecture for speech synthesis using multiple speakers. Some hidden layers are shared by all the speakers, while there is a ...
    • Nativization of English words in Spanish using analogy 

      Polyakova, Tatyana; Bonafonte Cávez, Antonio (2010)
      Conference report
      Open access
      Nowadays modern speech technologies need to be flexible and adaptable to any framework. Mass media globalization introduces the challenge of multilingualism into most popular speech applications such as text-to-speech ...
    • Out-of-vocabulary word modelling and rejection for keyword spotting 

      Lleida Solano, Eduardo; Mariño, José B.; Salavedra Molí, Josep; Bonafonte Cávez, Antonio; Monte Moreno, Enrique (International Speech Communication Association (ISCA), 1993)
      Conference report
      Restricted access (publisher's policy)
      This paper presents a combination of out-of-vocabulary (OOV) word modeling and rejection techniques in an attempt to accept utterances embedding a keyword and reject utterances with nonkeywords. The goal of this research ...
    • Parametric modeling of PDF using a convolution of one-sided exponentials: application to HMM 

      Vidal Manzano, José; Bonafonte Cávez, Antonio; Rodríguez Fonollosa, José Adrián (European Association for Signal Processing (EURASIP), 1994)
      Conference report
      Open access
    • Prosodic and spectral iVectors for expressive speech synthesis 

      Jauk, Igor; Bonafonte Cávez, Antonio (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Conference lecture
      Open access
      This work presents a study on the suitability of prosodic and acoustic features, with a special focus on i-vectors, in expressive speech analysis and synthesis. For each utterance of two different databases, a laboratory ...