Enviaments recents

  • Speaker recognition by means of restricted Boltzmann machine adaptation 

    Safari, Pooyan; Ghahabi, Omid; Hernando Pericás, Francisco Javier (Universidad Autónoma de Madrid, 2016)
    Comunicació de congrés
    Accés obert
    Restricted Boltzmann Machines (RBMs) have shown success in speaker recognition. In this paper, RBMs are investigated in a framework comprising a universal model training and model adaptation. Taking advantage of RBM ...
  • Nanitrans: a speech labelling tool 

    Portabella, D; Febrer, A; Moreno Bilbao, M. Asunción; Rodríguez Fonollosa, José Adrián (., 2000)
    Text en actes de congrés
    Accés obert
    This paper deals with a description of NaniTrans, a tool for segmentation and labeling of speech. The tool is programmed to work on the MATLAB application interface, in any of the supported platforms (Unix, Windows, ...
  • Mail Access Plus. Sistema de Mensajería Unificada 

    Rodríguez Fonollosa, José Adrián; Guillermo, Vila; David, Font (Universidad de Zaragoza. URSI, 2000)
    Text en actes de congrés
    Accés obert
    This paper describes an unified messaging system which joins voice mail, fax and e-mail access. Its architecture consists in a group of independent servers, each offering an interface between the unified mailbox and an ...
  • Use of voicing information to improve the robustness of the spectral parameter set 

    Macho Ciena, Dusan; Nadeu Camprubí, Climent (ICSLP, 2000)
    Text en actes de congrés
    Accés obert
    Speech recognition systems that operate in real world environments have to be robust against additive noises. In this work, a technique that uses a voicing-dependent exponent in the computation of the filter-bank parameters ...
  • Reconocimiento del locutor en telefonia: actividades del proyecto europeo COST 250 

    Hernando Pericás, Francisco Javier; Garcia, C; Rodriguez, L; González Rodríguez, Joaquin; Ortega García, Javier (Universidad Politecnica de Madrid, 2000)
    Text en actes de congrés
    Accés obert
    El objetivo de esta comunicación es presentar las actividades realizadas desde noviembre de 1994 dentro del proyecto “Speaker Recognition in Telephony”, financiado por la Comunidad Europea en el marco del programa “European ...
  • On the use of filter bank energies driven from the osa sequence for noisy speech recognition 

    Hernando Pericás, Francisco Javier (INSTITUTE OF ACOUSTICS, 2000)
    Text en actes de congrés
    Accés obert
    epresentation of speech signal has shown to be attractive for noisy speech recognition because of both its high recognition performance with respect to the conventional LP in severe conditions of additive broad-band noise ...
  • Reconocimiento del locutor mediante filtrado frecuencial de energías espectrales estimadas por métodos híbridos 

    Hernando Pericás, Francisco Javier; Nadeu Camprubí, Climent (Universidad Politecnica de Madrid, 2000)
    Text en actes de congrés
    Accés obert
    Se han explorado dos formas de obtener parámetros más robustos para reconocimiento del locutor: la hibridación de técnicas de análisis espectral y el filtrado frecuencial de las energías de las bandas. Se ha comprobado que ...
  • Automatic speech recognition with deep neural networks for impaired speech 

    España-i-Bonet, Cristina; Rodríguez Fonollosa, José Adrián (Springer, 2016)
    Text en actes de congrés
    Accés obert
    Automatic Speech Recognition has reached almost human performance in some controlled scenarios. However, recognition of impaired speech is a difficult task for two main reasons: data is (i) scarce and (ii) heterogeneous. ...
  • Improving the robustness of the usual fbe-based asr front-end 

    Nadeu Camprubí, Climent; Macho, D; Hernando Pericás, Francisco Javier (Mergablum, 2000)
    Text en actes de congrés
    Accés obert
    All speech recognition systems require some form of signal representation that parametrically models the temporal evolution of the spectral envelope. Current parameterizations involve, either explicitly or implicitly, ...
  • Monolingual and bilingual spanish-catalan speech recognizers developed from SpeechDat databases 

    Mariño Acebal, José Bernardo; Padrell, J; Moreno Bilbao, M. Asunción; Nadeu Camprubí, Climent (C. Draxler, 2000)
    Text en actes de congrés
    Accés obert
    Under the SpeechDat specifications, the Spanish member of SpeechDat consortium has recorded a Catalan database that includes one thousand speakers. This communication describes some experimental work that has been carried ...

Mostra'n més