Restricted Boltzmann Machine Supervectors for speaker recognition

Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier

doi:10.1109/ICASSP.2015.7178883

Visualitza/Obre

Omid_Ghahabi.pdf (356,1Kb) (Accés restringit) Sol·licita una còpia a l'autor

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Ghahabi Esfahani, Omid

Hernando Pericás, Francisco Javier

Tipus de documentText en actes de congrés

Data publicació2015

EditorInstitute of Electrical and Electronics Engineers (IEEE)

Condicions d'accésAccés restringit per política de l'editorial

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

The use of Restricted Boltzmann Machines (RBM) is proposed in this paper as a non-linear transformation of GMM supervectors for speaker recognition. It will be shown that the RBM transformation will increase the discrimination power of raw GMM supervectors for speaker recognition. The experimental results on the core test condition of the NIST SRE 2006 corpus show that the proposed RBM supervectors will achieve a comparable performance to i-vectors. Furthermore, the combination of RBM supevectors and i-vectors in the score level improves the performance of the i-vector approach by more than 10% in terms of EER.

CitacióGhahabi, O., Hernando, J. Restricted Boltzmann Machine Supervectors for speaker recognition. A: IEEE International Conference on Acoustics, Speech, and Signal Processing. "2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015): South Brisbane, Queensland, Australia: 19-24 April 2015". Brisbane: Institute of Electrical and Electronics Engineers (IEEE), 2015, p. 4804-4808.

URIhttp://hdl.handle.net/2117/84507

DOI10.1109/ICASSP.2015.7178883

ISBN9781467369985

Versió de l'editorhttp://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7178883

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
Omid_Ghahabi.pdf		356,1Kb	PDF	Accés restringit

UPCommons. Portal del coneixement obert de la UPC

Restricted Boltzmann Machine Supervectors for speaker recognition

Visualitza/Obre

Explora