Speaker recognition by means of restricted Boltzmann machine adaptation
Tipus de documentComunicació de congrés
EditorUniversidad Autónoma de Madrid
Condicions d'accésAccés obert
Restricted Boltzmann Machines (RBMs) have shown success in speaker recognition. In this paper, RBMs are investigated in a framework comprising a universal model training and model adaptation. Taking advantage of RBM unsupervised learning algorithm, a global model is trained based on all available background data. This general speaker-independent model, referred to as URBM, is further adapted to the data of a specific speaker to build speaker-dependent model. In order to show its effectiveness, we have applied this framework to two different tasks. It has been used to discriminatively model target and impostor spectral features for classification. It has been also utilized to produce a vector-based representation for speakers. This vector-based representation, similar to i-vector, can be further used for speaker recognition using either cosine scoring or Probabilistic Linear Discriminant Analysis (PLDA). The evaluation is performed on the core test condition of the NIST SRE 2006 database.
CitacióSafari, P., Ghahabi, O., Hernando, J. Speaker Recognition by means of restricted Boltzmann machine adaptation. A: Simposium Nacional de la Unión Científica Internacional de Radio. "URSI 2016 Madrid. XXXI Simposium Nacional de la Unión Científica Internacional de Radio". Madrid: Universidad Autónoma de Madrid, 2016, p. 1-4.