Restricted Boltzmann Machine Supervectors for speaker recognition
Document typeConference report
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Rights accessRestricted access - publisher's policy
The use of Restricted Boltzmann Machines (RBM) is proposed in this paper as a non-linear transformation of GMM supervectors for speaker recognition. It will be shown that the RBM transformation will increase the discrimination power of raw GMM supervectors for speaker recognition. The experimental results on the core test condition of the NIST SRE 2006 corpus show that the proposed RBM supervectors will achieve a comparable performance to i-vectors. Furthermore, the combination of RBM supevectors and i-vectors in the score level improves the performance of the i-vector approach by more than 10% in terms of EER.
CitationGhahabi, O., Hernando, J. Restricted Boltzmann Machine Supervectors for speaker recognition. A: IEEE International Conference on Acoustics, Speech, and Signal Processing. "2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2015): South Brisbane, Queensland, Australia: 19-24 April 2015". Brisbane: Institute of Electrical and Electronics Engineers (IEEE), 2015, p. 4804-4808.