Mostra el registre d'ítem simple
i-Vector modeling with deep belief networks for multi-session speaker recognition
dc.contributor.author | Ghahabi Esfahani, Omid |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2015-03-25T17:00:18Z |
dc.date.created | 2014 |
dc.date.issued | 2014 |
dc.identifier.citation | Ghahabi, O.; Hernando, J. i-Vector modeling with deep belief networks for multi-session speaker recognition. A: The Speaker and Language Recognition Workshop. "Odyssey 2014: The Speaker and Language Recognition Workshop". Joensuu: 2014, p. 305-310. |
dc.identifier.issn | 2312-2846 |
dc.identifier.uri | http://hdl.handle.net/2117/27035 |
dc.description.abstract | In this paper we propose an impostor selection method for a Deep Belief Network (DBN) based system which models i-vectors in a multi-session speaker verification task. In the proposed method, instead of choosing a fixed number of most informative impostors, a threshold is defined according to the frequencies of impostors. The selected impostors are then clustered and the centroids are considered as the final impostors for target speakers. The system first trains each target speaker unsupervisingly by an adaptation method and then models discriminatively each target speaker using the impostor centroids and target i-vectors. The evaluation is performed on the NIST 2014 i-vector challenge database and it is shown that the proposed DBN-based system achieves 23% relative improvement of minDCF over the baseline system in the challenge |
dc.format.extent | 6 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic |
dc.subject.lcsh | Speech processing systems |
dc.subject.lcsh | Automatic speech recognition |
dc.title | i-Vector modeling with deep belief networks for multi-session speaker recognition |
dc.type | Conference report |
dc.subject.lemac | Reconeixement automàtic de la parla |
dc.subject.lemac | Processament de la parla |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 15431739 |
dc.description.version | Postprint (published version) |
dc.date.lift | 10000-01-01 |
local.citation.author | Ghahabi, O.; Hernando, J. |
local.citation.contributor | The Speaker and Language Recognition Workshop |
local.citation.pubplace | Joensuu |
local.citation.publicationName | Odyssey 2014: The Speaker and Language Recognition Workshop |
local.citation.startingPage | 305 |
local.citation.endingPage | 310 |