i-Vector modeling with deep belief networks for multi-session speaker recognition

Ghahabi Esfahani, Omid; Hernando Pericás, Francisco Javier

dc.contributor.author	Ghahabi Esfahani, Omid
dc.contributor.author	Hernando Pericás, Francisco Javier
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2015-03-25T17:00:18Z
dc.date.created	2014
dc.date.issued	2014
dc.identifier.citation	Ghahabi, O.; Hernando, J. i-Vector modeling with deep belief networks for multi-session speaker recognition. In: The Speaker and Language Recognition Workshop. "Odyssey 2014: The Speaker and Language Recognition Workshop". Joensuu: 2014, p. 305-310.
dc.identifier.issn	2312-2846
dc.identifier.uri	http://hdl.handle.net/2117/27035
dc.description.abstract	In this paper we propose an impostor selection method for a Deep Belief Network (DBN) based system that models i-vectors in a multi-session speaker verification task. Instead of choosing a fixed number of the most informative impostors, the proposed method defines a threshold according to the frequencies of the impostors. The selected impostors are then clustered, and the centroids serve as the final impostors for the target speakers. The system first trains each target speaker in an unsupervised manner using an adaptation method, and then models each target speaker discriminatively using the impostor centroids and the target i-vectors. The evaluation is performed on the NIST 2014 i-vector challenge database, and the proposed DBN-based system achieves a 23% relative improvement in minDCF over the challenge's baseline system.
dc.format.extent	6 p.
dc.language.iso	eng
dc.subject	UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing
dc.subject.lcsh	Speech processing systems
dc.subject.lcsh	Automatic speech recognition
dc.title	i-Vector modeling with deep belief networks for multi-session speaker recognition
dc.type	Conference report
dc.subject.lemac	Automatic speech recognition
dc.subject.lemac	Speech processing
dc.contributor.group	Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.rights.access	Restricted access - publisher's policy
local.identifier.drac	15431739
dc.description.version	Postprint (published version)
dc.date.lift	10000-01-01
local.citation.author	Ghahabi, O.; Hernando, J.
local.citation.contributor	The Speaker and Language Recognition Workshop
local.citation.pubplace	Joensuu
local.citation.publicationName	Odyssey 2014: The Speaker and Language Recognition Workshop
local.citation.startingPage	305
local.citation.endingPage	310

Files in this item

Name: Odyssey 2014.pdf
Size: 1.130 MB
Format: PDF
Description: Odyssey 2014

This item appears in the following collections

Conference papers/presentations [437]
Conference papers/presentations [3,332]

UPCommons. Open knowledge portal of the UPC
