Mostra el registre d'ítem simple
Overlap detection for speaker diarization by fusing spectral and spatial features
dc.contributor.author | Zelenak, Martin |
dc.contributor.author | Segura Perales, Carlos |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2011-02-07T09:47:19Z |
dc.date.available | 2011-02-07T09:47:19Z |
dc.date.created | 2010 |
dc.date.issued | 2010 |
dc.identifier.citation | Zelenak, M.; Segura, C.; Hernando, J. Overlap detection for speaker diarization by fusing spectral and spatial features. A: INTERSPEECH. "INTERSPEECH". 2010, p. 2302-2305. |
dc.identifier.isbn | 1990-9772 |
dc.identifier.uri | http://hdl.handle.net/2117/11288 |
dc.description.abstract | A substantial portion of errors of the conventional speaker diarization systems on meeting data can be accounted to overlapped speech. This paper proposes the use of several spatial features to improve speech overlap detection on distant channel microphones. These spatial features are integrated into a spectral-based system by using principal component analysis and neural networks. Different overlap detection hypotheses are used to improve diarization performance with both overlap exclusion and overlap labeling. In experiments conducted on AMI Meeting Corpus we demonstrate a relative DER improvement of 11.6% and 14.6% for single- and multi-site data, respectively. |
dc.format.extent | 4 p. |
dc.language.iso | eng |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació |
dc.subject.lcsh | Speaker overlap detection |
dc.subject.lcsh | Speaker diarization |
dc.subject.lcsh | Signal theory (Telecommunication) |
dc.subject.lcsh | Neural networks (Computer science) |
dc.title | Overlap detection for speaker diarization by fusing spectral and spatial features |
dc.type | Conference lecture |
dc.subject.lemac | Processament de la parla |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.rights.access | Restricted access - publisher's policy |
local.identifier.drac | 4599986 |
dc.description.version | Postprint (published version) |
local.citation.author | Zelenak, M.; Segura, C.; Hernando, J. |
local.citation.contributor | INTERSPEECH |
local.citation.publicationName | INTERSPEECH |
local.citation.startingPage | 2302 |
local.citation.endingPage | 2305 |