dc.contributor.author: Butko, Taras
dc.contributor.author: Nadeu Camprubí, Climent
dc.contributor.other: Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned: 2012-03-04T11:09:35Z
dc.date.available: 2012-03-04T11:09:35Z
dc.date.created: 2011
dc.date.issued: 2011
dc.identifier.citation: Butko, T.; Nadeu, C. Audio segmentation of broadcast news : a hierarchical system with feature selection for the Albayzin-2010 evaluation. A: International Conference on Acoustics, Speech and Signal Processing. "2011 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings". Barcelona: IEEE Press. Institute of Electrical and Electronics Engineers, 2011, p. 357-360.
dc.identifier.uri: http://hdl.handle.net/2117/15469
dc.description.abstract: In this paper, we present an audio segmentation system for broadcast news, and its results in the Albayzin-2010 evaluation. First of all, the Albayzin-2010 evaluation setup, developed by the authors, is presented; in particular, the database and the metric are described. The reported hierarchical HMM-GMM-based system is composed of one binary detector for each of the five considered classes (music, speech, speech over music, speech over noise and other). A fast one-pass-training feature selection technique is adapted to the audio segmentation task to improve the results and to reduce the dimensionality of the input feature vector.
dc.format.extent: 4 p.
dc.language.iso: eng
dc.publisher: IEEE Press. Institute of Electrical and Electronics Engineers
dc.subject: Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic
dc.subject.lcsh: Hidden Markov models
dc.subject.lcsh: Audio signal processing
dc.subject.lcsh: Broadcasting
dc.title: Audio segmentation of broadcast news : a hierarchical system with feature selection for the Albayzin-2010 evaluation
dc.type: Conference lecture
dc.subject.lemac: Model ocult de Markov
dc.subject.lemac: Albayzin 2010
dc.subject.lemac: So -- Processament de dades
dc.contributor.group: Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
dc.identifier.doi: 10.1109/ICASSP.2011.5946414
dc.description.peerreviewed: Peer Reviewed
dc.rights.access: Restricted access - publisher's policy
local.identifier.drac: 9512446
dc.description.version: Postprint (published version)
local.citation.author: Butko, T.; Nadeu, C.
local.citation.contributor: International Conference on Acoustics, Speech and Signal Processing
local.citation.pubplace: Barcelona
local.citation.publicationName: 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings
local.citation.startingPage: 357
local.citation.endingPage: 360

