UPC system for the 2015 MediaEval multimodal person discovery in broadcast TV task
dc.contributor.author | India, Miquel |
dc.contributor.author | Varas González, David |
dc.contributor.author | Vilaplana Besler, Verónica |
dc.contributor.author | Morros Rubió, Josep Ramon |
dc.contributor.author | Hernando Pericás, Francisco Javier |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
dc.date.accessioned | 2016-03-18T11:59:25Z |
dc.date.available | 2016-03-18T11:59:25Z |
dc.date.issued | 2015 |
dc.identifier.citation | India, M., Varas, D., Vilaplana, V., Morros, J.R., Hernando, J. UPC system for the 2015 MediaEval multimodal person discovery in broadcast TV task. In: MediaEval Multimedia Benchmark Workshop. "MediaEval 2015 Multimedia Benchmark Workshop". Wurzen: 2015. |
dc.identifier.uri | http://hdl.handle.net/2117/84692 |
dc.description.abstract | This paper describes a system that identifies people in broadcast TV shows in a purely unsupervised manner. The system outputs the identities of people who appear, talk, and can be identified using information that appears in the show itself (in our case, text with person names). Three types of monomodal technologies are used: speech diarization, video diarization, and text detection / named entity recognition. Their outputs are combined using a linear programming approach subject to a set of constraints. |
dc.language.iso | eng |
dc.subject | UPC subject areas::Telecommunications engineering::Signal processing::Speech and acoustic signal processing |
dc.subject | UPC subject areas::Sound, image and multimedia::Multimedia creation::Digital video |
dc.subject.lcsh | Digital television |
dc.subject.lcsh | Automatic speech recognition |
dc.subject.lcsh | Speech processing systems |
dc.subject.lcsh | Digital video |
dc.title | UPC system for the 2015 MediaEval multimodal person discovery in broadcast TV task |
dc.type | Conference report |
dc.subject.lemac | Vídeo digital |
dc.subject.lemac | Televisió digital |
dc.subject.lemac | Reconeixement automàtic de la parla |
dc.contributor.group | Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo |
dc.contributor.group | Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla |
dc.relation.publisherversion | http://ceur-ws.org/Vol-1436/ |
dc.rights.access | Open Access |
local.identifier.drac | 17532051 |
dc.description.version | Postprint (published version) |
local.citation.author | India, M.; Varas, D.; Vilaplana, V.; Morros, J.R.; Hernando, J. |
local.citation.contributor | MediaEval Multimedia Benchmark Workshop |
local.citation.pubplace | Wurzen |
local.citation.publicationName | MediaEval 2015 Multimedia Benchmark Workshop |