UPC system for the 2016 MediaEval multimodal person discovery in broadcast TV task
Document typeConference lecture
Rights accessOpen Access
The UPC system works by extracting monomodal signal segments (face tracks, speech segments) that overlap with the person names overlaid in the video signal. These segments are assigned directly with the name of the person and used as a reference to compare against the non-overlapping (unassigned) signal segments. This process is performed independently both on the speech and video signals. A simple fusion scheme is used to combine both monomodal annotations into a single one.
CitationIndia, M., Marti, G., Sayrol, E., Morros, J.R., Hernando, J., Cortillas, C., Bouritsas, G. UPC system for the 2016 MediaEval multimodal person discovery in broadcast TV task. A: Multimedia Benchmark Workshop. "MediaEval 2016 working notes proceedings". Hilversum: CEUR-WS.org, 2016, p. 1-3.