Mostra el registre d'ítem simple

dc.contributor.authorFernàndez, Dèlia
dc.contributor.authorVaras, David
dc.contributor.authorEspadaler, Joan
dc.contributor.authorMasuda, Issey
dc.contributor.authorFerreira, Jordi
dc.contributor.authorWoodward, Alejandro
dc.contributor.authorRodríguez, David
dc.contributor.authorGiró Nieto, Xavier
dc.contributor.authorRiveiro, Juan Carlos
dc.contributor.authorBou Balust, Elisenda
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament d'Enginyeria Electrònica
dc.date.accessioned2017-11-15T11:09:21Z
dc.date.available2017-11-15T11:09:21Z
dc.date.issued2017
dc.identifier.citationFernàndez, D., Varas, D., Espadaler, J., Masuda, I., Ferreira, J., Woodward, A., Rodríguez, D., Giro, X., Riveiro, J. C., Bou, E. ViTS: Video tagging system from massive web multimedia collections. A: Workshop on Web-Scale Vision and Social Media. "Proceedings of the 5th Workshop on Web-scale Vision and Social Media (VSM)". Venice: IEEE Press, 2017, p. 337-346.
dc.identifier.urihttp://hdl.handle.net/2117/110668
dc.description.abstractThe popularization of multimedia content on the Web has arised the need to automatically understand, index and retrieve it. In this paper we present ViTS, an automatic Video Tagging System which learns from videos, their web context and comments shared on social networks. ViTS analyses massive multimedia collections by Internet crawling, and maintains a knowledge base that updates in real time with no need of human supervision. As a result, each video is indexed with a rich set of labels and linked with other related contents. ViTS is an industrial product under exploitation with a vocabulary of over 2.5M concepts, capable of indexing more than 150k videos per month. We compare the quality and completeness of our tags with respect to the ones in the YouTube-8M dataset, and we show how ViTS enhances the semantic annotation of the videos with a larger number of labels (10.04 tags/video), with an accuracy of 80,87%.
dc.format.extent10 p.
dc.language.isoeng
dc.publisherIEEE Press
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la imatge i del senyal vídeo
dc.subjectÀrees temàtiques de la UPC::So, imatge i multimèdia::Creació multimèdia::Vídeo digital
dc.subject.lcshDigital video
dc.subject.lcshImage processing--Digital techniques
dc.subject.lcshSemantic computing
dc.subject.lcshInternet videos
dc.subject.lcshOnline etiquette
dc.titleViTS: Video tagging system from massive web multimedia collections
dc.typeConference report
dc.subject.lemacVídeo digital
dc.subject.lemacImatges -- Processament -- Tècniques digitals
dc.subject.lemacWeb semàntica
dc.subject.lemacVídeos per Internet
dc.subject.lemacEtiqueta a Internet
dc.contributor.groupUniversitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
dc.relation.publisherversionhttp://openaccess.thecvf.com/ICCV2017_workshops/ICCV2017_W5.py
dc.rights.accessOpen Access
local.identifier.drac21590658
dc.description.versionPostprint (published version)
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO//TEC2013-43935-R/ES/PROCESADO DE INFORMACION HETEROGENEA Y SEÑALES EN GRAFOS PARA BIG DATA. APLICACION EN CRIBADO DE ALTO RENDIMIENTO, TELEDETECCION, MULTIMEDIA Y HCI./
local.citation.authorFernàndez, D.; Varas, D.; Espadaler, J.; Masuda, I.; Ferreira, J.; Woodward, A.; Rodríguez, D.; Giro, X.; Riveiro, J. C.; Bou, E.
local.citation.contributorWorkshop on Web-Scale Vision and Social Media
local.citation.pubplaceVenice
local.citation.publicationNameProceedings of the 5th Workshop on Web-scale Vision and Social Media (VSM)
local.citation.startingPage337
local.citation.endingPage346


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple