Mostra el registre d'ítem simple

dc.contributorTorres Viñals, Jordi
dc.contributor.authorLin, Xunyu
dc.date.accessioned2018-01-26T09:00:21Z
dc.date.available2018-01-26T09:00:21Z
dc.date.issued2017
dc.identifier.urihttp://hdl.handle.net/2117/113234
dc.description.abstractThis paper instroduces an unsupervised framework to extract semantically rich features for video representation. Inspired by how the human visual system groups objects based on motion cues, we propose a deep convolutional neural network that disentangles motion, foreground and background information. The proposed architecture consists of a 3D convolutional feature encoder for blocks of 16 frames, which is trained for reconstruction tasks over the first and last frames of the sequence. The model is trained with a fraction of videos from the UCF-101 dataset taking as ground truth the bounding boxes around the activity regions. Qualitative results indicate that the network can successfully update the foreground appearance based on pure-motion features. The benefits of these learned features are shown in a discriminative classification task when compared with a random initialization of the network weights, providing a gain of accuracy above the 10%.
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.subjectÀrees temàtiques de la UPC::Informàtica
dc.subject.lcshArtificial intelligence
dc.subject.lcshVideo recording
dc.subject.otherUnsupervised learning
dc.subject.otherartificial intelligence
dc.subject.othervideo features
dc.subject.otheraction recognition
dc.titleDisentangling motion, foreground and background features in videos
dc.title.alternativeUnsupervised video representations learning for activity recognition
dc.typeBachelor thesis
dc.subject.lemacIntel·ligència artificial
dc.subject.lemacVídeo
dc.identifier.slug128563
dc.rights.accessOpen Access
dc.date.updated2017-06-30T14:11:12Z
dc.audience.educationlevelGrau
dc.audience.mediatorFacultat d'Informàtica de Barcelona
dc.audience.degreeGRAU EN ENGINYERIA INFORMÀTICA (Pla 2010)


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple