Mostra el registre d'ítem simple
The temporal dimension of visual attention models
dc.contributor | Giró Nieto, Xavier |
dc.contributor.author | Assens Reina, Marc |
dc.date.accessioned | 2017-11-03T13:29:13Z |
dc.date.available | 2017-11-03T13:29:13Z |
dc.date.issued | 2017 |
dc.identifier.uri | http://hdl.handle.net/2117/109755 |
dc.description | Details of the project will be defined once the student is in Dublin. |
dc.description.abstract | This thesis explores methodologies for scanpath prediction on images using deep learning frameworks. As a preliminary step, we analyze the characteristics of the data provided by different datasets. We then explore the use of Convolutional Neural Networks (CNN) and Long-Short-Term-Memory (LSTM) newtworks for scanpath prediction. We observe that these models fail due to the high stochastic nature of the data. With the gained insight, we propose a novel time-aware visual saliency representation named Saliency Volume, that averages scanpaths over multiple observers. Next, we explore the SalNet network and adapt it for saliency volume prediction, and we find several ways of generating scanpaths from saliency volumes. Finally, we fine-tuned our model for scanpaht prediction on 360-degree images and successfully submitted it to the Salient360! Challenge from ICME. The source code and models are publicly available at https://github.com/massens/saliency-360salient-2017. |
dc.language.iso | eng |
dc.publisher | Universitat Politècnica de Catalunya |
dc.rights | S'autoritza la difusió de l'obra mitjançant la llicència Creative Commons o similar 'Reconeixement-NoComercial- SenseObraDerivada' |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació |
dc.subject.lcsh | Image processing |
dc.subject.lcsh | Machine learning |
dc.subject.lcsh | Neural networks (Computer science) |
dc.subject.other | Deep learning |
dc.subject.other | saliency |
dc.subject.other | visual attention |
dc.subject.other | saliency model |
dc.subject.other | neural network |
dc.title | The temporal dimension of visual attention models |
dc.title.alternative | La dimensión temporal de los modelos de atención visual |
dc.title.alternative | La dimensió temporal dels models d'atenció visual |
dc.type | Bachelor thesis |
dc.subject.lemac | Imatges -- Processament |
dc.subject.lemac | Aprenentatge automàtic |
dc.subject.lemac | Xarxes neuronals (Informàtica) |
dc.identifier.slug | ETSETB-230.126921 |
dc.rights.access | Open Access |
dc.date.updated | 2017-07-20T05:53:30Z |
dc.audience.educationlevel | Grau |
dc.audience.mediator | Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona |
dc.audience.degree | GRAU EN CIÈNCIES I TECNOLOGIES DE TELECOMUNICACIÓ (Pla 2010) |