The impact of visual saliency prediction in image classification

Arazo Sánchez, Eric

dc.contributor	Giró Nieto, Xavier
dc.contributor	McGuinness, Kevin
dc.contributor.author	Arazo Sánchez, Eric
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2017-02-27T08:30:59Z
dc.date.available	2017-02-27T08:30:59Z
dc.date.issued	2017
dc.identifier.uri	http://hdl.handle.net/2117/101576
dc.description.abstract	This thesis introduces an architecture to improve the accuracy of a Convolutional Neural Network trained for image classification using visual saliency predictions from the original images. In this thesis the accuracy of a Convolutional Neural Network (CNN) trained for classification has been improved using saliency maps from the original images. The network had an AlexNet architecture and was trained using 1.2 million images from the Imagenet dataset. Two methods had been explored in order to exploit the information from the visual saliency predictions. The first methodologies implemented applied the saliency maps directly to the existing layers of the CNN, which in some cases were already trained for classification and in other they were initialized with random weights. In the second methodology the information from the saliency maps was merged from a new branch, trained at the same time as the initial CNN. In order to speed up the training of the networks the experiments were implemented using images reduced to 128x128. With this sizes the proposed model achieves 12.39% increase in Top-1 accuracy performance with respect to the original CNN, and additionally reduces the number of parameters needed compared to AlexNet. Regarding the original size images 227x227 a model that increases 1.72% Top-1 accuracy is proposed. To accelerate the training process of the network the images have been reduced. The methodology that provides the higher improvement in accuracy will be implemented using the original size of the images. The results will be compared to those obtained from the network trained only with the original images. All the methodologies proposed are implemented in a network previously trained for classification. Additionally the most successful methodologies will be implemented in the training of a network. The results will provide information about the best way to add saliency maps to improve the accuracy.
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.rights	S'autoritza la difusió de l'obra mitjançant la llicència Creative Commons o similar 'Reconeixement-NoComercial- SenseObraDerivada'
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.subject	Àrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcsh	Neural networks (Computer science)
dc.subject.lcsh	Machine learning
dc.subject.other	Saliency
dc.subject.other	alexnet
dc.subject.other	imagenet
dc.subject.other	convolutional neural network
dc.subject.other	deep learning
dc.title	The impact of visual saliency prediction in image classification
dc.type	Master thesis
dc.subject.lemac	Xarxes neuronals (Informàtica)
dc.subject.lemac	Aprenentatge automàtic
dc.identifier.slug	ETSETB-230.124831
dc.rights.access	Open Access
dc.date.updated	2017-02-20T06:51:12Z
dc.audience.educationlevel	Màster
dc.audience.mediator	Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
dc.audience.degree	MÀSTER UNIVERSITARI EN ENGINYERIA DE TELECOMUNICACIÓ (Pla 2013)
dc.contributor.covenantee	Dublin City University

Fitxers d'aquest items

Nom:: Eric_Arazo_Sanchez_TFM.pdf
Mida:: 828,6Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Master's degree in Telecommunications Engineering (MET) [392]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

The impact of visual saliency prediction in image classification

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora