Shallow and deep convolutional networks for saliency prediction

Pan, Junting; Sayrol Clols, Elisa; Giró Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel

doi:10.1109/CVPR.2016.71

dc.contributor.author	Pan, Junting
dc.contributor.author	Sayrol Clols, Elisa
dc.contributor.author	Giró Nieto, Xavier
dc.contributor.author	McGuinness, Kevin
dc.contributor.author	O'Connor, Noel
dc.contributor.other	Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned	2016-12-14T15:11:49Z
dc.date.issued	2016
dc.identifier.citation	Pan, J., Sayrol, E., Giro, X., McGuinness, K., O'Connor, N. Shallow and deep convolutional networks for saliency prediction. A: IEEE Conference on Computer Vision and Pattern Recognition. "29th IEEE Conference on Computer Vision and Pattern Recognition: 26 June-1 July 2016: Las Vegas, Nevada". Las Vegas, Nevada: Institute of Electrical and Electronics Engineers (IEEE), 2016, p. 598-606.
dc.identifier.isbn	978-1-4673-8852-8
dc.identifier.uri	http://hdl.handle.net/2117/98248
dc.description.abstract	The prediction of salient areas in images has been traditionally addressed with hand-crafted features based on neuroscience principles. This paper, however, addresses the problem with a completely data-driven approach by training a convolutional neural network (convnet). The learning process is formulated as a minimization of a loss function that measures the Euclidean distance of the predicted saliency map with the provided ground truth. The recent publication of large datasets of saliency prediction has provided enough data to train end-to-end architectures that are both fast and accurate. Two designs are proposed: a shallow convnet trained from scratch, and a another deeper solution whose first three layers are adapted from another network trained for classification. To the authors knowledge, these are the first end-to-end CNNs trained and tested for the purpose of saliency prediction.
dc.format.extent	9 p.
dc.language.iso	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.subject	Àrees temàtiques de la UPC::So, imatge i multimèdia::Creació multimèdia::Imatge digital
dc.subject	Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Reconeixement de formes
dc.subject.lcsh	Computer vision
dc.subject.lcsh	Pattern recognition systems
dc.subject.other	Computer vision
dc.subject.other	Convolution
dc.subject.other	Forecasting
dc.subject.other	Neural networks
dc.title	Shallow and deep convolutional networks for saliency prediction
dc.type	Conference lecture
dc.subject.lemac	Visió per ordinador
dc.subject.lemac	Reconeixement de formes (Informàtica)
dc.contributor.group	Universitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
dc.identifier.doi	10.1109/CVPR.2016.71
dc.description.peerreviewed	Peer Reviewed
dc.relation.publisherversion	http://ieeexplore.ieee.org/document/7780440/
dc.rights.access	Restricted access - publisher's policy
local.identifier.drac	18719964
dc.description.version	Postprint (published version)
dc.relation.projectid	info:eu-repo/grantAgreement/MINECO//TEC2013-43935-R/ES/PROCESADO DE INFORMACION HETEROGENEA Y SEÑALES EN GRAFOS PARA BIG DATA. APLICACION EN CRIBADO DE ALTO RENDIMIENTO, TELEDETECCION, MULTIMEDIA Y HCI./
dc.date.lift	10000-01-01
local.citation.author	Pan, J.; Sayrol, E.; Giro, X.; McGuinness, K.; O'Connor, N.
local.citation.contributor	IEEE Conference on Computer Vision and Pattern Recognition
local.citation.pubplace	Las Vegas, Nevada
local.citation.publicationName	29th IEEE Conference on Computer Vision and Pattern Recognition: 26 June-1 July 2016: Las Vegas, Nevada
local.citation.startingPage	598
local.citation.endingPage	606

Fitxers d'aquest items

Nom:: Pan_Shallow_and_Deep_CVPR_2016 ...
Mida:: 466,1Kb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Ponències/Comunicacions de congressos [317]
Ponències/Comunicacions de congressos [3.327]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Shallow and deep convolutional networks for saliency prediction

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora