PathGAN: visual scanpath prediction with generative adversarial networks

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication or transformation is prohibited without the authorization of the rights holder.


Abstract

We introduce PathGAN, a deep neural network for visual scanpath prediction trained adversarially. A visual scanpath is the sequence of fixation points that a human observer's gaze traces over an image. PathGAN is composed of two parts, the generator and the discriminator. Both parts extract features from images using off-the-shelf networks, and train recurrent layers to generate or discriminate scanpaths, respectively. In scanpath prediction, the stochastic nature of the data makes it very difficult to generate realistic predictions with supervised learning strategies, so we adopt adversarial training as a suitable alternative. Our experiments show that PathGAN improves on the state of the art in visual scanpath prediction on the iSUN and Salient360! datasets.

Description

“This is a post-peer-review, pre-copyedit version of an article published in: Computer Vision – ECCV 2018 Workshops. The final authenticated version is available online at: http://dx.doi.org/10.1007/978-3-030-11021-5_25”.

Citation: Assens, M. [et al.]. PathGAN: visual scanpath prediction with generative adversarial networks. In: Workshop on Egocentric Perception, Interaction and Computing. "Computer Vision: ECCV 2018 Workshops, Munich, Germany, September 8-14, 2018: proceedings, part V". Berlin: Springer, 2019, p. 406-422.

URI: http://hdl.handle.net/2117/130229

DOI: 10.1007/978-3-030-11021-5_25

ISBN: 978-3-030-11021-5

Publisher's version: https://link.springer.com/chapter/10.1007%2F978-3-030-11021-5_25

Other identifiers: https://imatge-upc.github.io/pathgan/


Files	Description	Size	Format
1809.00567.pdf		3.781 MB	PDF

UPCommons. Open knowledge portal of the UPC
