Browse by author "Giró Nieto, Xavier"
Showing items 83-102 of 175
-
Large scale content-based video retrieval with LIvRE
de Oliveira Barra, Gabriel; Lux, Mathias; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2016)
Conference paper
Restricted access (publisher policy)
The fast growth of video data requires robust, efficient, and scalable systems to allow for indexing and retrieval. These systems must be accessible from lightweight, portable and usable interfaces to help users in management ...
-
LAVICAD: LAboratori VIrtual de Comunicacions Analògiques i Digitals
Cabrera-Bean, Margarita; Giró Nieto, Xavier; Rey Micolau, Francesc; Gasull Llampallas, Antoni; Casas, Josep; Villares Piera, Nemesio Javier; Fernández Rubio, Juan Antonio; Sala Álvarez, José; Espinosa Fricke, Pedro; Fernández, Carlos Marcos; Cortes, Silvia; Muntanyola, Mireia; Farré, Miquel Angel (2009-02-12)
Conference paper / Text in conference proceedings
Open access
This grant has been used to extend the networked application LAVICAD (LAboratori VIrtual de COmunicacions Analògiques i Digitals), which is offered as an integrated part of the COM@WEB e-learning platform. LAVICAD is ...
-
LEMoRe: A lifelog engine for moments retrieval at the NTCIR-lifelog LSAT task
de Oliveira Barra, Gabriel; Cartas Ayala, Alejandro; Bolaños, Marc; Dimiccoli, Mariella; Giró Nieto, Xavier; Radeva, Petia (2016)
Conference paper
Open access
Semantic image retrieval from large amounts of egocentric visual data requires leveraging powerful techniques to bridge the semantic gap. This paper introduces LEMoRe, a Lifelog Engine for Moments Retrieval, developed ...
-
Linking media: adopting semantic technologies for multimodal media connection
Fernàndez, Dèlia; Bou Balust, Elisenda; Giró Nieto, Xavier; Riviero, Juan Carlos; Espadaler, Joan; Rodríguez, David; Colom Serra, Aleix; Rimmerk, Joan Marco; Varas, David; Massuda, Issey; Roig, Carlos (CEUR-WS.org, 2018)
Text in conference proceedings
Open access
Today's media and news organizations are constantly generating large amounts of multimedia content, mostly delivered online. As the online media market grows, the management and delivery of content is becoming a challenge. ...
-
LTA 2016: The First Workshop on Lifelogging Tools and Applications
Gurrin, Cathal; Giró Nieto, Xavier; Radeva, Petia; Dimiccoli, Mariella; Johansen, Havard D.; Joho, Hideo; Singh, Vivek K. (Association for Computing Machinery (ACM), 2016)
Text in conference proceedings
Restricted access (publisher policy)
The organisation of personal data is receiving increasing research attention due to the challenges we face in gathering, enriching, searching, and visualising such data. Given the increasing ease with which personal data ...
-
Mask-guided sample selection for semi-supervised instance segmentation
Bellver Bueno, Míriam; Salvador Aguilera, Amaia; Torres Viñals, Jordi; Giró Nieto, Xavier (2020-07-05)
Article
Open access
Image segmentation methods are usually trained with pixel-level annotations, which require significant human effort to collect. Weakly-supervised pipelines are the most common solution to address this constraint because ...
-
More cat than cute?: interpretable prediction of adjective-noun pairs
Fernàndez, Dèlia; Woodward, Alejandro; Campos Camunez, Victor; Giró Nieto, Xavier; Jou, Brendan; Chang, Shih-Fu (2017)
Text in conference proceedings
Restricted access (publisher policy)
The increasing availability of affect-rich multimedia resources has bolstered interest in understanding sentiment and emotions in and from visual content. Adjective-noun pairs (ANP) are a popular midlevel semantic ...
-
Mostratge i reconstrucció
Escola Superior d'Enginyeries Industrial, Aeroespacial i Audiovisual de Terrassa; Departament de Teoria del Senyal i Comunicacions (TSC); Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2010)
Lecture notes
Open access
-
Multi-view 3D face reconstruction in the wild using siamese networks
Ramon, Eduard; Escur, Janna; Giró Nieto, Xavier (Computer Vision Foundation, 2019)
Text in conference proceedings
Open access
In this work, we present a novel learning-based approach to reconstruct 3D faces from a single or multiple images. Our method uses a simple yet powerful architecture based on siamese neural networks that helps to extract ...
-
Multiresolution co-clustering for uncalibrated multiview segmentation
Ventura, Carles; Varas, David; Vilaplana Besler, Verónica; Giró Nieto, Xavier; Marqués Acosta, Fernando (2019-05-04)
Article
Open access
We propose a technique for coherently co-clustering uncalibrated views of a scene with a contour-based representation. Our work extends the previous framework, an iterative algorithm for segmenting sequences with small ...
-
Multiscale annotation of still images with GAT
Giró Nieto, Xavier; Martos Asensio, Manel (2012)
Conference paper
Open access
This paper presents GAT, a Graphical Annotation Tool for still images that works both at the global and local scales. This interface has been designed to assist users in the annotation of images with relation to the ...
-
NII-HITACHI-UIT at TRECVID 2015 instance search
Nguyen, Vinh-Tiep; Le, Duy-Dinh; Salvador Aguilera, Amaia; Zu, Caizhi; Nguyen, Dinh-Luan; Tran, Minh-Triet; Duc, Thanh-Ngo; Duong, Duc-Anh; Satoh, Shin'ichi; Giró Nieto, Xavier (2015)
Conference paper
Restricted access (publisher policy)
In this paper, we propose two methods to improve last year's instance search framework. Both of them are based on a post-processing scheme that tries to rerank the top K shots returned from the BOW model. The first system is to propose a ...
-
Object retrieval with deep convolutional features
Mohedano, Eva; Salvador Aguilera, Amaia; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel; Marqués Acosta, Fernando (IOS Press, 2017-11-23)
Book chapter
Restricted access (publisher policy)
Deep learning and image processing are two areas of great interest to academics and industry professionals alike. The areas of application of these two disciplines range widely, encompassing fields such as medicine, robotics, ...
-
Object segmentation in images using EEG signals
Mohedano, Eva; Healy, Graham; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel; Smeaton, Alan F. (ACM, 2014)
Conference paper
Open access
This paper explores the potential of brain-computer interfaces in segmenting objects from images. Our approach is centered around designing an effective method for displaying the image parts to the users such that they ...
-
One perceptron to rule them all: language, vision, audio and speech
Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2020)
Conference paper
Restricted access (publisher policy)
Deep neural networks have boosted the convergence of multimedia data analytics in a unified framework shared by practitioners in natural language, vision and speech. Image captioning, lip reading or video sonorization are ...
-
Online detection of action start in untrimmed, streaming videos
Shou, Zheng; Pan, Junting; Chan, Jonathan; Miyazawa, Kazuyuki; Mansour, Hassan; Vetro, Anthony; Giró Nieto, Xavier; Chang, Shih-Fu (Springer, 2018)
Conference paper
Open access
We aim to tackle a novel task in action detection - Online Detection of Action Start (ODAS) in untrimmed, streaming videos. The goal of ODAS is to detect the start of an action instance, with high categorization accuracy ...
-
Part-based object retrieval with binary partition trees
Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2012-05-31)
Thesis
Open access
This thesis addresses the problem of visual object retrieval, where a user formulates a query to an image database by providing one or multiple examples of an object of interest. The presented techniques aim both at finding ...
-
PathGAN: visual scanpath prediction with generative adversarial networks
Assens, Marc; Giró Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel (Springer, 2019)
Conference paper
Open access
We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its ...
-
Photo clustering of social events by extending photoTOC to a rich context
Manchon Vizuete, Daniel; Gris-Sarabia, Irene; Giró Nieto, Xavier (2014)
Conference paper
Restricted access (publisher policy)
The popularisation of the storage of photos on the cloud has opened new opportunities and challenges for the organisation and extension of photo collections. This paper presents a light computational solution for the ...
-
PixInWav: Residual steganography for hiding pixels in audio
Geleta Geleta, Margarita; Puntí Álvarez, Cristina; McGuinness, Kevin; Pons Puig, Jordi; Canton Ferrer, Cristian; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2022)
Conference paper
Open access
Steganography comprises the mechanics of hiding data in a host media that may be publicly available. While previous works focused on unimodal setups (e.g., hiding images in images, or hiding audio in audio), PixInWav targets ...