Ara es mostren els items 83-102 de 175

    • Large scale content-based video retrieval with LIvRE 

      de Oliveira Barra, Gabriel; Lux, Mathias; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2016)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      The fast growth of video data requires robust, efficient, and scalable systems to allow for indexing and retrieval. These systems must be accessible from lightweight, portable and usable interfaces to help users in management ...
    • LAVICAD: LAboratori VIrtual de Comunicacions Analògiques i Digitals 

      Cabrera-Bean, Margarita; Giró Nieto, Xavier; Rey Micolau, Francesc; Gasull Llampallas, Antoni; Casas, Josep; Villares Piera, Nemesio Javier; Fernández Rubio, Juan Antonio; Sala Álvarez, José; Espinosa Fricke, Pedro; Fernández, Carlos Marcos; Cortes, Silvia; Muntanyola, Mireia; Farré, Miquel Angel (2009-02-12T09:27:53Z)
      Comunicació de congrés / Text en actes de congrés
      Accés obert
      Mitjançant el present ajut s’ha ampliat l’aplicació en xarxa LAVICAD (LAboratori VIrtual de COmunicacions Analògiques i Digitals) que s’ofereix de forma integrada dins de la plataforma d’e-learning COM@WEB. LAVICAD és ...
    • LEMoRe: A lifelog engine for moments retrieval at the NTCIR-lifelog LSAT task 

      de Oliveira Barra, Gabriel; Cartas Ayala, Alejandro; Bolaños, Marc; Dimiccoli, Mariella; Giró Nieto, Xavier; Radeva, Petia (2016)
      Comunicació de congrés
      Accés obert
      Semantic image retrieval from large amounts of egocentric visual data requires to leverage powerful techniques for filling in the semantic gap. This paper introduces LEMoRe, a Lifelog Engine for Moments Retrieval, developed ...
    • Linking media: adopting semantic technologies for multimodal media connection 

      Fernàndez, Dèlia; Bou Balust, Elisenda; Giró Nieto, Xavier; Riviero, Juan Carlos; Espadaler, Joan; Rodríguez, David; Colom Serra, Aleix; Rimmerk, Joan Marco; Varas, David; Massuda, Issey; Roig, Carlos (CEUR-WS.org, 2018)
      Text en actes de congrés
      Accés obert
      Today's media and news organizations are constantly generating large amounts of multimedia content, majorly delivered online. As the online media market grows, the management and delivery of contents is becoming a challenge. ...
    • LTA 2016: The First Workshop on Lifelogging Tools and Applications 

      Gurrin, Cathal; Giró Nieto, Xavier; Radeva, Petia; Dimicolli, Mariella; Johansen, Havard D.; Joho, Hideo; Singh, Vivek K. (Association for Computing Machinery (ACM), 2016)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      The organisation of personal data is receiving increasing research attention due to the challenges we face in gathering, enriching, searching, and visualising such data. Given the increasing ease with which personal data ...
    • Mask-guided sample selection for semi-supervised instance segmentation 

      Bellver Bueno, Míriam; Salvador Aguilera, Amaia; Torres Viñals, Jordi; Giró Nieto, Xavier (2020-07-05)
      Article
      Accés obert
      Image segmentation methods are usually trained with pixel-level annotations, which require significant human effort to collect. Weakly-supervised pipelines are the most common solution to address this constraint because ...
    • More cat than cute?: interpretable prediction of adjective-noun pairs 

      Fernàndez, Dèlia; Woodward, Alejandro; Campos Camunez, Victor; Giró Nieto, Xavier; Jou, Brendan; Chang, Shih-Fu (2017)
      Text en actes de congrés
      Accés restringit per política de l'editorial
      The increasing availability of affect-rich multimedia resources has bolstered interest in understanding sentiment and emotions in and from visual content. Adjective-noun pairs (ANP) are a popular midlevel semantic ...
    • Mostratge i reconstrucció 

      Escola Superior d'Enginyeries Industrial, Aeroespacial i Audiovisual de Terrassa; Departament de Teoria del Senyal i Comunicacions (TSC); Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2010)
      Apunts
      Accés obert
    • Multi-view 3D face reconstruction in the wild using siamese networks 

      Ramon, Eduard; Escur, Janna; Giró Nieto, Xavier (Computer Vision Foundation, 2019)
      Text en actes de congrés
      Accés obert
      In this work, we present a novel learning based approach to reconstruct 3D faces from a single or multiple images. Our method uses a simple yet powerful architecture based on siamese neural networks that helps to extract ...
    • Multiresolution co-clustering for uncalibrated multiview segmentation 

      Ventura, Carles; Varas, David; Vilaplana Besler, Verónica; Giró Nieto, Xavier; Marqués Acosta, Fernando (2019-05-04)
      Article
      Accés obert
      We propose a technique for coherently co-clustering uncalibrated views of a scene with a contour-based representation. Our work extends the previous framework, an iterative algorithm for segmenting sequences with small ...
    • Multiscale annotation of still images with GAT 

      Giró Nieto, Xavier; Martos Asensio, Manel (2012)
      Comunicació de congrés
      Accés obert
      This paper presents GAT, a Graphical Annotation Tool for still images that works both at the global and local scales. This interface has been designed to assist users in the an- notation of images with relation to the ...
    • NII-HITACHI-UIT at TRECVID 2015 instance search 

      Nguyen, Vinh-Tiep; Le, Duy-Dinh; Salvador Aguilera, Amaia; Zu, Caizhi; Nguyen, Dinh-Luan; Tran, Minh-Triet; Duc, Thanh-Ngo; Duong, Duc-Anh; Satoh, Shin'ichi; Giró Nieto, Xavier (2015)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      In this paper, we propose two methods to improve last year instance search framework. Both of them are based on post processing scheme that try to rerank top K shots returned from BOW model. The rst system is to propose a ...
    • Object retrieval with deep convolutional features 

      Mohedano, Eva; Salvador Aguilera, Amaia; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel; Marqués Acosta, Fernando (IOS Press, 2017-11-23)
      Capítol de llibre
      Accés restringit per política de l'editorial
      Deep learning and image processing are two areas of great interest to academics and industry professionals alike. The areas of application of these two disciplines range widely, encompassing fields such as medicine, robotics, ...
    • Object segmentation in images using EEG signals 

      Mohedano, Eva; Healy, Graham; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel; Smeaton, Alan F. (ACM, 2014)
      Comunicació de congrés
      Accés obert
      This paper explores the potential of brain-computer interfaces in segmenting objects from images. Our approach is centered around designing an effective method for displaying the image parts to the users such that they ...
    • One perceptron to rule them all: language, vision, audio and speech 

      Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2020)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      Deep neural networks have boosted the convergence of multimedia data analytics in a unified framework shared by practitioners in natural language, vision and speech. Image captioning, lip reading or video sonorization are ...
    • Online detection of action start in untrimmed, streaming videos 

      Shou, Zheng; Pan, Junting; Chan, Jonathan; Miyazawa, Kazuyuki; Mansour, Hassan; Vetro, Anthony; Giró Nieto, Xavier; Chang, Shih-Fu (Springer, 2018)
      Comunicació de congrés
      Accés obert
      We aim to tackle a novel task in action detection - Online Detection of Action Start (ODAS) in untrimmed, streaming videos. The goal of ODAS is to detect the start of an action instance, with high categorization accuracy ...
    • Part-based object retrieval with binary partition trees 

      Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2012-05-31)
      Tesi
      Accés obert
      This thesis addresses the problem of visual object retrieval, where a user formulates a query to an image database by providing one or multiple examples of an object of interest. The presented techniques aim both at finding ...
    • PathGAN: visual scanpath prediction with generative adversarial networks 

      Assens, Marc; Giró Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel (Springer, 2019)
      Comunicació de congrés
      Accés obert
      We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its ...
    • Photo clustering of social events by extending photoTOC to a rich context 

      Manchon Vizuete, Daniel; Gris-Sarabia, Irene; Giró Nieto, Xavier (2014)
      Comunicació de congrés
      Accés restringit per política de l'editorial
      The popularisation of the storage of photos on the cloud has opened new opportunities and challenges for the organisation and extension of photo collections. This paper presents a light computational solution for the ...
    • Pixinwav: Residual steganography for hiding pixels in audio 

      Geleta Geleta, Margarita; Puntí Álvarez, Cristina; McGuinness, Kevin; Pons Puig, Jordi; Canton Ferrer, Cristian; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2022)
      Comunicació de congrés
      Accés obert
      Steganography comprises the mechanics of hiding data in a host media that may be publicly available. While previous works focused on unimodal setups (e.g., hiding images in images, or hiding audio in audio), PixInWav targets ...