Now showing items 1-20 of 175

    • A closer look at referring expressions for video object segmentation 

      Bellver Bueno, Míriam; Ventura Royo, Carles; Silberer, Carina; Kazakos, Ioannis; Torres Viñals, Jordi; Giró Nieto, Xavier (2023-01)
      Article
      Open Access
      The task of Language-guided Video Object Segmentation (LVOS) aims at generating binary masks for an object referred by a linguistic expression. When this expression unambiguously describes an object in the scene, it is ...
    • Acoustic event detection based on feature-level fusion of audio and video modalities 

      Butko, Taras; Canton Ferrer, Cristian; Segura Perales, Carlos; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (HINDAWI, 2011-03-15)
      Article
      Open Access
      Acoustic event detection (AED) aims at determining the identity of sounds and their temporal position in audio signals. When applied to spontaneously generated acoustic events, AED based only on audio information shows a ...
    • An interactive lifelog search engine for LSC2018 

      Alsina, Adrià; Giró Nieto, Xavier; Gurrin, Cathal (Association for Computing Machinery (ACM), 2018)
      Conference lecture
      Open Access
      In this work, we describe an interactive lifelog search engine developed for the LSC 2018 search challenge at ACM ICMR 2018. The paper introduces the four-step process required to support lifelog search engines and describes ...
    • APLICACIONS A LA CREACIO DE CONTINGUTS MULTIMEDIA (Examen 1r Quadr.) 

      Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2006-01-13)
      Exam
      Restricted access to the UPC academic community
    • Aplicacions de la transformada Z 

      Escola Superior d'Enginyeries Industrial, Aeroespacial i Audiovisual de Terrassa; Departament de Teoria del Senyal i Comunicacions (TSC); Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2010)
      Lecture notes
      Open Access
    • Assessing knee OA severity with CNN attention-based end-to-end architectures 

      Górriz, Marc; Antony, Joseph; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel (2019)
      Conference lecture
      Open Access
      This work proposes a novel end-to-end convolutional neural network (CNN) architecture to automatically quantify the severity of knee osteoarthritis (OA) using X-Ray images, which incorporates trainable attention modules ...
    • Assessment of crowdsourcing and gamification loss in user-assisted object segmentation 

      Carlier, Axel; Salvador Aguilera, Amaia; Cabezas, Ferran; Giró Nieto, Xavier; Charvillat, Vincent; Marques, Oge (2015-09-12)
      Article
      Open Access
      There has been a growing interest in applying human computation – particularly crowdsourcing techniques – to assist in the solution of multimedia, image processing, and computer vision problems which are still too difficult ...
    • Audiovisual event detection towards scene understanding 

      Canton Ferrer, Cristian; Butko, Taras; Segura, C.; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2009)
      Conference report
      Restricted access - publisher's policy
      Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a ...
    • Automatic keyframe selection based on mutual reinforcement algorithm 

      Ventura Royo, Carles; Giró Nieto, Xavier; Vilaplana Besler, Verónica; Giribet, Daniel; Carasusan, Eusebio (Institute of Electrical and Electronics Engineers (IEEE), 2013)
      Conference report
      Restricted access - publisher's policy
      This paper addresses the problem of video summarization through an automatic selection of a single representative keyframe. The proposed solution is based on the mutual reinforcement paradigm, where a keyframe is selected ...
    • Automatic reminiscence therapy for dementia 

      Carós, Mariona; Garolera Freixa, Maite; Radeva, Petia; Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2020)
      Conference lecture
      Restricted access - publisher's policy
      With people living longer than ever, the number of cases with dementia such as Alzheimer's disease increases steadily. It affects more than 46 million people worldwide, and it is estimated that in 2050 more than 100 million ...
    • Bags of local convolutional features for scalable instance search 

      Mohedano, Eva; Salvador Aguilera, Amaia; McGuinness, Kevin; Marqués Acosta, Fernando; O'Connor, Noel; Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2016)
      Conference lecture
      Open Access
      This work proposes a simple instance retrieval pipeline based on encoding the convolutional features of CNN using the bag of words aggregation scheme (BoW). Assigning each local array of activations in a convolutional layer ...
    • BitSearch, the blog before the thesis 

      Giró Nieto, Xavier (2010)
      Conference lecture
      Open Access
    • Budget-aware semi-supervised semantic and instance segmentation 

      Bellver Bueno, Míriam; Salvador Aguilera, Amaia; Torres Viñals, Jordi; Giró Nieto, Xavier (2019)
      Conference lecture
      Open Access
      Methods that move towards less supervised scenarios are key for image segmentation, as dense labels demand significant human intervention. Generally, the annotation burden is mitigated by labeling datasets with weaker forms ...
    • Class-weighted convolutional features for visual instance search 

      Jiménez, Albert; Alvarez, Jose M.; Giró Nieto, Xavier (2017)
      Conference lecture
      Open Access
      Image retrieval in realistic scenarios targets large dynamic datasets of unlabeled images. In these cases, training or fine-tuning a model every time new images are added to the database is neither efficient nor scalable. ...
    • Click'n'Cut: crowdsourced interactive segmentation with object candidates 

      Carlier, Axel; Charvillat, Vincent; Salvador Aguilera, Amaia; Giró Nieto, Xavier; Marques, Ogé (ACM, 2014)
      Conference lecture
      Open Access
      This paper introduces Click’n’Cut, a novel web tool for inter- active object segmentation designed for crowdsourcing tasks. Click’n’Cut combines bounding boxes and clicks generated by workers to obtain accurate object ...
    • Comparing fixed and adaptive computation time for recurrent neural networks 

      Fojo, Daniel; Campos Camunez, Victor; Giró Nieto, Xavier (2018)
      Conference report
      Open Access
      Deep networks commonly perform better than shallow ones, but allocating the proper amount of computation for each particular input sample remains an open problem. This issue is particularly challenging in sequential tasks, ...
    • COMUNICACIONS ANALÒGIQUES I DIGITALS (1r quadrimestre) 

      Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2009-11-06)
      Exam
      Restricted access to the UPC academic community
    • COMUNICACIONS AUDIOVISUALS (1r quadrimestre) 

      Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2010-01-11)
      Exam
      Restricted access to the UPC academic community
    • COMUNICACIONS AUDIOVISUALS (1r quadrimestre, 1a avaluació) 

      Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2010-11-12)
      Exam
      Restricted access to the UPC academic community
    • COMUNICACIONS AUDIOVISUALS (Examen de teoria, 1r quadrimestre) 

      Giró Nieto, Xavier (Universitat Politècnica de Catalunya, 2009-11-06)
      Exam
      Restricted access to the UPC academic community