Now showing items 1-20 of 80

  • Acoustic event detection based on feature-level fusion of audio and video modalities 

    Butko, Taras; Canton Ferrer, Cristian; Segura Perales, Carlos; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (HINDAWI, 2011-03-15)
    Article
    Open Access
    Acoustic event detection (AED) aims at determining the identity of sounds and their temporal position in audio signals. When applied to spontaneously generated acoustic events, AED based only on audio information shows a ...
  • An interactive lifelog search engine for LSC2018 

    Alsina, Adrià; Giró Nieto, Xavier; Gurrin, Cathal (Association for Computing Machinery (ACM), 2018)
    Conference lecture
    Open Access
    In this work, we describe an interactive lifelog search engine developed for the LSC 2018 search challenge at ACM ICMR 2018. The paper introduces the four-step process required to support lifelog search engines and describes ...
  • Assessment of crowdsourcing and gamification loss in user-assisted object segmentation 

    Carlier, Axel; Salvador Aguilera, Amaia; Cabezas, Ferran; Giró Nieto, Xavier; Charvillat, Vincent; Marques, Oge (2015-09-12)
    Article
    Open Access
    There has been a growing interest in applying human computation – particularly crowdsourcing techniques – to assist in the solution of multimedia, image processing, and computer vision problems which are still too difficult ...
  • Audiovisual event detection towards scene understanding 

    Canton Ferrer, Cristian; Butko, Taras; Segura, C.; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2009)
    Conference report
    Restricted access - publisher's policy
    Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a ...
  • Automatic keyframe selection based on mutual reinforcement algorithm 

    Ventura Royo, Carles; Giró Nieto, Xavier; Vilaplana Besler, Verónica; Giribet, Daniel; Carasusan, Eusebio (Institute of Electrical and Electronics Engineers (IEEE), 2013)
    Conference report
    Restricted access - publisher's policy
    This paper addresses the problem of video summarization through an automatic selection of a single representative keyframe. The proposed solution is based on the mutual reinforcement paradigm, where a keyframe is selected ...
  • Bags of local convolutional features for scalable instance search 

    Mohedano, Eva; Salvador Aguilera, Amaia; McGuinness, Kevin; Marqués Acosta, Fernando; O'Connor, Noel; Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2016)
    Conference lecture
    Open Access
    This work proposes a simple instance retrieval pipeline based on encoding the convolutional features of CNN using the bag of words aggregation scheme (BoW). Assigning each local array of activations in a convolutional layer ...
  • BitSearch, the blog before the thesis 

    Giró Nieto, Xavier (2010)
    Conference lecture
    Open Access
  • Class-weighted convolutional features for visual instance search 

    Jiménez, Albert; Alvarez, Jose M.; Giró Nieto, Xavier (2017)
    Conference lecture
    Open Access
    Image retrieval in realistic scenarios targets large dynamic datasets of unlabeled images. In these cases, training or fine-tuning a model every time new images are added to the database is neither efficient nor scalable. ...
  • Click'n'Cut: crowdsourced interactive segmentation with object candidates 

    Carlier, Axel; Charvillat, Vincent; Salvador Aguilera, Amaia; Giró Nieto, Xavier; Marques, Ogé (ACM, 2014)
    Conference lecture
    Open Access
    This paper introduces Click’n’Cut, a novel web tool for inter- active object segmentation designed for crowdsourcing tasks. Click’n’Cut combines bounding boxes and clicks generated by workers to obtain accurate object ...
  • Comparing fixed and adaptive computation time for recurrent neural networks 

    Fojo, Daniel; Campos Camunez, Victor; Giró Nieto, Xavier (2018)
    Conference report
    Open Access
    Deep networks commonly perform better than shallow ones, but allocating the proper amount of computation for each particular input sample remains an open problem. This issue is particularly challenging in sequential tasks, ...
  • Cost-effective active learning for melanoma segmentation 

    Górriz, Marc; Giró Nieto, Xavier; Carlier, Axel; Faure, Emmanuel (2017)
    Conference lecture
    Open Access
    We propose a novel Active Learning framework capable to train effectively a convolutional neural network for semantic segmentation of medical imaging, with a limited amount of training labeled data. Our contribution is a ...
  • Creació de recursos en línea per a l’autoaprenentage, foment de l’esperit crític i emprenedor i formació d’un comité d’avaluació externa en el màster MERIT 

    Pradell i Cara, Lluís; Cabrera-Bean, Margarita; Canal Bienzobas, Fernando; Cardama Aznar, Ángel; Gasull Llampallas, Antoni; González Arbesú, José María; Giró Nieto, Xavier; Heldring, Alexander; Herranz Luis, Jaime; Jofre Roca, Lluís; Mallorquí Franquet, Jordi Joan; Nadeu Camprubí, Climent; Olmos Bonafé, Juan José; Romeu Robert, Jordi; Rius Casals, Juan Manuel; Torres Torres, Francisco; Úbeda Farré, Eduard; Coderch Collell, Marcel; Kerans, Mary Ellen; Ros Giralt, Jordi; García, Antonio; González Font, Blanca; Guardiola Garcia, Marta; Hernández, Antoni; Melgarejo, Natali; Mestres, Albert (2009-02-12T10:25:51Z)
    Conference lecture / Conference report
    Open Access
    El projecte es desenvolupa en el marc de la titulació oficial de màster MERIT del Departament de Teoria del Senyal i Comunicacions. El màster, orientat a la recerca i integrat dins del programa Erasmus Mundus, presenta ...
  • Cross-modal embeddings for video and audio retrieval 

    Surís Coll-Vinent, Dídac; Duarte, Amanda; Salvador Aguilera, Amaia; Torres Viñals, Jordi; Giró Nieto, Xavier (Springer, 2019)
    Conference report
    Open Access
    In this work, we explore the multi-modal information provided by the Youtube-8M dataset by projecting the audio and visual features into a common feature space, to obtain joint audio-visual embeddings. These links are used ...
  • Crowdsourced object segmentation with a game 

    Salvador Aguilera, Amaia; Carlier, Axel; Giró Nieto, Xavier; Marques, Oge; Charvillat, Vincent (2013)
    Conference report
    Open Access
    We introduce a new algorithm for image segmentation based on crowdsourcing through a game : Ask'nSeek. The game provides information on the objects of an image, under the form of clicks that are either on the object, ...
  • Cultural event recognition with visual ConvNets and temporal models 

    Salvador Aguilera, Amaia; Manchon Vizuete, Daniel; Calafell, Andrea; Giró Nieto, Xavier; Zeppelzauer, Matthias (2015)
    Conference lecture
    Open Access
    This paper presents our contribution to the ChaLearn Challenge 2015 on Cultural Event Classification. The challenge in this task is to automatically classify images from 50 different cultural events. Our solution is based ...
  • Demonstration of an open source framework for qualitative evaluation of CBIR systems 

    Gomez Duran, Paula; Mohedano, Eva; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel (Association for Computing Machinery (ACM), 2018)
    Conference lecture
    Open Access
    Evaluating image retrieval systems in a quantitative way, for example by computing measures like mean average precision, allows for objective comparisons with a ground-truth. However, in cases where ground-truth is not ...
  • Detection-aided liver lesion segmentation using deep learning 

    Bellver, Míriam; Maninis, Kevis-Kokitsi; Pont Tuset, Jordi; Giró Nieto, Xavier; Torres Viñals, Jordi; Van Gool, Luc (2017)
    Conference lecture
    Open Access
    A fully automatic technique for segmenting the liver and localizing its unhealthy tissues is a convenient tool in order to diagnose hepatic diseases and assess the response to the according treatments. In this work we ...
  • Digimatge, a Rich Internet Application for video retrieval from a Multimedia Asset Management system 

    Giró Nieto, Xavier; Salla, Ramon; Vives, Xavier (2010)
    Conference lecture
    Open Access
    This paper describes the integration of two new services aimed at assisting into the retrieval of video content from a Multimedia Asset Manager (MAM). The first tool suggest tags after an first textual query, and the ...
  • Distributed training strategies for a computer vision deep learning algorithm on a distributed GPU cluster 

    Campos Camunez, Victor; Sastre, Francesc; Yagües, Maurici; Bellver, Míriam; Giró Nieto, Xavier; Torres Viñals, Jordi (Elsevier, 2017)
    Conference lecture
    Open Access
    Deep learning algorithms base their success on building high learning capacity models with millions of parameters that are tuned in a data-driven fashion. These models are trained by processing millions of examples, so ...
  • Distributed training strategies for a computer vision deep learning algorithm on a distributed GPU cluster 

    Campos, Victor; Sastre, Francesc; Yagües, Maurici; Bellver, Míriam; Giró Nieto, Xavier; Torres Viñals, Jordi (Elsevier, 2017)
    Article
    Open Access
    Deep learning algorithms base their success on building high learning capacity models with millions of parameters that are tuned in a data-driven fashion. These models are trained by processing millions of examples, so ...