Now showing items 1-20 of 64

  • Acoustic event detection based on feature-level fusion of audio and video modalities 

    Butko, Taras; Canton Ferrer, Cristian; Segura Perales, Carlos; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (HINDAWI, 2011-03-15)
    Article
    Open Access
    Acoustic event detection (AED) aims at determining the identity of sounds and their temporal position in audio signals. When applied to spontaneously generated acoustic events, AED based only on audio information shows a ...
  • Assessment of crowdsourcing and gamification loss in user-assisted object segmentation 

    Carlier, Axel; Salvador Aguilera, Amaia; Cabezas, Ferran; Giró Nieto, Xavier; Charvillat, Vincent; Marques, Oge (2015-09-12)
    Article
    Open Access
    There has been a growing interest in applying human computation – particularly crowdsourcing techniques – to assist in the solution of multimedia, image processing, and computer vision problems which are still too difficult ...
  • Audiovisual event detection towards scene understanding 

    Canton Ferrer, Cristian; Butko, Taras; Segura, C.; Giró Nieto, Xavier; Nadeu Camprubí, Climent; Hernando Pericás, Francisco Javier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2009)
    Conference report
    Restricted access - publisher's policy
    Acoustic events produced in meeting environments may contain useful information for perceptually aware interfaces and multimodal behavior analysis. In this paper, a system to detect and recognize these events from a ...
  • Automatic keyframe selection based on mutual reinforcement algorithm 

    Ventura Royo, Carles; Giró Nieto, Xavier; Vilaplana Besler, Verónica; Giribet, Daniel; Carasusan, Eusebio (Institute of Electrical and Electronics Engineers (IEEE), 2013)
    Conference report
    Restricted access - publisher's policy
    This paper addresses the problem of video summarization through an automatic selection of a single representative keyframe. The proposed solution is based on the mutual reinforcement paradigm, where a keyframe is selected ...
  • Bags of local convolutional features for scalable instance search 

    Mohedano, Eva; Salvador Aguilera, Amaia; McGuinness, Kevin; Marqués Acosta, Fernando; O'Connor, Noel; Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2016)
    Conference lecture
    Open Access
    This work proposes a simple instance retrieval pipeline based on encoding the convolutional features of CNN using the bag of words aggregation scheme (BoW). Assigning each local array of activations in a convolutional layer ...
  • BitSearch, the blog before the thesis 

    Giró Nieto, Xavier (2010)
    Conference lecture
    Open Access
  • Class-weighted convolutional features for visual instance search 

    Jiménez, Albert; Alvarez, Jose M.; Giró Nieto, Xavier (2017)
    Conference lecture
    Open Access
    Image retrieval in realistic scenarios targets large dynamic datasets of unlabeled images. In these cases, training or fine-tuning a model every time new images are added to the database is neither efficient nor scalable. ...
  • Click'n'Cut: crowdsourced interactive segmentation with object candidates 

    Carlier, Axel; Charvillat, Vincent; Salvador Aguilera, Amaia; Giró Nieto, Xavier; Marques, Ogé (ACM, 2014)
    Conference lecture
    Open Access
    This paper introduces Click’n’Cut, a novel web tool for inter- active object segmentation designed for crowdsourcing tasks. Click’n’Cut combines bounding boxes and clicks generated by workers to obtain accurate object ...
  • Cost-effective active learning for melanoma segmentation 

    Górriz, Marc; Giró Nieto, Xavier; Carlier, Axel; Faure, Emmanuel (2017)
    Conference lecture
    Open Access
    We propose a novel Active Learning framework capable to train effectively a convolutional neural network for semantic segmentation of medical imaging, with a limited amount of training labeled data. Our contribution is a ...
  • Creació de recursos en línea per a l’autoaprenentage, foment de l’esperit crític i emprenedor i formació d’un comité d’avaluació externa en el màster MERIT 

    Pradell i Cara, Lluís; Cabrera Beán, Margarita Asuncion; Canal Bienzobas, Fernando; Cardama Aznar, Ángel; Gasull Llampallas, Antoni; González Arbesú, José María; Giró Nieto, Xavier; Heldring, Alexander; Herranz Luis, Jaime; Jofre Roca, Lluís; Mallorquí Franquet, Jordi Joan; Nadeu Camprubí, Climent; Olmos Bonafé, Juan José; Romeu Robert, Jordi; Rius Casals, Juan Manuel; Torres Torres, Francisco; Úbeda Farré, Eduard; Coderch Collell, Marcel; Kerans, Mary Ellen; Ros Giralt, Jordi; García, Antonio; González Font, Blanca; Guardiola Garcia, Marta; Hernández, Antoni; Melgarejo, Natali; Mestres, Albert (2009-02-12T10:25:51Z)
    Conference lecture / Conference report
    Open Access
    El projecte es desenvolupa en el marc de la titulació oficial de màster MERIT del Departament de Teoria del Senyal i Comunicacions. El màster, orientat a la recerca i integrat dins del programa Erasmus Mundus, presenta ...
  • Crowdsourced object segmentation with a game 

    Salvador, Amaia; Carlier, Axel; Giró Nieto, Xavier; Marques, Oge; Charvillat, Vincent (2013)
    Conference report
    Open Access
    We introduce a new algorithm for image segmentation based on crowdsourcing through a game : Ask'nSeek. The game provides information on the objects of an image, under the form of clicks that are either on the object, ...
  • Cultural event recognition with visual ConvNets and temporal models 

    Salvador Aguilera, Amaia; Manchon Vizuete, Daniel; Calafell, Andrea; Giró Nieto, Xavier; Zeppelzauer, Matthias (2015)
    Conference lecture
    Open Access
    This paper presents our contribution to the ChaLearn Challenge 2015 on Cultural Event Classification. The challenge in this task is to automatically classify images from 50 different cultural events. Our solution is based ...
  • Digimatge, a Rich Internet Application for video retrieval from a Multimedia Asset Management system 

    Giró Nieto, Xavier; Salla, Ramon; Vives, Xavier (2010)
    Conference lecture
    Open Access
    This paper describes the integration of two new services aimed at assisting into the retrieval of video content from a Multimedia Asset Manager (MAM). The first tool suggest tags after an first textual query, and the ...
  • Distributed training strategies for a computer vision deep learning algorithm on a distributed GPU cluster 

    Campos Camunez, Victor; Sastre, Francesc; Yagües, Maurici; Bellver, Míriam; Giró Nieto, Xavier; Torres Viñals, Jordi (Elsevier, 2017)
    Conference lecture
    Open Access
    Deep learning algorithms base their success on building high learning capacity models with millions of parameters that are tuned in a data-driven fashion. These models are trained by processing millions of examples, so ...
  • Distributed training strategies for a computer vision deep learning algorithm on a distributed GPU cluster 

    Campos, Victor; Sastre, Francesc; Yagües, Maurici; Bellver, Míriam; Giró Nieto, Xavier; Torres Viñals, Jordi (Elsevier, 2017)
    Article
    Open Access
    Deep learning algorithms base their success on building high learning capacity models with millions of parameters that are tuned in a data-driven fashion. These models are trained by processing millions of examples, so ...
  • Diversity ranking for video retrieval from a broadcaster archive 

    Giró Nieto, Xavier; Alfaro Vendrell, Mónica; Marqués Acosta, Fernando (2011)
    Conference lecture
    Open Access
    Video retrieval through text queries is a very common practice in broadcaster archives. The query keywords are compared to the metadata labels that documentalists have previously associated to the video assets. This ...
  • Diving deep into sentiment: understanding fine-tuned CNNs for visual sentiment prediction 

    Campos Camúñez, Victor; Salvador Aguilera, Amaia; Jou, Brendan; Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2015)
    Conference lecture
    Open Access
    Visual media are powerful means of expressing emotions and sentiments. The constant generation of new content in social networks highlights the need of automated visual sentiment analysis tools. While Convolutional Neural ...
  • Event video retrieval using global and local descriptors in visual domain 

    Roldan-Carlos, Jennifer; Lux, Mathias; Giró Nieto, Xavier; Muñoz Trallero, Pia; Anagnostopoulos, Nektarios (Institute of Electrical and Electronics Engineers (IEEE), 2015)
    Conference lecture
    Open Access
    With the advent of affordable multimedia smart phones, it has become common that people take videos when they are at events. The larger the event, the larger is the amount of videos taken there and also, the more videos ...
  • Exploring EEG for object detection and retrieval 

    Mohedano Robles, Eva; Salvador Aguilera, Amaia; Porta, Sergi; Giró Nieto, Xavier; Healy, Graham; McGuinness, Kevin; O'Connor, Noel; Smeaton, Alan F. (Association for Computing Machinery (ACM), 2015)
    Conference lecture
    Open Access
    This paper explores the potential for using Brain Computer Interfaces (BCI) as a relevance feedback mechanism in content-based image retrieval. Several experiments are performed using a rapid serial visual presentation ...
  • From global image annotation to interactive object segmentation 

    Giró Nieto, Xavier; Martos Asensio, Manel; Mohedano Robles, Eva; Pont Tuset, Jordi (2013-02-16)
    Article
    Restricted access - publisher's policy
    This paper presents a graphical environment for the annotation of still images that works both at the global and local scales. At the global scale, each image can be tagged with positive, negative and neutral labels referred ...