Recent Submissions

  • Cost-effective active learning for melanoma segmentation 

    Górriz, Marc; Giró Nieto, Xavier; Carlier, Axel; Faure, Emmanuel (2017)
    Conference lecture
    Open Access
    We propose a novel Active Learning framework capable to train effectively a convolutional neural network for semantic segmentation of medical imaging, with a limited amount of training labeled data. Our contribution is a ...
  • Towards large scale multimedia indexing: a case study on person discovery in broadcast news 

    Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc (Association for Computing Machinery (ACM), 2017)
    Conference report
    Restricted access - publisher's policy
    The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery ...
  • Two level continuous speech recognition using demisyllable-based HMM word spotting 

    Lleida Solano, Eduardo; Mariño Acebal, José Bernardo; Nadeu Camprubí, Climent; Oliveras Vergés, Albert (1991)
    Conference report
    Open Access
    This paper describes a two level Spanish Continuous Speech Recognition System based on Demisyllable HMM modelling, word-spotting and finite-state lexical and syntactic knowledge. The first level, the word level, is based ...
  • Image sequence analysis and merging 

    Salembier Clairon, Philippe Jean; Garrido Ostermann, Luis; Garcia, D (LINKÖPING UNIVERSITY, 1997)
    Conference report
    Open Access
  • Registration of images to unorganized 3D point clouds using contour cues 

    Pujol Miró, Alba; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference report
    Open Access
    Low resolution commercial 3D sensors contribute to computer vision tasks even better when the analysis is carried out in a combination with higher resolution image data. This requires registration of 2D images to ...
  • ViTS: Video tagging system from massive web multimedia collections 

    Fernàndez, Dèlia; Varas, David; Espadaler, Joan; Masuda, Issey; Ferreira, Jordi; Woodward, Alejandro; Rodríguez, David; Giró Nieto, Xavier; Riveiro, Juan Carlos; Bou Balust, Elisenda (IEEE Press, 2017)
    Conference report
    Open Access
    The popularization of multimedia content on the Web has arised the need to automatically understand, index and retrieve it. In this paper we present ViTS, an automatic Video Tagging System which learns from videos, their ...
  • More cat than cute?: interpretable prediction of adjective-noun pairs 

    Fernàndez, Dèlia; Woodward, Alejandro; Campos Camunez, Victor; Giró Nieto, Xavier; Jou, Brendan; Chang, Shih-Fu (2017)
    Conference report
    Restricted access - publisher's policy
    The increasing availability of affect-rich multimedia resources has bolstered interest in understanding sentiment and emotions in and from visual content. Adjective-noun pairs (ANP) are a popular midlevel semantic ...
  • Semantic summarization of egocentric photo stream events 

    Lidon, Aniol; Bolaños, Marc; Dimiccoli, Mariella; Radeva, Petia; Garolera Freixa, Maite; Giró Nieto, Xavier (2017)
    Conference report
    Restricted access - publisher's policy
    With the rapid increase of users of wearable cameras in recent years and of the amount of data they produce, there is a strong need for automatic retrieval and summarization techniques. This work addresses the problem of ...
  • Class-weighted convolutional features for visual instance search 

    Jiménez, Albert; Alvarez, Jose M.; Giró Nieto, Xavier (2017)
    Conference lecture
    Open Access
    Image retrieval in realistic scenarios targets large dynamic datasets of unlabeled images. In these cases, training or fine-tuning a model every time new images are added to the database is neither efficient nor scalable. ...
  • Scaling a convolutional neural network for classification of adjective noun pairs with TensorFlow on GPU clusters 

    Campos, Víctor; Sastre, Francesc; Yagües, Maurici; Torres Viñals, Jordi; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference report
    Open Access
    Deep neural networks have gained popularity in recent years, obtaining outstanding results in a wide range of applications such as computer vision in both academia and multiple industry areas. The progress made in recent ...
  • Codificacion de imagenes: un metodo de segunda generacion 

    Marqués Acosta, Fernando; Gasull Llampallas, Antoni (Universidad de Málaga, 1992)
    Conference report
    Open Access
  • Segmentacion de imagenes multiespectrales con tecnicas piramidales 

    Marqués Acosta, Fernando; Gasull Llampallas, Antoni (Universidad de Málaga, 1992)
    Conference report
    Open Access

View more