Envíos recientes

  • Skip RNN: learning to skip state updates in recurrent neural networks 

    Campos Camunez, Victor; Jou, Brendan; Giró Nieto, Xavier; Torres Viñals, Jordi; Chang, Shih-Fu (2018)
    Comunicación de congreso
    Acceso abierto
    Recurrent Neural Networks (RNNs) continue to show outstanding performance in sequence modeling tasks. However, training RNNs on long sequences often face challenges like slow inference, vanishing gradients and difficulty ...
  • Foreground objects segmentation for moving camera scenarios based on SCGMM 

    Gallego Vila, Jaime; Pardàs Feliu, Montse; Solano, Montse (2011)
    Texto en actas de congreso
    Acceso restringido por política de la editorial
    In this paper we present a new system for segmenting non-rigid objects in moving camera sequences for indoor and outdoor scenarios that achieves a correct object segmentation via global MAP-MRF framework formulation for ...
  • Motion analysis of image sequences using connected operators 

    Garrido Ostermann, Luis; Oliveras Vergés, Albert; Salembier Clairon, Philippe Jean (International Society for Photo-Optical Instrumentation Engineers (SPIE), 1997)
    Texto en actas de congreso
    Acceso restringido por política de la editorial
    This paper deals with a class of morphological operators called connected operators. These operators interact with the signal by merging flat zones. As a results, they do not create any new contours and are very attractive ...
  • Active mesh coding and rate-distortion theory 

    Salembier Clairon, Philippe Jean; Martí Navarro, Eva; Pardàs Feliu, Montse (Institute of Electrical and Electronics Engineers (IEEE), 1996)
    Comunicación de congreso
    Acceso abierto
    This paper presents a video coding scheme for very low bit rate applications. The coding approach relies on active meshes and can be viewed as a particular case of region-based coding. The active mesh is used to efficiently ...
  • SaltiNet: scan-path prediction on 360 degree images using saliency volumes 

    Assens, Marc; Giró Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel (IEEE Press, 2018)
    Comunicación de congreso
    Acceso abierto
    We introduce SaltiNet, a deep neural network for scan-path prediction trained on 360-degree images. The model is based on a temporal-aware novel representation of saliency information named the saliency volume. The first ...
  • Cost-effective active learning for melanoma segmentation 

    Górriz, Marc; Giró Nieto, Xavier; Carlier, Axel; Faure, Emmanuel (2017)
    Comunicación de congreso
    Acceso abierto
    We propose a novel Active Learning framework capable to train effectively a convolutional neural network for semantic segmentation of medical imaging, with a limited amount of training labeled data. Our contribution is a ...
  • Towards large scale multimedia indexing: a case study on person discovery in broadcast news 

    Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc (Association for Computing Machinery (ACM), 2017)
    Texto en actas de congreso
    Acceso restringido por política de la editorial
    The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery ...
  • Two level continuous speech recognition using demisyllable-based HMM word spotting 

    Lleida Solano, Eduardo; Mariño Acebal, José Bernardo; Nadeu Camprubí, Climent; Oliveras Vergés, Albert (1991)
    Texto en actas de congreso
    Acceso abierto
    This paper describes a two level Spanish Continuous Speech Recognition System based on Demisyllable HMM modelling, word-spotting and finite-state lexical and syntactic knowledge. The first level, the word level, is based ...
  • Image sequence analysis and merging 

    Salembier Clairon, Philippe Jean; Garrido Ostermann, Luis; Garcia, D (LINKÖPING UNIVERSITY, 1997)
    Texto en actas de congreso
    Acceso abierto
  • Registration of images to unorganized 3D point clouds using contour cues 

    Pujol Miró, Alba; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Texto en actas de congreso
    Acceso abierto
    Low resolution commercial 3D sensors contribute to computer vision tasks even better when the analysis is carried out in a combination with higher resolution image data. This requires registration of 2D images to ...
  • ViTS: Video tagging system from massive web multimedia collections 

    Fernàndez, Dèlia; Varas, David; Espadaler, Joan; Masuda, Issey; Ferreira, Jordi; Woodward, Alejandro; Rodríguez, David; Giró Nieto, Xavier; Riveiro, Juan Carlos; Bou Balust, Elisenda (IEEE Press, 2017)
    Texto en actas de congreso
    Acceso abierto
    The popularization of multimedia content on the Web has arised the need to automatically understand, index and retrieve it. In this paper we present ViTS, an automatic Video Tagging System which learns from videos, their ...
  • More cat than cute?: interpretable prediction of adjective-noun pairs 

    Fernàndez, Dèlia; Woodward, Alejandro; Campos Camunez, Victor; Giró Nieto, Xavier; Jou, Brendan; Chang, Shih-Fu (2017)
    Texto en actas de congreso
    Acceso restringido por política de la editorial
    The increasing availability of affect-rich multimedia resources has bolstered interest in understanding sentiment and emotions in and from visual content. Adjective-noun pairs (ANP) are a popular midlevel semantic ...

Muestra más