Recent Submissions

  • Weakly supervised semantic segmentation for remote sensing hyperspectral imaging 

    Moliner, Eloi; Salgueiro Romero, Luis Fernando; Vilaplana Besler, Verónica (Institute of Electrical and Electronics Engineers (IEEE), 2020)
    Conference lecture
    Restricted access - publisher's policy
    This paper studies the problem of training a semantic segmentation neural network with weak annotations, in order to be applied in aerial vegetation images from Teide National Park. It proposes a Deep Seeded Region Growing ...
  • One perceptron to rule them all: language, vision, audio and speech 

    Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2020)
    Conference lecture
    Restricted access - publisher's policy
    Deep neural networks have boosted the convergence of multimedia data analytics in a unified framework shared by practitioners in natural language, vision and speech. Image captioning, lip reading or video sonorization are ...
  • Automatic reminiscence therapy for dementia 

    Carós, Mariona; Garolera Freixa, Maite; Radeva, Petia; Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2020)
    Conference lecture
    Restricted access - publisher's policy
    With people living longer than ever, the number of cases with dementia such as Alzheimer's disease increases steadily. It affects more than 46 million people worldwide, and it is estimated that in 2050 more than 100 million ...
  • Audience measurement using a top-view camera and oriented trajectories 

    López Palma, Manuel; Gago Barrio, Javier; Corbalán Fuertes, Montserrat; Morros Rubió, Josep Ramon (2019)
    Conference report
    Restricted access - publisher's policy
    A crucial aspect for selecting optimal areas for commercial advertising is the probability with which that publicity will be seen. This paper presents a method based on top-view camera measurement, where the probability ...
  • VLX-Stories: building an online Event Knowledge Base with Emerging Entity detection 

    Fernández Cañellas, Dèlia; Espadaler, Joan; Rodríguez, David; Garolera, Blai; Canet Tarrés, Gemma; Colom Serra, Aleix; Rimmek, Joan Marco; Giró Nieto, Xavier; Bou Balust, Elisenda; Riveiro, Juan Carlos (Springer, 2019)
    Conference lecture
    Restricted access - publisher's policy
    We present an online multilingual system for event detection and comprehension from media feeds. The system retrieves information from news sites, aggregates them into events (event detection), and summarizes them by ...
  • Budget-aware semi-supervised semantic and instance segmentation 

    Bellver Bueno, Míriam; Salvador Aguilera, Amaia; Torres Viñals, Jordi; Giró Nieto, Xavier (2019)
    Conference lecture
    Open Access
    Methods that move towards less supervised scenarios are key for image segmentation, as dense labels demand significant human intervention. Generally, the annotation burden is mitigated by labeling datasets with weaker forms ...
  • Residual attention graph convolutional network for geometric 3D scene classification 

    Mosella Montoro, Albert; Ruiz Hidalgo, Javier (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference report
    Restricted access - publisher's policy
    Geometric 3D scene classification is a very challenging task. Current methodologies extract the geometric information using only a depth channel provided by an RGB-D sensor. These kinds of methodologies introduce possible ...
  • VLX-Stories: a semantically linked event platform for media publishers 

    Fernández Cañellas, Dèlia; Espadaler, Joan; Garolera, Blai; Rodríguez, David; Canet, Gemma; Colom, Aleix; Rimmek, Joan Marco; Giró Nieto, Xavier; Bou Balust, Elisenda; Riveiro, Juan Carlos (CEUR-WS.org, 2019)
    Conference lecture
    Open Access
    In the recent years, video sharing in social media from different video recording devices has resulted in a exponential growth of videos on the Internet. Such video data is continuously increasing with daily recordings ...
  • Hyperparameter-free losses for model-based monocular reconstruction 

    Ramon Maldonado, Eduard; Ruiz, Guillermo; Batard, Thomas; Giró Nieto, Xavier (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference lecture
    Open Access
    This work proposes novel hyperparameter-free losses for single view 3D reconstruction with morphable models (3DMM). We dispense with the hyperparameters used in other works by exploiting geometry, so that the shape of the ...
  • Picking groups instead of samples: a close look at Static Pool-based Meta-Active Learning 

    Mas Méndez, Ignasi; Morros Rubió, Josep Ramon; Vilaplana Besler, Verónica (Institute of Electrical and Electronics Engineers (IEEE), 2019)
    Conference lecture
    Open Access
    Active Learning techniques are used to tackle learning problems where obtaining training labels is costly. In this work we use Meta-Active Learning to learn to select a subset of samples from a pool of unsupervised input ...
  • Simple vs complex temporal recurrences for video saliency prediction 

    Linardos, Panagiotis; Mohedano, Eva; Nieto, Juan Jose; O'Connor, Noel; Giró Nieto, Xavier; McGuinness, Kevin (2019)
    Conference lecture
    Restricted access - publisher's policy
    This paper investigates modifying an existing neural network architecture for static saliency prediction using two types of recurrences that integrate information from the temporal domain. The first modification is the ...
  • Video object linguistic grounding 

    Herrera-Palacio, Alba; Ventura, Carles; Giró Nieto, Xavier (Association for Computing Machinery (ACM), 2019)
    Conference lecture
    Restricted access - publisher's policy
    The goal of this work is segmenting on a video sequence the objects which are mentioned in a linguistic description of the scene. We have adapted an existing deep neural network that achieves state of the art performance ...

View more