The GPI does research on image and video processing for representing, coding, indexing and analysing visual content. The group's expertise in morphology and segmentation has been the basis for contributions to the ISO standards MPEG-4 and MPEG-7. Its research on image analysis has allowed it to participate in European projects since 1992, including the programs RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas) and IST (Diceman, Hypermedia, INTERFACE, ADViSOR, MASCOT, FAETHON), the networks of excellence SCHEMA, SIMILAR and MUSCLE, and the integrated FP6 and FP7 projects CHIL and FASCINATE, respectively. The group has built two "smart rooms" at the UPC Campus Nord and has made contributions in visual analysis for interaction, as well as in biomedical imaging and remote sensing applications. It has signed research agreements with companies such as Philips (Paris), France Telecom (Rennes), NXP (Netherlands), Thomson (Princeton, USA) and Alterface (Belgium), and with domestic organisations such as Telefónica, CCRTV, MediaPro, Fundació CELLEX, Hospital Clínic, AD Telecom and Abertis.

Recent Submissions

  • SLAM-based 3D outdoor reconstructions from lidar data 

    Caminal Colell, Ivan; Casas Pla, Josep Ramon; Royo Royo, Santiago (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    Conference report
    Open Access
    The use of depth (RGBD) cameras to reconstruct large outdoor environments is not feasible due to lighting conditions and low depth range. LIDAR sensors can be used instead. Most state of the art SLAM methods are devoted ...
  • PathGAN: visual scanpath prediction with generative adversarial networks 

    Assens, Marc; Giró Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel (Springer, 2019)
    Conference lecture
    Restricted access - publisher's policy
    We introduce PathGAN, a deep neural network for visual scanpath prediction trained on adversarial examples. A visual scanpath is defined as the sequence of fixation points over an image defined by a human observer with its ...
  • Collaborative voting of 3D features for robust gesture estimation 

    van Sabben Alsina, Daniel; Ruiz Hidalgo, Javier; Suau Cuadros, Xavier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Conference lecture
    Open Access
    Human body analysis raises special interest because it enables a wide range of interactive applications. In this paper we present a gesture estimator that discriminates body poses in depth images. A novel collaborative ...
  • Cross-modal embeddings for video and audio retrieval 

    Surís Coll-Vinent, Dídac; Duarte, Amanda; Salvador Aguilera, Amaia; Torres Viñals, Jordi; Giró Nieto, Xavier (Springer, 2019)
    Conference report
    Open Access
    In this work, we explore the multi-modal information provided by the Youtube-8M dataset by projecting the audio and visual features into a common feature space, to obtain joint audio-visual embeddings. These links are used ...
  • Action tube extraction based 3D-CNN for RGB-D action recognition 

    Xu, Zhengyu; Vilaplana Besler, Verónica; Morros Rubió, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2018)
    Conference report
    Open Access
    In this paper we propose a novel action tube extractor for RGB-D action recognition in trimmed videos. The action tube extractor takes as input a video and outputs an action tube. The method consists of two parts: spatial ...
  • UPC multimodal speaker diarization system for the 2018 Albayzin challenge 

    India Massana, Miquel Àngel; Sagastiberri, Itziar; Palau Puigdevall, Ponç; Sayrol Clols, Elisa; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier (International Speech Communication Association (ISCA), 2018)
    Conference report
    Open Access
    This paper presents the UPC system proposed for the Multimodal Speaker Diarization task of the 2018 Albayzin Challenge. This approach works by processing individually the speech and the image signal. In the speech domain, ...
  • Shared latent structures between imaging features and biomarkers in early stages of Alzheimer's disease 

    Casamitjana Díaz, Adrià; Vilaplana Besler, Verónica; Petrone, Paula; Molinuevo, Jose Luis; Gispert, Juan Domingo (Springer International Publishing, 2018)
    Conference lecture
    Restricted access - publisher's policy
    In this work, we identify meaningful latent patterns in MR images for patients across the Alzheimer’s disease (AD) continuum. For this purpose, we apply Projection to Latent Structures (PLS) method using cerebrospinal fluid ...
  • Projection to latent spaces disentangles specific cerebral morphometric patterns associated to aging and preclinical AD

    Casamitjana Díaz, Adrià; Petrone, Paula; Artigues, Miquel; Molinuevo, Jose Luis; Gispert, Juan Domingo; Vilaplana Besler, Verónica (Elsevier, 2018)
    Article
    Restricted access - publisher's policy
  • Characteristic brain volumetric changes in the AD preclinical signature 

    Petrone, Paula; Casamitjana Díaz, Adrià; Falcón, Carlos; Artigues, Miquel; Operto, Grégory; Skouras, Stavros; Cacciaglia, Raffaele; Molinuevo, Jose Luis; Vilaplana Besler, Verónica; Gispert, Juan Domingo; Salvado, Gemma (Elsevier, 2018)
    Article
    Restricted access - publisher's policy
  • MRI-based screening of preclinical Alzheimer's disease for prevention clinical trials 

    Casamitjana Díaz, Adrià; Petrone, Paula; Tucholka, Alan; Falcón, Carlos; Skouras, Stavros; Molinuevo, José Luis; Vilaplana Besler, Verónica; Gispert, Juan Domingo (IOS Press, 2018)
    Article
    Open Access
    The identification of healthy individuals harboring amyloid pathology represents one important challenge for secondary prevention clinical trials in Alzheimer’s disease (AD). Consequently, noninvasive and cost-efficient ...
  • Quantitative ultrasound texture analysis of fetal lungs to predict neonatal respiratory morbidity 

    Bonet-Carne, Elisenda; Palacio, M.; Cobo, T.; Perez-Moreno, A.; Lopez, M.; Piraquive, J.P.; Ramirez, J.C.; Botet, F.; Marqués Acosta, Fernando; Gratacos, E. (Wiley (John Wiley & Sons), 2015)
    Article
    Open Access
    Objective: To develop and evaluate the performance of a novel method for predicting neonatal respiratory morbidity based on quantitative analysis of the fetal lung by ultrasound. Methods: More than 13 000 non-clinical ...
  • Differential expression of long non-coding RNAs are related to proliferation and histological diversity in follicular lymphomas 

    Roisman, Alejandro; Castellano, Giancarlo; Navarro López, Alba; Bellot, Pau; Salembier Clairon, Philippe Jean; Oliveras Vergés, Albert (2018-01-01)
    Article
    Restricted access - publisher's policy
    Long non-coding RNAs (lncRNAs) comprise a family of non-coding transcripts that are emerging as relevant gene expression regulators of different processes, including tumour development. To determine the possible contribution ...
