El GPI fa recerca en Processament d'Imatge i Vídeo per representació, codificació, indexació i anàlisi del contingut visual. L'expertesa del grup en Morfologia i Segmentació ha estat la base de contribucions als estàndards ISO MPEG-4 i MPEG-7. La recerca en anàlisi d’imatge li ha permès participar en projectes europeus des de 1992, als programes RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas), IST (Diceman, Hypermedia, INTERFACE, ADViSOR, MASCOT, FAETHON), en xarxes d'excel·lència (SCHEMA, SIMILAR, MUSCLE), i en projectes integrats FP6 (CHIL) i FP7 (FASCINATE). El grup ha construït dues “smart rooms” al Campus Nord de la UPC, i ha fet contribucions en anàlisi visual per interacció, així com en aplicacions d’imatge biomèdica i teledetecció. Ha signat convenis de recerca amb empreses com ara Philips (París), France Telecom (Rennes), NXP (Holanda), Thomson (Princeton, USA), Alterface (Bèlgica) i nacionals com Telefónica, CCRTV, MediaPro, Fundació CELLEX, Hospital Clínic, AD Telecom o Abertis.

El GPI investiga sobre procesamiento de imagen y vídeo por representación, codificación, indexación y análisis del contenido visual. La investigación del grupo en Morfología y Segmentación ha sido la base de contribuciones a los estándares ISO MPEG-4 y MPEG-7. La investigación en análisis de imagen le ha permitido participar en proyectos europeos desde 1992, en los programas RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas), IST (Diceman, Hypermedia, INTERFACE, Advisor, MASCOT, FAETHON ), en redes de excelencia (SCHEMA, SIMILAR, MUSCLE), y en proyectos integrados FP6 (CHIL) y FP7 (Fascinate). El grupo ha construido dos “smart romos” en el Campus Nord de la UPC, y ha hecho contribuciones en análisis visual por interacción, así como en aplicaciones de imagen biomédica y teledetección. Ha firmado convenios de investigación con empresas como Philips (París), France Telecom (Rennes), NXP (Holanda), Thomson (Princeton, USA), Alterface (Bélgica) y nacionales como Telefónica, CCRTV, MediaPro, Fundación CELLEX, Hospital Clínic, AD Telecom o Abertis.

The GPI does research on image and video processing for representing, coding, indexing and analysing visual content. The expertise of the group working on morphology and segmentation has been the basis for contributions to ISO standards MPEG-4 and MPEG-7. Research on image analysis has allowed it to participate in European projects since 1992, including the programs RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas) and IST (Diceman, Hypermedia, INTERFACE, ADViSOR, MASCOT, FAETHON), the networks of excellence SCHEMA, SIMILAR and MUSCLE and the integrated FP6 and FP7 projects CHIL and FASCINATE, respectively.

The GPI does research on image and video processing for representing, coding, indexing and analysing visual content. The expertise of the group working on morphology and segmentation has been the basis for contributions to ISO standards MPEG-4 and MPEG-7. Research on image analysis has allowed it to participate in European projects since 1992, including the programs RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas) and IST (Diceman, Hypermedia, INTERFACE, ADViSOR, MASCOT, FAETHON), the networks of excellence SCHEMA, SIMILAR and MUSCLE and the integrated FP6 and FP7 projects CHIL and FASCINATE, respectively.

Recent Submissions

  • Identifying the best machine learning algorithms for brain tumor segmentation, progression assessment, and overall survival prediction in the BRATS challenge 

    Bakas, Spyridon; Reyes, Mauricio; Jakab, Andras; Bauer, Stefan; Casamitjana Díaz, Adrià; Catà, Marcel; Combalia, Marc; Sanchez Muriana, Irina; Vilaplana Besler, Verónica (2019-03-19)
    External research report
    Open Access
    Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic ...
  • Refinement network for unsupervised on the scene foreground segmentation 

    Pardàs Feliu, Montse; Canet Tarrés, Gemma (European Association for Signal Processing (EURASIP), 2020)
    Conference report
    Open Access
    Unsupervised learning represents one of the most interesting challenges in computer vision today. The task has an immense practical value with many applications in artificial intelligence and emerging technologies, as large ...
  • Explore, discover and learn: unsupervised discovery of state-covering skills 

    Campos Camúñez, Víctor; Trott, Alex; Xiong, Caiming; Socher, Richard; Giró Nieto, Xavier; Torres Viñals, Jordi (2020)
    Conference lecture
    Open Access
    Acquiring abilities in the absence of a task-oriented reward function is at the frontier of reinforcement learning research. This problem has been studied through the lens of empowerment, which draws a connection between ...
  • Enhancing online knowledge graph population with semantic knowledge 

    Fernàndez, Dèlia; Rimmek, Joan Marco; Espadaler, Joan; Garolera, Blai; Barja, Adrià; Codina, Marc; Sastre, Marc; Giró Nieto, Xavier; Riveiro, Juan Carlos; Bou Balust, Elisenda (Springer, 2020-11-01)
    Part of book or chapter of book
    Restricted access - publisher's policy
    Knowledge Graphs (KG) are becoming essential to organize, represent and store the world’s knowledge, but they still rely heavily on humanly-curated structured data. Information Extraction (IE) tasks, like disambiguating ...
  • Mixed integration of CDIO skills into telecommunication engineering curricula 

    Sayrol Clols, Elisa; Bragós Bardia, Ramon; Alarcón Cot, Eduardo José; Cabrera-Bean, Margarita; Calveras Augé, Anna M.; Comellas Colomé, Jaume; O'Callaghan Castellà, Juan Manuel; Pegueroles Vallés, Josep R.; Pla, Enrique; Prat Viñas, Lluís; Sáez Moreno, Germán; Sardà Ferrer, Joan; Tallon Montoro, Carme (2010)
    Article
    Open Access
    Spain has been intensively involved in designing engineering curricula for the last two years and next academic year all engineering schools will be deploying all bachelor programs adapted to the EHEA and to the Spanish ...
  • Projection to latent spaces disentangles pathological effects on brain morphology in the asymptomatic phase of Alzheimer's disease 

    Casamitjana Díaz, Adrià; Petrone, Paula; Molinuevo, Jose Luis; Gispert, Juan Domingo; Vilaplana Besler, Verónica (2020-07-28)
    Article
    Open Access
    Alzheimer's disease (AD) continuum is defined as a cascade of several neuropathological processes that can be measured using biomarkers, such as cerebrospinal fluid (CSF) levels of Aß, p-tau, and t-tau. In parallel, brain ...
  • Super-resolution of Sensinel-2 imagery using generative adversarial networks 

    Salgueiro Romero, Luis Fernando; Marcello, Javier; Vilaplana Besler, Verónica (Multidisciplinary Digital Publishing Institute (MDPI), 2020-07-28)
    Article
    Open Access
    Sentinel-2 satellites provide multi-spectral optical remote sensing images with four bands at 10 m of spatial resolution. These images, due to the open data distribution policy, are becoming an important resource for several ...
  • FuCiTNet: improving the generalization of deep learning networks by the fusion of learned class-inherent transformations 

    Rey-Arena, Manuel; Guirado, Emilio; Tabik, Siham; Ruiz Hidalgo, Javier (Elsevier, 2020-10)
    Article
    Restricted access - publisher's policy
    It is widely known that very small datasets produce overfitting in Deep Neural Networks (DNNs), i.e., the network becomes highly biased to the data it has been trained on. This issue is often alleviated using transfer ...
  • RVOS: end-to-end recurrent network for video object segmentation 

    Ventura Royo, Carles; Bellver, Míriam; Girbau Xalabarder, Andreu; Salvador Aguilera, Amaia; Marqués Acosta, Fernando; Giró Nieto, Xavier (2019-06-15)
    Article
    Open Access
    Multiple object video object segmentation is a challenging task, specially for the zero-shot case, when no object mask is given at the initial frame and the model has to find the objects to be segmented along the sequence. ...
  • Mask-guided sample selection for semi-supervised instance segmentation 

    Bellver Bueno, Míriam; Salvador Aguilera, Amaia; Torres Viñals, Jordi; Giró Nieto, Xavier (2020-07-05)
    Article
    Restricted access - publisher's policy
    Image segmentation methods are usually trained with pixel-level annotations, which require significant human effort to collect. Weakly-supervised pipelines are the most common solution to address this constraint because ...
  • Fuji-SfM dataset: a collection of annotated images and point clouds for Fuji apple detection and location using structure-from-motion photogrammetry 

    Gené Mola, Jordi; Sanz Cortiella, Ricardo; Rosell Polo, Joan Ramon; Morros Rubió, Josep Ramon; Ruiz Hidalgo, Javier; Vilaplana Besler, Verónica; Gregorio, Eduard (Elsevier, 2020-06)
    Article
    Open Access
    The present dataset contains colour images acquired in a commercial Fuji apple orchard (Malus domestica Borkh. cv. Fuji) to reconstruct the 3D model of 11 trees by using structure-from-motion (SfM) photogrammetry. The data ...
  • Weakly supervised semantic segmentation for remote sensing hyperspectral imaging 

    Moliner, Eloi; Salgueiro Romero, Luis Fernando; Vilaplana Besler, Verónica (Institute of Electrical and Electronics Engineers (IEEE), 2020)
    Conference lecture
    Restricted access - publisher's policy
    This paper studies the problem of training a semantic segmentation neural network with weak annotations, in order to be applied in aerial vegetation images from Teide National Park. It proposes a Deep Seeded Region Growing ...

View more