El GPI fa recerca en Processament d'Imatge i Vídeo per representació, codificació, indexació i anàlisi del contingut visual. L'expertesa del grup en Morfologia i segmentació ha estat la base de contribucions als estàndards ISO MPEG-4 i MPEG-7. La recerca en anàlisi d’imatge l'ha permès participar en projectes europeus des de 1992 als programes RACE (Morpheco, coord.), ACTS (MAVT, MoMuSys, Vidas), IST (Diceman, Hypermedia, INTERFACE, ADViSOR, MASCOT, FAETHON), xarxes d'excel•lència (SCHEMA, SIMILAR, MUSCLE), i projectes integrats FP6 (CHIL) i FP7 (FASCINATE). El grup ha construit dues smart rooms al Campus Nord de la UPC, i ha fet contribucions en anàlisi visual per interacció, així com en aplicacions d’imatge biomèdica i teledetecció. Ha signat convenis de recerca amb empreses com Philips (París), France Telecom (Rennes), NXP (Holanda), Thomson (Princeton, USA), Alterface (Bèlgica) i nacionals com Telefònica, CCRTV, MediaPro, Fundació CELLEX, Hospital Clínic, AD Telecom o Abertis.

http://futur.upc.edu/GPI

Enviaments recents

  • SaltiNet: scan-path prediction on 360 degree images using saliency volumes 

    Assens, Marc; Giró Nieto, Xavier; McGuinness, Kevin; O'Connor, Noel (IEEE Press, 2018)
    Comunicació de congrés
    Accés obert
    We introduce SaltiNet, a deep neural network for scan-path prediction trained on 360-degree images. The model is based on a temporal-aware novel representation of saliency information named the saliency volume. The first ...
  • Magnetic resonance imaging and machine learning make a valuable combined tool for the screening of preclinical AD 

    Petrone, Paula; Vilaplana Besler, Verónica; Casamitjana Díaz, Adrià; Domingo Gispert, Juan; Molinuevo, Jose Luis; Sánchez Escobedo, Dalila (2017-07)
    Article
    Accés restringit per política de l'editorial
  • Hierarchical stack filtering: a bitplane-based algorithm for massively parallel processors 

    Frías Velazquez, Andres; Morros Rubió, Josep Ramon; García Molina, Mario; Philips, Wilfried (2017-03-15)
    Article
    Accés obert
    With the development of novel parallel architectures for image processing, the implementation of well-known image operators needs to be reformulated to take advantage of the so-called massive parallelism. In this work, we ...
  • Cost-effective active learning for melanoma segmentation 

    Górriz, Marc; Giró Nieto, Xavier; Carlier, Axel; Faure, Emmanuel (2017)
    Comunicació de congrés
    Accés obert
    We propose a novel Active Learning framework capable to train effectively a convolutional neural network for semantic segmentation of medical imaging, with a limited amount of training labeled data. Our contribution is a ...
  • 3D hierarchical optimization for multi-view depth map coding 

    Maceira, Marc; Varas, David; Morros Rubió, Josep Ramon; Ruiz Hidalgo, Javier; Marqués Acosta, Fernando (2017-11-28)
    Article
    Accés restringit per política de l'editorial
    Depth data has a widespread use since the popularity of high resolution 3D sensors. In multi-view sequences, depth information is used to supplement the color data of each view. This article proposes a joint encoding of ...
  • Hierarchical object detection with deep reinforcement learning 

    Bellver, Míriam; Giró Nieto, Xavier; Marqués Acosta, Fernando; Torres Viñals, Jordi (IOS Press, 2017-11-23)
    Capítol de llibre
    Accés restringit per política de l'editorial
    Deep learning and image processing are two areas of great interest to academics and industry professionals alike. The areas of application of these two disciplines range widely, encompassing fields such as medicine, robotics, ...
  • Object retrieval with deep convolutional features 

    Mohedano, Eva; Salvador Aguilera, Amaia; McGuinness, Kevin; Giró Nieto, Xavier; O'Connor, Noel; Marqués Acosta, Fernando (IOS Press, 2017-11-23)
    Capítol de llibre
    Accés restringit per política de l'editorial
    Deep learning and image processing are two areas of great interest to academics and industry professionals alike. The areas of application of these two disciplines range widely, encompassing fields such as medicine, robotics, ...
  • Towards large scale multimedia indexing: a case study on person discovery in broadcast news 

    Le, Nam; Bredin, Herve; Sergent, Gabriel; India Massana, Miquel Àngel; López-Otero, Paula; Barras, Claude; Guinaudeau, Camille; Gravier, Guillaume; Barbosa da Fonseca, Gabriel; Lyon Freire, Izabela; Patrocinio Jr., Zenilton; Jamil F. Guimarães, Silvio; Martí Juan, Gerard; Morros Rubió, Josep Ramon; Hernando Pericás, Francisco Javier; Docio-Fernández, Laura; García-Mateo, Carmen; Meignier, Sylvain; Odobez, Jean-Marc (Association for Computing Machinery (ACM), 2017)
    Text en actes de congrés
    Accés restringit per política de l'editorial
    The rapid growth of multimedia databases and the human interest in their peers make indices representing the location and identity of people in audio-visual documents essential for searching archives. Person discovery ...
  • Two level continuous speech recognition using demisyllable-based HMM word spotting 

    Lleida Solano, Eduardo; Mariño Acebal, José Bernardo; Nadeu Camprubí, Climent; Oliveras Vergés, Albert (1991)
    Text en actes de congrés
    Accés obert
    This paper describes a two level Spanish Continuous Speech Recognition System based on Demisyllable HMM modelling, word-spotting and finite-state lexical and syntactic knowledge. The first level, the word level, is based ...
  • Image sequence analysis and merging 

    Salembier Clairon, Philippe Jean; Garrido Ostermann, Luis; Garcia, D (LINKÖPING UNIVERSITY, 1997)
    Text en actes de congrés
    Accés obert
  • Registration of images to unorganized 3D point clouds using contour cues 

    Pujol Miró, Alba; Ruiz Hidalgo, Javier; Casas Pla, Josep Ramon (Institute of Electrical and Electronics Engineers (IEEE), 2017)
    Text en actes de congrés
    Accés obert
    Low resolution commercial 3D sensors contribute to computer vision tasks even better when the analysis is carried out in a combination with higher resolution image data. This requires registration of 2D images to ...
  • ViTS: Video tagging system from massive web multimedia collections 

    Fernàndez, Dèlia; Varas, David; Espadaler, Joan; Masuda, Issey; Ferreira, Jordi; Woodward, Alejandro; Rodríguez, David; Giró Nieto, Xavier; Riveiro, Juan Carlos; Bou Balust, Elisenda (IEEE Press, 2017)
    Text en actes de congrés
    Accés obert
    The popularization of multimedia content on the Web has arised the need to automatically understand, index and retrieve it. In this paper we present ViTS, an automatic Video Tagging System which learns from videos, their ...

Mostra'n més