Mostra el registre d'ítem simple

dc.contributor.authorMosella Montoro, Albert
dc.contributor.authorRuiz Hidalgo, Javier
dc.contributor.otherUniversitat Politècnica de Catalunya. Doctorat en Teoria del Senyal i Comunicacions
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2021-05-31T14:32:16Z
dc.date.available2023-12-01T01:30:47Z
dc.date.issued2021-12
dc.identifier.citationMosella, A.; Ruiz-Hidalgo, J. 2D–3D geometric fusion network using multi-neighbourhood graph convolution for RGB-D indoor scene classification. "Information fusion", Desembre 2021, vol. 76, p. 46-54.
dc.identifier.issn1566-2535
dc.identifier.urihttp://hdl.handle.net/2117/346435
dc.description.abstractMulti-modal fusion has been proved to help enhance the performance of scene classification tasks. This paper presents a 2D-3D Fusion stage that combines 3D Geometric Features with 2D Texture Features obtained by 2D Convolutional Neural Networks. To get a robust 3D Geometric embedding, a network that uses two novel layers is proposed. The first layer, Multi-Neighbourhood Graph Convolution, aims to learn a more robust geometric descriptor of the scene combining two different neighbourhoods: one in the Euclidean space and the other in the Feature space. The second proposed layer, Nearest Voxel Pooling, improves the performance of the well-known Voxel Pooling. Experimental results, using NYU-Depth-V2 and SUN RGB-D datasets, show that the proposed method outperforms the current state-of-the-art in RGB-D indoor scene classification task.
dc.description.sponsorshipThis work was supported by Secretary of Universities and Research of the Generalitat de Catalunya and the European Social Fund via a PhD grant (FI2018) in the framework of project TEC2016-75976-R, financed by the Ministerio de Economía, Industria y Competitividad and the European Regional Development Fund (ERDF).
dc.format.extent9 p.
dc.language.isoeng
dc.publisherElsevier
dc.subject.lcshNeural networks (Computer science)
dc.subject.lcshMultisensor data fusion
dc.subject.otherConvolutional Graph Neural Network
dc.subject.otherMulti-modal fusion
dc.subject.otherMulti-Neighbourhood Graph Neural Network
dc.subject.otherIndoor scene classification
dc.subject.otherRGB-D
dc.title2D–3D geometric fusion network using multi-neighbourhood graph convolution for RGB-D indoor scene classification
dc.typeArticle
dc.subject.lemacXarxes neuronals (Informàtica)
dc.subject.lemacFusió d'imatges
dc.contributor.groupUniversitat Politècnica de Catalunya. GPI - Grup de Processament d'Imatge i Vídeo
dc.identifier.doi10.1016/j.inffus.2021.05.002
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttps://www.sciencedirect.com/science/article/pii/S1566253521001032
dc.rights.accessOpen Access
local.identifier.drac31757795
dc.description.versionPostprint (author's final draft)
dc.relation.projectidinfo:eu-repo/grantAgreement/MINECO/2PE/TEC2016-75976-R
local.citation.authorMosella, A.; Ruiz-Hidalgo, J.
local.citation.publicationNameInformation fusion
local.citation.volume76
local.citation.startingPage46
local.citation.endingPage54


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple