3D semantic scene completion with LiDAR point clouds

dc.audience.degreeMÀSTER UNIVERSITARI EN TECNOLOGIES AVANÇADES DE TELECOMUNICACIÓ (Pla 2019)
dc.audience.educationlevelMàster
dc.audience.mediatorEscola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
dc.contributorCasas Pla, Josep Ramon
dc.contributorRuiz Hidalgo, Javier
dc.contributor.authorCortada Garcia, Martí
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
dc.date.accessioned2025-01-30T19:20:48Z
dc.date.available2025-01-30T19:20:48Z
dc.date.issued2024-07-10
dc.date.updated2024-10-28T06:49:52Z
dc.description.abstractIn recent years, the development of autonomous vehicles has shown significant potential in improving road safety by reducing traffic accidents and fatalities. One of the critical technologies enabling this advancement is LiDAR (Light Detection and Ranging), which provides precise geometric information about the environment. This master's thesis fo- cuses on 3D Semantic Scene Completion using LiDAR point clouds, a technique that aims to predict complete 3D voxel representations of scenes from incomplete LiDAR data. This task involves determining whether each voxel is occupied and assigning it a semantic label. The study reviews state-of-the-art methods for semantic scene completion, including SSA- SC, JS3C-Net, and SCPNet, which have demonstrated high performance in benchmarks like SemanticKITTI. The chosen method, SCPNet, utilizes a teacher-student framework to distill dense semantic knowledge from multi-frame point clouds (teacher) to single-frame point clouds (student). The implementation involves significant memory management and architectural optimizations to handle large datasets and computational limitations effectively. Experiments were conducted using the SemanticKITTI dataset, and the results were evaluated using mean Intersection over Union (mIoU) metrics. The thesis also explores the fusion of semantic scene completion with object detection tasks, using the nuScenes dataset to assess generalization. The findings indicate that while SCPNet shows superior performance in certain dynamic object classes, challenges remain in accurately detecting and representing moving objects like pedestrians and cyclists. Future research directions include further optimizing memory usage and improving the integration of semantic scene completion with other perception tasks.
dc.identifier.slugETSETB-230.189578
dc.identifier.urihttps://hdl.handle.net/2117/423092
dc.language.isoeng
dc.publisherUniversitat Politècnica de Catalunya
dc.rightsS'autoritza la difusió de l'obra mitjançant la llicència Creative Commons o similar 'Reconeixement-NoComercial- SenseObraDerivada'
dc.rights.accessOpen Access
dc.subjectÀrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la imatge i del senyal vídeo
dc.subject.lcshMachine learning
dc.subject.lcshComputer vision
dc.subject.lcshPattern recognition systems
dc.subject.lcshRemote sensing
dc.subject.lemacAprenentatge automàtic
dc.subject.lemacVisió per ordinador
dc.subject.lemacReconeixement de formes (Informàtica)
dc.subject.lemacTeledetecció
dc.subject.otherLiDAR
dc.subject.otherPoint Cloud
dc.subject.otherknowledge distillation
dc.subject.othersemantic scene completion
dc.title3D semantic scene completion with LiDAR point clouds
dc.typeMaster thesis
dspace.entity.typePublication

Fitxers

Paquet original

Mostrant 1 - 1 de 1
Carregant...
Miniatura
Nom:
TFM.pdf
Mida:
7.03 MB
Format:
Adobe Portable Document Format