3D semantic scene completion with LiDAR point clouds
| dc.audience.degree | MÀSTER UNIVERSITARI EN TECNOLOGIES AVANÇADES DE TELECOMUNICACIÓ (Pla 2019) |
| dc.audience.educationlevel | Màster |
| dc.audience.mediator | Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona |
| dc.contributor | Casas Pla, Josep Ramon |
| dc.contributor | Ruiz Hidalgo, Javier |
| dc.contributor.author | Cortada Garcia, Martí |
| dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions |
| dc.date.accessioned | 2025-01-30T19:20:48Z |
| dc.date.available | 2025-01-30T19:20:48Z |
| dc.date.issued | 2024-07-10 |
| dc.date.updated | 2024-10-28T06:49:52Z |
| dc.description.abstract | In recent years, the development of autonomous vehicles has shown significant potential in improving road safety by reducing traffic accidents and fatalities. One of the critical technologies enabling this advancement is LiDAR (Light Detection and Ranging), which provides precise geometric information about the environment. This master's thesis fo- cuses on 3D Semantic Scene Completion using LiDAR point clouds, a technique that aims to predict complete 3D voxel representations of scenes from incomplete LiDAR data. This task involves determining whether each voxel is occupied and assigning it a semantic label. The study reviews state-of-the-art methods for semantic scene completion, including SSA- SC, JS3C-Net, and SCPNet, which have demonstrated high performance in benchmarks like SemanticKITTI. The chosen method, SCPNet, utilizes a teacher-student framework to distill dense semantic knowledge from multi-frame point clouds (teacher) to single-frame point clouds (student). The implementation involves significant memory management and architectural optimizations to handle large datasets and computational limitations effectively. Experiments were conducted using the SemanticKITTI dataset, and the results were evaluated using mean Intersection over Union (mIoU) metrics. The thesis also explores the fusion of semantic scene completion with object detection tasks, using the nuScenes dataset to assess generalization. The findings indicate that while SCPNet shows superior performance in certain dynamic object classes, challenges remain in accurately detecting and representing moving objects like pedestrians and cyclists. Future research directions include further optimizing memory usage and improving the integration of semantic scene completion with other perception tasks. |
| dc.identifier.slug | ETSETB-230.189578 |
| dc.identifier.uri | https://hdl.handle.net/2117/423092 |
| dc.language.iso | eng |
| dc.publisher | Universitat Politècnica de Catalunya |
| dc.rights | S'autoritza la difusió de l'obra mitjançant la llicència Creative Commons o similar 'Reconeixement-NoComercial- SenseObraDerivada' |
| dc.rights.access | Open Access |
| dc.subject | Àrees temàtiques de la UPC::Enginyeria de la telecomunicació::Processament del senyal::Processament de la imatge i del senyal vídeo |
| dc.subject.lcsh | Machine learning |
| dc.subject.lcsh | Computer vision |
| dc.subject.lcsh | Pattern recognition systems |
| dc.subject.lcsh | Remote sensing |
| dc.subject.lemac | Aprenentatge automàtic |
| dc.subject.lemac | Visió per ordinador |
| dc.subject.lemac | Reconeixement de formes (Informàtica) |
| dc.subject.lemac | Teledetecció |
| dc.subject.other | LiDAR |
| dc.subject.other | Point Cloud |
| dc.subject.other | knowledge distillation |
| dc.subject.other | semantic scene completion |
| dc.title | 3D semantic scene completion with LiDAR point clouds |
| dc.type | Master thesis |
| dspace.entity.type | Publication |
Fitxers
Paquet original
1 - 1 de 1

