Ir al contenido (pulsa Retorno)

Universitat Politècnica de Catalunya

    • Català
    • Castellano
    • English
    • LoginRegisterLog in (no UPC users)
  • mailContact Us
  • world English 
    • Català
    • Castellano
    • English
  • userLogin   
      LoginRegisterLog in (no UPC users)

UPCommons. Global access to UPC knowledge

Banner header
66.422 UPC academic works
You are here:
View Item 
  •   DSpace Home
  • Treballs acadèmics
  • Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
  • Grau en Enginyeria de Tecnologies i Serveis de Telecomunicació (Pla 2015)
  • View Item
  •   DSpace Home
  • Treballs acadèmics
  • Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona
  • Grau en Enginyeria de Tecnologies i Serveis de Telecomunicació (Pla 2015)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Deep learning for object tracking in 3D point clouds

Thumbnail
View/Open
thesis.pdf (2,826Mb)
  View Usage Statistics
  LA Referencia / Recolecta stats
Cita com:
hdl:2117/329126

Show full item record
Colom Hernández, Jaume
Tutor / directorCasas Pla, Josep RamonMés informacióMés informacióMés informació; Osep, Aljosa
CovenanteeTechnische Universität München
Document typeBachelor thesis
Date2020-06
Rights accessOpen Access
All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to any existing legal exemptions, reproduction, distribution, public communication or transformation of this work are prohibited without permission of the copyright holder
Abstract
Great progress has been achieved in computer vision tasks within image and video; however, technological advances in LiDAR sensors have created a whole new area of computer vision research devoted to it. With applications in many industries, such as transportation, agriculture, or healthcare. This thesis studies object tracking in 3D point clouds. Pairs of point cloud observations are feed to a neural network to estimate pose and translation between the observations. Then these estimations, together with external features, are processed with Kalman Filter and RNN to extract spatial-temporal redundancies and improve the results. The system has been tested in the KITTI dataset, with pre-segmented observations, on different types of objects and paths. The results show that the neural network estimated pose gives a very accurate tracking. Still, the best results are achieved when combining the estimated pose and translations with a recurrent neural network.
 
Great progress has been achieved in computer vision tasks within image and video; however, technological advances in LiDAR sensors have created a whole new area of computer vision research devoted to it. With applications in many industries, such as transportation, agriculture, or healthcare. This thesis studies object tracking in 3D point clouds. Pairs of point cloud observations are feed to a neural network to estimate pose and translation between the observations. Then these estimations, together with external features, are processed with Kalman Filter and RNN to extract spatial-temporal redundancies and improve the results. The system has been tested in the KITTI dataset, with pre-segmented observations, on different types of objects and paths. The results show that the neural network estimated pose gives a very accurate tracking. Still, the best results are achieved when combining the estimated pose and translations with a recurrent neural network.
 
Great progress has been achieved in computer vision tasks within image and video; however, technological advances in LiDAR sensors have created a whole new area of computer vision research devoted to it. With applications in many industries, such as transportation, agriculture, or healthcare. This thesis studies object tracking in 3D point clouds. Pairs of point cloud observations are feed to a neural network to estimate pose and translation between the observations. Then these estimations, together with external features, are processed with Kalman Filter and RNN to extract spatial-temporal redundancies and improve the results. The system has been tested in the KITTI dataset, with pre-segmented observations, on different types of objects and paths. The results show that the neural network estimated pose gives a very accurate tracking. Still, the best results are achieved when combining the estimated pose and translations with a recurrent neural network.
SubjectsMachine learning, Artificial intelligence, Neural networks (Computer science), Robot vision, Aprenentatge automàtic, Intel·ligència artificial, Xarxes neuronals (Informàtica), Visió artificial (Robòtica)
DegreeGRAU EN ENGINYERIA DE TECNOLOGIES I SERVEIS DE TELECOMUNICACIÓ (Pla 2015)
URIhttp://hdl.handle.net/2117/329126
Collections
  • Escola Tècnica Superior d'Enginyeria de Telecomunicació de Barcelona - Grau en Enginyeria de Tecnologies i Serveis de Telecomunicació (Pla 2015) [779]
  View Usage Statistics

Show full item record

FilesDescriptionSizeFormatView
thesis.pdf2,826MbPDFView/Open

Browse

This CollectionBy Issue DateAuthorsOther contributionsTitlesSubjectsThis repositoryCommunities & CollectionsBy Issue DateAuthorsOther contributionsTitlesSubjects

© UPC Obrir en finestra nova . Servei de Biblioteques, Publicacions i Arxius

info.biblioteques@upc.edu

  • About This Repository
  • Contact Us
  • Send Feedback
  • Privacy Settings
  • Inici de la pàgina