Ir al contenido (pulsa Retorno)

Universitat Politècnica de Catalunya

    • Català
    • Castellano
    • English
    • LoginRegisterLog in (no UPC users)
  • mailContact Us
  • world English 
    • Català
    • Castellano
    • English
  • userLogin   
      LoginRegisterLog in (no UPC users)

UPCommons. Global access to UPC knowledge

Banner header
64.096 UPC academic works
You are here:
View Item 
  •   DSpace Home
  • Treballs acadèmics
  • Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels
  • Grau en Enginyeria Telemàtica (Pla 2009)
  • View Item
  •   DSpace Home
  • Treballs acadèmics
  • Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels
  • Grau en Enginyeria Telemàtica (Pla 2009)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

A new deep reinforcement learning architecture for autonomous UAVsAward-winning

Thumbnail
View/Open
memoria.pdf (14,59Mb)
Share:
 
  View Usage Statistics
Cita com:
hdl:2117/121577

Show full item record
Muñoz Ferran, Guillem
Author's e-mailguillemeetacarrobagmail.com
Tutor / directorBarrado Muxí, CristinaMés informacióMés informacióMés informació
Document typeBachelor thesis
Date2018-09-07
Rights accessOpen Access
Attribution-NonCommercial-ShareAlike 3.0 Spain
Except where otherwise noted, content on this work is licensed under a Creative Commons license : Attribution-NonCommercial-ShareAlike 3.0 Spain
Abstract
Recent improvements in computation and algorithmic research, together with the rising era of Big Data, have allowed Artificial Intelligence increase its popularity within masses. The recent publication of the Deep Q-Network (DQN) algorithm, which combines Q-learning with deep neural networks, has been demonstrated as being able to learn how to solve complex task, such as playing Atari games, in an unknown environment solely by gathering experience. These conditions open the door for many other applications, such as autonomous vehicles, doctors or production chains. Moreover, the preceding work of this project was focused on building a baseline architecture for enabling Unmanned Aerial Vehicles (UAVs) learn how to behave autonomously. In this project we provide different architectures for scaling this solution. To evaluate the convergence of the algorithm, we create challenging tasks concerning obstacle avoidance and goal position reaching inside a realistic simulated environment. The provided solution allows UAVs to autonomously move in three dimensions as well as controlling and modifying their velocities. Modifications in the architecture provide different approaches for learning, which are evaluated together with its training efficiency metrics and testing results. The development has been focused on integrating Deep Learning and Reinforcement Learning tools such as Keras and OpenAI Gym in order to build a modular and accessible framework capable of training and testing DRL models for autonomous UAVs within simulated environments. Results of the carried experiments show multiple enhancements compared to previous research and work, along with providing useful insights for potentially identified improvements. In this project, we have been able to successfully beat the existent baseline Double Deep Q-Learning architecture for autonomous UAVs, obtaining a 49% more of average reward and no collisions, on a non-trivial task within a realistic simulated environment.
Description
Premi HEMAV 2018 al millor TFG
SubjectsData mining, Information technology, Mineria de dades, Tecnologia de la informació
DegreeGRAU EN ENGINYERIA TELEMÀTICA (Pla 2009)
Award-winningAward-winning
URIhttp://hdl.handle.net/2117/121577
Collections
  • Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels - Grau en Enginyeria Telemàtica (Pla 2009) [167]
Share:
 
  View Usage Statistics

Show full item record

FilesDescriptionSizeFormatView
memoria.pdf14,59MbPDFView/Open

Browse

This CollectionBy Issue DateAuthorsOther contributionsTitlesSubjectsThis repositoryCommunities & CollectionsBy Issue DateAuthorsOther contributionsTitlesSubjects

© UPC Obrir en finestra nova . Servei de Biblioteques, Publicacions i Arxius

info.biblioteques@upc.edu

  • About This Repository
  • Contact Us
  • Send Feedback
  • Privacy Settings
  • Inici de la pàgina