A new deep reinforcement learning architecture for autonomous UAVs

Muñoz Ferran, Guillem

dc.contributor	Barrado Muxí, Cristina
dc.contributor.author	Muñoz Ferran, Guillem
dc.date.accessioned	2018-09-27T12:49:15Z
dc.date.available	2018-09-27T12:49:15Z
dc.date.issued	2018-09-07
dc.identifier.uri	http://hdl.handle.net/2117/121577
dc.description	Premi HEMAV 2018 al millor TFG
dc.description.abstract	Recent improvements in computation and algorithmic research, together with the rising era of Big Data, have allowed Artificial Intelligence increase its popularity within masses. The recent publication of the Deep Q-Network (DQN) algorithm, which combines Q-learning with deep neural networks, has been demonstrated as being able to learn how to solve complex task, such as playing Atari games, in an unknown environment solely by gathering experience. These conditions open the door for many other applications, such as autonomous vehicles, doctors or production chains. Moreover, the preceding work of this project was focused on building a baseline architecture for enabling Unmanned Aerial Vehicles (UAVs) learn how to behave autonomously. In this project we provide different architectures for scaling this solution. To evaluate the convergence of the algorithm, we create challenging tasks concerning obstacle avoidance and goal position reaching inside a realistic simulated environment. The provided solution allows UAVs to autonomously move in three dimensions as well as controlling and modifying their velocities. Modifications in the architecture provide different approaches for learning, which are evaluated together with its training efficiency metrics and testing results. The development has been focused on integrating Deep Learning and Reinforcement Learning tools such as Keras and OpenAI Gym in order to build a modular and accessible framework capable of training and testing DRL models for autonomous UAVs within simulated environments. Results of the carried experiments show multiple enhancements compared to previous research and work, along with providing useful insights for potentially identified improvements. In this project, we have been able to successfully beat the existent baseline Double Deep Q-Learning architecture for autonomous UAVs, obtaining a 49% more of average reward and no collisions, on a non-trivial task within a realistic simulated environment.
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/es/
dc.subject	Àrees temàtiques de la UPC::Enginyeria de la telecomunicació
dc.subject.lcsh	Data mining
dc.subject.lcsh	Information technology
dc.subject.other	Artificial Intelligence
dc.subject.other	UAVs
dc.subject.other	Deep Learning
dc.subject.other	Reinforcement Learning
dc.subject.other	Machine Learning
dc.subject.other	Autonomous vehicles
dc.subject.other	Neural Networks
dc.subject.other	Deep Reinforcement Learning
dc.subject.other	Algorithms
dc.title	A new deep reinforcement learning architecture for autonomous UAVs
dc.type	Bachelor thesis
dc.subject.lemac	Mineria de dades
dc.subject.lemac	Tecnologia de la informació
dc.description.awardwinning	Award-winning
dc.rights.access	Open Access
dc.date.updated	2018-09-11T04:29:03Z
dc.audience.educationlevel	Estudis de primer/segon cicle
dc.audience.mediator	Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels
dc.audience.degree	GRAU EN ENGINYERIA TELEMÀTICA (Pla 2009)

Fitxers d'aquest items

Nom:: memoria.pdf
Mida:: 14,59Mb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Grau en Enginyeria Telemàtica (Pla 2009) [190]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

A new deep reinforcement learning architecture for autonomous UAVs

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora