Deep reinforcement learning as control method for autonomous UAVs

Kersandt, Kjell

dc.contributor	Barrado Muxí, Cristina
dc.contributor.author	Kersandt, Kjell
dc.date.accessioned	2018-02-08T15:01:01Z
dc.date.available	2018-02-08T15:01:01Z
dc.date.issued	2018-02-06
dc.identifier.uri	http://hdl.handle.net/2117/113948
dc.description.abstract	Deep Reinforcement Learning (DRL) is attracting increasing interest due to its ability to learn how to solve complex tasks in an unknown environment solely by gathering experience. In this thesis, we investigate the use of DRL methods on the vision-based control of an autonomous quadcopter within a simulated environment. More specifically we employ an algorithm called Deep Q-network and two extensions involving the concept of Double Q-learning and Dueling Architecture. To evaluate the algorithms, we create a challenging task that concern obstacle avoidance and goal position reaching. Due to the lack of available tools that would combine the simulation of drones and the accessibility of DRL methods, we contribute AirGym as a framework that offers a convenient implementation of our task an these of following researchers. The results of the study support the idea of full control of an autonomous drone through DRL methods since we achieved an 80% success rate in solving the task under a near human-level of performance. This achievement is enhanced by considering the relatively short training time and the identification of further improvements.
dc.language.iso	eng
dc.publisher	Universitat Politècnica de Catalunya
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/es/
dc.subject	Àrees temàtiques de la UPC::Aeronàutica i espai::Aeronaus::Avions
dc.subject.lcsh	Drone aircraft
dc.subject.other	Reinforcement learning
dc.subject.other	Deep learning
dc.subject.other	Autonomous UAV
dc.subject.other	Optimal control
dc.subject.other	Neural networks
dc.subject.other	Simulation
dc.title	Deep reinforcement learning as control method for autonomous UAVs
dc.type	Master thesis
dc.subject.lemac	Avions no tripulats
dc.rights.access	Open Access
dc.date.updated	2018-02-07T05:24:32Z
dc.audience.educationlevel	Estudis de primer/segon cicle
dc.audience.mediator	Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels

Fitxers d'aquest items

Nom:: memoria.pdf
Mida:: 18,71Mb
Format:: PDF

Visualitza/Obre

Aquest ítem apareix a les col·leccions següents

Master's degree in Aerospace Science and Technology - MAST (Pla 2015) [160]

Mostra el registre d'ítem simple

UPCommons. Portal del coneixement obert de la UPC

Deep reinforcement learning as control method for autonomous UAVs

Fitxers d'aquest items

Aquest ítem apareix a les col·leccions següents

Explora