Mostra el registre d'ítem simple
Deep reinforcement learning as control method for autonomous UAVs
dc.contributor | Barrado Muxí, Cristina |
dc.contributor.author | Kersandt, Kjell |
dc.date.accessioned | 2018-02-08T15:01:01Z |
dc.date.available | 2018-02-08T15:01:01Z |
dc.date.issued | 2018-02-06 |
dc.identifier.uri | http://hdl.handle.net/2117/113948 |
dc.description.abstract | Deep Reinforcement Learning (DRL) is attracting increasing interest due to its ability to learn how to solve complex tasks in an unknown environment solely by gathering experience. In this thesis, we investigate the use of DRL methods on the vision-based control of an autonomous quadcopter within a simulated environment. More specifically we employ an algorithm called Deep Q-network and two extensions involving the concept of Double Q-learning and Dueling Architecture. To evaluate the algorithms, we create a challenging task that concern obstacle avoidance and goal position reaching. Due to the lack of available tools that would combine the simulation of drones and the accessibility of DRL methods, we contribute AirGym as a framework that offers a convenient implementation of our task an these of following researchers. The results of the study support the idea of full control of an autonomous drone through DRL methods since we achieved an 80% success rate in solving the task under a near human-level of performance. This achievement is enhanced by considering the relatively short training time and the identification of further improvements. |
dc.language.iso | eng |
dc.publisher | Universitat Politècnica de Catalunya |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/es/ |
dc.subject | Àrees temàtiques de la UPC::Aeronàutica i espai::Aeronaus::Avions |
dc.subject.lcsh | Drone aircraft |
dc.subject.other | Reinforcement learning |
dc.subject.other | Deep learning |
dc.subject.other | Autonomous UAV |
dc.subject.other | Optimal control |
dc.subject.other | Neural networks |
dc.subject.other | Simulation |
dc.title | Deep reinforcement learning as control method for autonomous UAVs |
dc.type | Master thesis |
dc.subject.lemac | Avions no tripulats |
dc.rights.access | Open Access |
dc.date.updated | 2018-02-07T05:24:32Z |
dc.audience.educationlevel | Estudis de primer/segon cicle |
dc.audience.mediator | Escola d'Enginyeria de Telecomunicació i Aeroespacial de Castelldefels |