    • A Comparison of autonomous vehicle navigation simulators under regulatory and reinforcement learning constraints 

      Cabañeros, Alex; Angulo Bahón, Cecilio (IOS Press, 2019)
      Conference lecture
      Restricted access - publisher's policy
      The transition from conventional vehicles to autonomous vehicles is regulated through ADAS (Advanced Driver Assistance Systems) functionalities. The combination of different ADAS functions allows vehicles to navigate on a ...
    • A competitive strategy for function approximation in Q-learning 

      Agostini, Alejandro Gabriel; Celaya Llover, Enric (2011)
      Conference report
      Open Access
      In this work we propose an approach for generalization in continuous domain Reinforcement Learning that, instead of using a single function approximator, tries many different function approximators in parallel, each one ...
    • A novel framework for dynamic spectrum management in multiCell OFDMA networks based on reinforcement learning 

      Bernardo Álvarez, Francisco; Agustí Comes, Ramon; Pérez Romero, Jordi; Sallent Roig, José Oriol (2010)
      Conference report
      Open Access
      In this work the feasibility of Reinforcement Learning (RL) for Dynamic Spectrum Management (DSM) in the context of next generation multicell Orthogonal Frequency Division Multiple Access (OFDMA) networks is studied. ...
    • A self-organized spectrum assignment strategy in next generation OFDMA networks providing secondary spectrum access 

      Bernardo Álvarez, Francisco; Agustí Comes, Ramon; Pérez Romero, Jordi; Sallent Roig, José Oriol (2009)
      Conference report
      Open Access
      This paper proposes a Self-organized Spectrum Assignment strategy in the context of next generation multicell Orthogonal Frequency Division Multiple Access networks. The proposed strategy is able to dynamically find ...
    • Adaptive request scheduling for the I/O forwarding layer using reinforcement learning 

      Bez, Jean Luca; Zanon Boito, Francieli; Nou Castell, Ramon; Miranda Bueno, Alberto; Cortés, Toni; Navaux, Philippe O.A. (Elsevier, 2020-11)
      Article
      Restricted access - publisher's policy
      In this paper, we propose an approach to adapt the I/O forwarding layer of HPC systems to applications’ access patterns. I/O optimization techniques can improve performance for the access patterns they were designed to ...
    • An application of explainability methods in reinforcement learning 

      Climent Muñoz, Antoni (Universitat Politècnica de Catalunya, 2020-07-02)
      Bachelor thesis
      Open Access
      The popularity of explainability methods is growing in the context of Artificial Intelligence; these methods consist of giving intelligible explanations of complex models. Recently, in the context of Reinforcement Learning ...
    • An efficient RAN slicing strategy for a heterogeneous network with eMBB and V2X services 

      Resin Albonda, Haider D.; Pérez Romero, Jordi (Institute of Electrical and Electronics Engineers (IEEE), 2019-03)
      Article
      Open Access
      Emerging 5G wireless technology will support services and use cases with vastly heterogeneous requirements. Network slicing, which allows composing multiple dedicated logical networks with specific functionality running ...
    • Analysis of RAN slicing for cellular V2X and mobile broadband services based on reinforcement learning 

      Albonda, Haider Daami Resin; Pérez Romero, Jordi (European Alliance for Innovation n.o., 2020-03)
      Article
      Open Access
      Radio Access Network (RAN) slicing is one of the key enablers to provide design flexibility and enable 5G systems to support heterogeneous services over a common platform (i.e., by creating a customized slice for each ...
    • Applying and verifying an explainability method based on policy graphs in the context of reinforcement learning 

      Climent Muñoz, Antoni; Gnatyshak, Dmitry; Álvarez Napagao, Sergio (IOS Press, 2021)
      Conference report
      Open Access
      The advancement of explainability techniques is quite relevant in the field of Reinforcement Learning (RL), and its applications can be beneficial for the development of intelligent agents that are understandable by humans ...
    • Applying multi-agent reinforcement learning to solve sequential moral dilemmas 

      Choinski, Michal (Universitat Politècnica de Catalunya, 2021-04-26)
      Master thesis
      Open Access
      Covenantee: Universitat de Barcelona. Facultat de Matemàtiques i Informàtica / Universitat Rovira i Virgili
      Incorporation of ethical values in the field of Artificial Intelligence is inevitable. With the rapid development of technologies capable of making autonomous decisions, more attention should be dedicated to the process ...
    • Applying the rainbow architecture to intrusion detection systems 

      Izquierdo García-Faria, Tomás (Universitat Politècnica de Catalunya, 2021-04-28)
      Master thesis
      Open Access
      There is a lot of expectation on how Artificial Intelligence (AI) is going to have an impact on Cybersecurity. From new sophisticated attacks to new ways of defending a system from cybercriminals. A lot of techniques are ...
    • Aprendizaje por refuerzo aplicado a los videojuegos cooperativos 

      Alcocer Soto, Daniel (Universitat Politècnica de Catalunya, 2018-07)
      Bachelor thesis
      Open Access
      An algorithm has been developed that combines neural networks with the Q-learning algorithm to learn to play with the help of a human player and to cooperate with that player to win a video game based on ...
    • Aprendizaje por refuerzo aplicado a personajes no controlables en Minetest 

      Romero Reviriego, Aitor (Universitat Politècnica de Catalunya, 2019-01)
      Bachelor thesis
      Open Access
      This project uses the branch of AI known as reinforcement learning to try to develop agents for the video game Minetest that act as allies of the player.
    • Aprendizaje por refuerzo multi-nivel para sistemas RRM 

      Collados Zamora, Kevin (Universitat Politècnica de Catalunya, 2014-03-17)
      Master thesis (pre-Bologna period)
      Open Access
      This paper focuses on the problem of resource management in the field of RRM (Radio Resource Management) systems with more than one objective to maximize. Specifically, it focuses on simultaneously maximizing the quality ...
    • Aprenentatge aplicat al NL Texas Hold'em 

      Perapoch Amadó, Marçal (Universitat Politècnica de Catalunya, 2014-06-06)
      Master thesis (pre-Bologna period)
      Open Access
      This project consists of building a software system that makes it possible to observe the behavior and results of applying Artificial Intelligence techniques to the domain of the well-known card game: Poker.
    • Autonomous and energy efficient lightpath operation based on digital subcarrier multiplexing 

      Velasco Esteban, Luis Domingo; Barzegar, Sima; Sequeira, Diogo Gonçalo; Ferrari, Alessio; Costa, Nelson; Curri, Vittorio; Pedro, João; Napoli, Antonio; Ruiz Ramírez, Marc (2021-09)
      Article
      Open Access
      The massive deployment of 5G and beyond will require high capacity and low latency connectivity services, so network operators will have either to overprovision capacity in their transport networks or to upgrade the optical ...
    • Combining long-short term memory and reinforcement learning for improved autonomous network operation 

      Tabatabaeimehr, Fatemehsadat; Barzegar, Sima; Ruiz Ramírez, Marc; Velasco Esteban, Luis Domingo (Institute of Electrical and Electronics Engineers (IEEE), 2021)
      Conference report
      Open Access
      A combined LSTM and RL approach is proposed for dynamic connection capacity allocation. The LSTM predictor anticipates periodical long-term sharp traffic changes and extends short-term RL knowledge. Numerical results show ...
    • Comparison of multicast/broadcast services in long term evolution advanced and IEEE 802.16m networks 

      Calabuig Gaspar, Jorge; Monserrat del Río, José Francisco; Martín-Sacristán Gandía, David; Olmos Bonafé, Juan José (2012-03-21)
      Article
      Restricted access - publisher's policy
      This paper performs a comparison of multicast/broadcast services (MBS) support in Long Term Evolution Advanced (LTE-A) and Worldwide Interoperability for Microwave Access (WiMAX) IEEE 802.16m. Firstly, the main ...
    • Continuous-action reinforcement learning for memory allocation in virtualized servers 

      Garrido Platero, Luis Ángel; Nishtala, Rajiv; Carpenter, Paul Matthew (Springer, 2019)
      Conference report
      Open Access
      In a virtualized computing server (node) with multiple Virtual Machines (VMs), it is necessary to dynamically allocate memory among the VMs. In many cases, this is done only considering the memory demand of each VM without ...
    • Counter a drone and the performance analysis of deep reinforcement learning method and human pilot 

      Cetin, Ender; Barrado Muxí, Cristina; Pastor Llorens, Enric (Institute of Electrical and Electronics Engineers (IEEE), 2021)
      Conference report
      Open Access
      Artificial Intelligence (AI) has been used in different research areas in aerospace to create an intelligent system. Especially, an unmanned aerial vehicle (UAV), known as a drone, can be controlled by AI methods such ...