Continuous multi-objective zero-touch network slicing via twin delayed DDPG and OpenAI gym

Rezazadeh, Farhad; Chergui, Hatim; Alonso Zárate, Luis Gonzaga; Verikoukis, Christos

doi:10.1109/GLOBECOM42002.2020.9322237

Document type: Conference paper

Publication date: 2020

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Access conditions: Restricted access per publisher policy

All rights reserved. This work is protected by the corresponding intellectual and industrial property rights. Without prejudice to existing legal exemptions, its reproduction, distribution, public communication, or transformation is prohibited without the authorization of the rights holder.

Projects:
5G-SOLUTIONS - 5G Solutions for European Citizens (EC-H2020-856691)
MonB5G - Distributed management of Network Slices in beyond 5G (EC-H2020-871780)
UNICO PUNTO DE ASOCIACION EN REDES DE COMUNICACIONES MOVILES HETEROGENEAS DE 5ª GENERACION (AEI-TEC2017-87456-P)

Abstract

Artificial intelligence (AI)-driven zero-touch network slicing (NS) is a new paradigm enabling the automation of resource management and orchestration (MANO) in multi-tenant beyond 5G (B5G) networks. In this paper, we tackle the problem of cloud-RAN (C-RAN) joint slice admission control and resource allocation by first formulating it as a Markov decision process (MDP). We then invoke an advanced continuous deep reinforcement learning (DRL) method called twin delayed deep deterministic policy gradient (TD3) to solve it. To this end, we introduce a multi-objective approach that lets the central unit (CU) learn how to re-configure computing resources autonomously while minimizing latency, energy consumption, and virtual network function (VNF) instantiation cost for each slice. Moreover, we build a complete 5G C-RAN network slicing environment using the OpenAI Gym toolkit, whose standardized interface makes it easy to test with different DRL schemes. Finally, we present extensive experimental results to showcase the gain of TD3 as well as of the adopted multi-objective strategy in terms of achieved slice admission success rate, latency, energy saving, and CPU utilization.
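As a rough illustration of the kind of setup the abstract describes, the sketch below shows a toy environment that follows the classic OpenAI Gym `reset`/`step` convention (with `step` returning observation, reward, done flag, and info), takes a continuous action (a CPU share per slice), and returns a scalarized multi-objective reward that penalizes latency, energy, and VNF instantiation cost. Everything in it is an assumption made for illustration: the class name `ToySliceEnv`, the weights, and the latency, energy, and VNF cost proxies are hypothetical and are not taken from the paper. It deliberately avoids importing `gym` so that it runs with the standard library alone.

```python
import random


class ToySliceEnv:
    """Toy C-RAN slicing environment with a Gym-like reset/step interface.

    Hypothetical illustration only: the state, the cost proxies, and the
    weights are assumptions, not the formulation used in the paper.
    """

    def __init__(self, num_slices=3, weights=(1.0, 0.5, 0.2), horizon=50, seed=0):
        self.num_slices = num_slices
        self.w_latency, self.w_energy, self.w_vnf = weights
        self.horizon = horizon
        self.rng = random.Random(seed)
        self.demand = [0.0] * num_slices
        self.prev_alloc = [0.0] * num_slices
        self.t = 0

    def _sample_demand(self):
        # Assumed traffic model: independent uniform CPU demand per slice.
        return [self.rng.uniform(0.1, 0.4) for _ in range(self.num_slices)]

    def _observe(self):
        # Observation: per-slice demand followed by the previous allocation.
        return list(self.demand) + list(self.prev_alloc)

    def reset(self):
        self.t = 0
        self.demand = self._sample_demand()
        self.prev_alloc = [0.0] * self.num_slices
        return self._observe()

    def step(self, action):
        # Action: continuous CPU share per slice, clipped to [0, 1].
        alloc = [min(max(a, 0.0), 1.0) for a in action]
        total = sum(alloc)
        if total > 1.0:
            # CU capacity is normalized to 1; rescale if it is exceeded.
            alloc = [a / total for a in alloc]

        # Assumed latency proxy: CPU demand left unserved.
        latency = sum(max(d - a, 0.0) for d, a in zip(self.demand, alloc))
        # Assumed energy proxy: total CPU share in use.
        energy = sum(alloc)
        # Assumed VNF instantiation cost proxy: change in allocation.
        vnf_cost = sum(abs(a - p) for a, p in zip(alloc, self.prev_alloc))

        # Weighted-sum scalarization of the three objectives (all minimized).
        reward = -(self.w_latency * latency
                   + self.w_energy * energy
                   + self.w_vnf * vnf_cost)

        self.prev_alloc = alloc
        self.demand = self._sample_demand()
        self.t += 1
        done = self.t >= self.horizon
        info = {"latency": latency, "energy": energy, "vnf_cost": vnf_cost}
        return self._observe(), reward, done, info


def run_random_episode(env):
    """Roll out one episode with a uniformly random policy; return the total reward."""
    env.reset()
    total, done = 0.0, False
    while not done:
        action = [env.rng.random() for _ in range(env.num_slices)]
        _, reward, done, _ = env.step(action)
        total += reward
    return total
```

Because the class exposes only `reset` and `step`, a continuous-control agent such as TD3 could in principle be swapped in for the random policy in `run_random_episode` without changing the environment, which is the interchangeability the abstract attributes to the standardized Gym interface.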

Citation: Rezazadeh, F. [et al.]. Continuous multi-objective zero-touch network slicing via twin delayed DDPG and OpenAI gym. In: IEEE Global Communications Conference. "Proceedings of IEEE Globecom 2020". Institute of Electrical and Electronics Engineers (IEEE), 2020, p. 1-6. DOI 10.1109/GLOBECOM42002.2020.9322237.

URI: http://hdl.handle.net/2117/338159

DOI: 10.1109/GLOBECOM42002.2020.9322237

Publisher's version: https://ieeexplore.ieee.org/document/9322237

Other identifiers: https://zenodo.org/record/4459653#.YAylATmSmUk

Files	Description	Size	Format	Access
2020 Globecomm ... uch Network 1570641726.pdf	Article	2.292 MB	PDF	Restricted access