Machine Learning Applied to Network Traffic

Rodríguez Segado, David

Visualitza/Obre

memoria.pdf (6,477Mb) (Accés restringit)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Rodríguez Segado, David

Tutor / directorPerera Lluna, Alexandre

; Romero Ruiz, Iván

Realitzat a/ambStarflow Networks

Tipus de documentProjecte Final de Màster Oficial

Data2019-06

Condicions d'accésAccés restringit per decisió de l'autor

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

The appliance of machine learning to TCP/IP traffic flows is not new. However, this projects aims to use it to predict the congestion avoidance algorithm at the first second of a data transference. Being able to recognize the congestion avoidance strategy that is being used, would improve flow control, allowing to act proactively instead of reactively. For this project, the flows are generated using NS-3 simulator. It provides an structure that can simulate the behaviour of the data transference through internet, allowing to extract information through pcap files. Wireshark has been used to extract the information that will be necessary to collect time series data and statistics of them. With this information available, there are proposed some machine learning methods to see if they can, using different sets of information representing the performance of the flows, distinguish between 8 different congestion avoidance algorithms: TCP BIC, TCP Highspeed, H-TCP, TCP Illinois, TCP Vegas, TCP Veno, TCP Westwood and TCP Yeah. None of the attempts allow a test error significantly lower than 50%, some algorithms are having performance too similar to be distinguished (specially H-TCP and TCP Veno). In addition, the best results were achieved when working with random forests (using all the statistics collected as input) and with RNN-LSTM (when the inputs are percent change values time series).

MatèriesCoding theory, Information theory, Codificació, Teoria de la, Informació, Teoria de la

TitulacióMÀSTER UNIVERSITARI EN ESTADÍSTICA I INVESTIGACIÓ OPERATIVA (Pla 2013)

URIhttp://hdl.handle.net/2117/165336

Col·leccions

Màsters oficials - Màster universitari en Estadística i Investigació Operativa (UPC-UB) [437]

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
memoria.pdf		6,477Mb	PDF	Accés restringit

UPCommons. Portal del coneixement obert de la UPC

Machine Learning Applied to Network Traffic

Visualitza/Obre

Explora