Anomaly detection model selection using minimum description length
View/Open
memoria.pdf (3.096 MB) (Restricted access)
Document type: Master thesis
Date: 2020-07
Rights access: Restricted access - confidentiality agreement
Abstract
Detecting objects that deviate significantly from the rest of a dataset is a complex task that requires advanced techniques. A great variety of anomaly detection algorithms have been proposed in recent years, but none has been proven to be the best. We present a proxy technique, based on the minimum description length (MDL) principle, for predicting the outlier detection performance of compression-based algorithms on a particular dataset. We analyse the correlation between how well an algorithm can compress the data and its performance in anomaly detection (AD). The results show a clear relationship between the total compressed size of a dataset and the outlier detection performance of an MDL-based algorithm. This relationship allows us to use the compressed size as a proxy for selecting the most effective AD algorithm for a specific application.
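The selection idea in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the general-purpose compressors from the Python standard library (`zlib`, `bz2`, `lzma`) are hypothetical stand-ins for MDL-based detectors, the names `CANDIDATES`, `total_compressed_size` and `select_by_description_length` are invented for illustration, and choosing the smallest size assumes the usual MDL reading that shorter descriptions indicate a better model.

```python
import bz2
import lzma
import zlib
from typing import Callable, Dict

# Hypothetical stand-ins for MDL-based detectors (an assumption of this
# sketch): each candidate maps raw bytes to a compressed encoding.
CANDIDATES: Dict[str, Callable[[bytes], bytes]] = {
    "zlib": zlib.compress,
    "bz2": bz2.compress,
    "lzma": lzma.compress,
}


def total_compressed_size(data: bytes) -> Dict[str, int]:
    """Return the compressed size in bytes of `data` under each candidate."""
    return {name: len(compress(data)) for name, compress in CANDIDATES.items()}


def select_by_description_length(sizes: Dict[str, int]) -> str:
    """Pick the candidate with the smallest total compressed size.

    Assumes, following the MDL principle, that a shorter description
    signals a model better suited to the dataset.
    """
    return min(sizes, key=lambda name: sizes[name])


if __name__ == "__main__":
    # Toy dataset: mostly regular records with an occasional odd one.
    data = b"normal,normal,normal,odd,normal,normal\n" * 200
    sizes = total_compressed_size(data)
    print(sizes)
    print("selected:", select_by_description_length(sizes))
```

In this sketch the compressed size is computed without any labels, which is what would make it usable as a proxy when ground-truth anomalies are unavailable.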
Except where otherwise noted, content on this work is licensed under a Creative Commons license: Attribution-NonCommercial-ShareAlike 3.0 Spain