Mostra el registre d'ítem simple
An efficient closed frequent itemset miner for the MOA stream mining system
dc.contributor.author | Quadrana, Massimo |
dc.contributor.author | Bifet Figuerol, Albert Carles |
dc.contributor.author | Gavaldà Mestre, Ricard |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Llenguatges i Sistemes Informàtics |
dc.date.accessioned | 2017-01-17T10:16:22Z |
dc.date.available | 2017-01-17T10:16:22Z |
dc.date.issued | 2013 |
dc.identifier.citation | Quadrana, M., Bifet, A.C., Gavaldà, R. "An efficient closed frequent itemset miner for the MOA stream mining system". 2013. |
dc.identifier.uri | http://hdl.handle.net/2117/99416 |
dc.description.abstract | Mining itemsets is a central task in data mining, both in the batch and the streaming paradigms. While robust, efficient, and well-tested implementations exist for batch mining, hardly any publicly available equivalent exists for the streaming scenario. The lack of an efficient, usable tool for the task hinders its use by practitioners and makes it difficult to assess new research in the area. To alleviate this situation, we review the algorithms described in the literature, and implement and evaluate the IncMine algorithm by Cheng, Ke, and Ng (2008) for mining frequent closed itemsets from data streams. Our implementation works on top of the MOA (Massive Online Analysis) stream mining framework to ease its use and integration with other stream mining tasks. We provide a PAC-style rigorous analysis of the quality of the output of IncMine as a function of its parameters; this type of analysis is rare in pattern mining algorithms. As a by-product, the analysis shows how one of the user-provided parameters in the original description can be removed entirely while retaining the performance guarantees. Finally, we experimentally confirm both on synthetic and real data the excellent performance of the algorithm, as reported in the original paper, and its ability to handle concept drift. |
dc.format.extent | 31 p. |
dc.language.iso | eng |
dc.relation.ispartofseries | LSI-13-9-R |
dc.subject | Àrees temàtiques de la UPC::Informàtica::Informàtica teòrica |
dc.subject.other | Data mining |
dc.subject.other | Data streams |
dc.subject.other | Stream mining |
dc.subject.other | Itemset mining |
dc.subject.other | MOA |
dc.title | An efficient closed frequent itemset miner for the MOA stream mining system |
dc.type | External research report |
dc.contributor.group | Universitat Politècnica de Catalunya. LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge |
dc.rights.access | Open Access |
local.identifier.drac | 19592955 |
dc.description.version | Postprint (published version) |
local.citation.author | Quadrana, M.; Bifet, A.C.; Gavaldà, R. |
Fitxers d'aquest items
Aquest ítem apareix a les col·leccions següents
-
Reports de recerca [68]
-
Reports de recerca [1.107]