Mostra el registre d'ítem simple

dc.contributor.authorQuadrana, Massimo
dc.contributor.authorBifet Figuerol, Albert Carles
dc.contributor.authorGavaldà Mestre, Ricard
dc.contributor.otherUniversitat Politècnica de Catalunya. Departament de Ciències de la Computació
dc.date.accessioned2016-02-16T08:02:25Z
dc.date.available2016-02-16T08:02:25Z
dc.date.issued2015-01-07
dc.identifier.citationQuadrana, M., Bifet, A.C., Gavaldà, R. An efficient closed frequent itemset miner for the MOA stream mining system. "AI communications: the european journal of artificial intelligence", 07 Gener 2015, vol. 28, núm. 1, p. 143-158.
dc.identifier.issn0921-7126
dc.identifier.urihttp://hdl.handle.net/2117/82987
dc.description.abstractMining itemsets is a central task in data mining, both in the batch and the streaming paradigms. While robust, efficient, and well-tested implementations exist for batch mining, hardly any publicly available equivalent exists for the streaming scenario. The lack of an efficient, usable tool for the task hinders its use by practitioners and makes it difficult to assess new research in the area. To alleviate this situation, we review the algorithms described in the literature, and implement and evaluate the IncMine algorithm by Cheng, Ke and Ng [J. Intell. Inf. Syst. 31(3) (2008), 191–215] for mining frequent closed itemsets from data streams. Our implementation works on top of the MOA (Massive Online Analysis) stream mining framework to ease its use and integration with other stream mining tasks. We provide a PAC-style rigorous analysis of the quality of the output of IncMine as a function of its parameters; this type of analysis is rare in pattern mining algorithms. As a by-product, the analysis shows how one of the user-provided parameters in the original description can be removed entirely while retaining the performance guarantees. Finally, we experimentally confirm both on synthetic and real data the excellent performance of the algorithm, as reported in the original paper, and its ability to handle concept drift.
dc.format.extent16 p.
dc.language.isoeng
dc.subjectÀrees temàtiques de la UPC::Informàtica::Sistemes d'informació
dc.subject.lcshData mining
dc.subject.otherData streams
dc.subject.otherStream mining
dc.subject.otherItemset mining
dc.subject.otherMOAS
dc.titleAn efficient closed frequent itemset miner for the MOA stream mining system
dc.typeArticle
dc.subject.lemacMineria de dades
dc.contributor.groupUniversitat Politècnica de Catalunya. LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge
dc.identifier.doi10.3233/AIC-140615
dc.description.peerreviewedPeer Reviewed
dc.relation.publisherversionhttp://content.iospress.com/articles/ai-communications/aic615
dc.rights.accessOpen Access
local.identifier.drac17501291
dc.description.versionPostprint (author's final draft)
local.citation.authorQuadrana, M.; Bifet, A.C.; Gavaldà, R.
local.citation.publicationNameAI communications: the european journal of artificial intelligence
local.citation.volume28
local.citation.number1
local.citation.startingPage143
local.citation.endingPage158


Fitxers d'aquest items

Thumbnail

Aquest ítem apareix a les col·leccions següents

Mostra el registre d'ítem simple