Adaptively learning probabilistic deterministic automata from data streams

Balle Pigem, Borja de; Castro Rabal, Jorge; Gavaldà Mestre, Ricard

doi:10.1007/s10994-013-5408-x

Visualitza/Obre

Article principal (994,7Kb) (Accés restringit) Sol·licita una còpia a l'autor

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

Balle Pigem, Borja de

Castro Rabal, Jorge

Gavaldà Mestre, Ricard

Tipus de documentArticle

Data publicació2014-07

Condicions d'accésAccés restringit per política de l'editorial

Tots els drets reservats. Aquesta obra està protegida pels drets de propietat intel·lectual i industrial corresponents. Sense perjudici de les exempcions legals existents, queda prohibida la seva reproducció, distribució, comunicació pública o transformació sense l'autorització del titular dels drets

Abstract

Markovian models with hidden state are widely-used formalisms for modeling sequential phenomena. Learnability of these models has been well studied when the sample is given in batch mode, and algorithms with PAC-like learning guarantees exist for specific classes of models such as Probabilistic Deterministic Finite Automata (PDFA). Here we focus on PDFA and give an algorithm for inferring models in this class in the restrictive data stream scenario: Unlike existing methods, our algorithm works incrementally and in one pass, uses memory sublinear in the stream length, and processes input items in amortized constant time. We also present extensions of the algorithm that (1) reduce to a minimum the need for guessing parameters of the target distribution and (2) are able to adapt to changes in the input distribution, relearning new models when needed. We provide rigorous PAC-like bounds for all of the above. Our algorithm makes a key usage of stream sketching techniques for reducing memory and processing time, and is modular in that it can use different tests for state equivalence and for change detection in the stream.

CitacióBalle, B.; Castro, J.; Gavaldà, R. Adaptively learning probabilistic deterministic automata from data streams. "Machine learning", Juliol 2014, vol. 96, núm. 1-2, p. 99-127.

URIhttp://hdl.handle.net/2117/28256

DOI10.1007/s10994-013-5408-x

ISSN0885-6125

Versió de l'editorhttp://link.springer.com/article/10.1007%2Fs10994-013-5408-x

Col·leccions

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
ml2014.pdf	Article principal	994,7Kb	PDF	Accés restringit

UPCommons. Portal del coneixement obert de la UPC

Adaptively learning probabilistic deterministic automata from data streams

Visualitza/Obre

Explora