Now showing items 1-5 of 5

  • Adaptive sampling methods for scaling up knowledge discovery algorithms 

    Domingo Soriano, Carlos; Gavaldà Mestre, Ricard; Watanabe, Osamu (2001-07)
    External research report
    Open Access
    One of the biggest research challenges in KDD and Data Mining is to develop methods that scale up well to large amounts of data. A possible approach for achieving scalability is to take a random sample and do data mining ...
  • Algorithms for learning finite automata from queries: a unified view 

    Balcázar Navarro, José Luis; Díaz Cort, Josep; Gavaldà Mestre, Ricard; Watanabe, Osamu (1996-09)
    External research report
    Open Access
    In this survey we compare several known variants of the algorithm for learning deterministic finite automata via membership and equivalence queries. We believe that our presentation makes it easier to understand what ...
  • Coding complexity: the computational complexity of succinct descriptions 

    Balcázar Navarro, José Luis; Gavaldà Mestre, Ricard; Watanabe, Osamu (1996-09)
    External research report
    Open Access
    For a given set of strings, the problem of obtaining a succinct description becomes an important subject of research, related to several areas of theoretical computer science. In structural complexity theory, researchers ...
  • On-line sampling methods for discovering association rules 

    Domingo Soriano, Carlos; Gavaldà Mestre, Ricard; Watanabe, Osamu (1999-02)
    External research report
    Open Access
    Association rule discovery is one of the prototypical problems in data mining. In this problem, the input database is assumed to be very large and most of the algorithms are designed to minimize the number of scans of ...
  • Sequential sampling algorithms: unified analysis and lower bounds 

    Gavaldà Mestre, Ricard; Watanabe, Osamu (2001-11)
    External research report
    Open Access
    Sequential sampling algorithms have recently attracted interest as a way to design scalable algorithms for Data mining and KDD processes. In this paper, we identify an elementary sequential sampling task (estimation from ...