A theoretical and empirical study of affirmative sampling
dc.contributor | Martínez Parra, Conrado |
dc.contributor.author | Montes Sanabria, Jordi |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament de Ciències de la Computació |
dc.date.accessioned | 2020-07-30T15:52:47Z |
dc.date.issued | 2020-07 |
dc.identifier.uri | http://hdl.handle.net/2117/328097 |
dc.description.abstract | Distinct random sampling is widely used because it can answer aggregate queries on data with provable probabilistic guarantees on the quality of the answer. We consider the data stream model, where only a single pass over the data is allowed and only a small number of elements of the stream can be stored. We analyze Affirmative Sampling, an algorithm for distinct random sampling with adaptive sample size. We discuss an unbiased estimator of the stream's cardinality built from the elements of the random sample that Affirmative Sampling produces, and we provide bounds for its accuracy. Random samples can be used for much more than cardinality estimation. We introduce Nooh, a fast genome distance estimation tool that uses Affirmative Sampling to compete with current state-of-the-art tools in bioinformatics. |
dc.language.iso | eng |
dc.publisher | Universitat Politècnica de Catalunya |
dc.subject | UPC subject areas::Mathematics and statistics |
dc.subject.lcsh | Algorithms |
dc.subject.other | Random sampling |
dc.subject.other | Distinct sampling |
dc.subject.other | Cardinality estimation |
dc.subject.other | Affirmative sampling |
dc.title | A theoretical and empirical study of affirmative sampling |
dc.type | Master thesis |
dc.subject.lemac | Algorithms |
dc.subject.ams | AMS Classification::68 Computer science::68W Algorithms |
dc.identifier.slug | FME-2044 |
dc.rights.access | Restricted access - author's decision |
dc.date.lift | 10000-01-01 |
dc.date.updated | 2020-07-17T09:25:03Z |
dc.audience.educationlevel | Master's |
dc.audience.mediator | Universitat Politècnica de Catalunya. Facultat de Matemàtiques i Estadística |
dc.audience.degree | Master's Degree in Advanced Mathematics and Mathematical Engineering (2010 curriculum) |