Data wrangling, computational burden, automation, robustness and accuracy in ecological inference forecasting of R×C tables

Cita com:
hdl:2117/397846
Document typeArticle
Defense date2023-06-12
PublisherInstitut d'Estadística de Catalunya
Rights accessOpen Access
This work is protected by the corresponding intellectual and industrial property rights.
Except where otherwise noted, its contents are licensed under a Creative Commons license
:
Attribution-NonCommercial-NoDerivs 3.0 Spain
Abstract
This paper assesses the two current major alternatives for ecological inference, based on a multinomial-Dirichlet Bayesian model and on mathematical programming. Their performance is evaluated in a database made up of almost 2000 real datasets for which the actual cross-distributions are known. The analysis reveals both approaches as complementarity, each one of them performing better in a different area of the simplex space, although with Bayesian solutions deteriorating when the amount of information is scarce. After offering some guidelines regarding the appropriate contexts for employing each one of the algorithms, we conclude with some ideas for exploiting their complementarities.
CitationPavía, J.M.; Romero, R. Data wrangling, computational burden, automation, robustness and accuracy in ecological inference forecasting of R×C tables. "SORT", 12 Juny 2023, vol. 47, p. 151-186.
ISSN1696-2281
Files | Description | Size | Format | View |
---|---|---|---|---|
47.1.3.Pavia-Romero.pdf | 1,247Mb | View/Open | ||
47.1.3.Pavia-Romero.zip | 1,969Mb | application/zip | View/Open |