dc.contributor.author | Ezbakhe, Fatine |
dc.contributor.author | Pérez Foguet, Agustí |
dc.contributor.other | Universitat Politècnica de Catalunya. Doctorat en Enginyeria Civil |
dc.contributor.other | Universitat Politècnica de Catalunya. Departament d'Enginyeria Civil i Ambiental |
dc.date.accessioned | 2019-12-17T14:05:20Z |
dc.date.available | 2019-12-17T14:05:20Z |
dc.date.issued | 2019 |
dc.identifier.citation | Ezbakhe, F.; Pérez-Foguet, A. WASH your data off: navigating statistical uncertainty in compositional data analysis. In: International Workshop on Compositional Data Analysis. "Proceedings of the 8th International Workshop on Compositional Data Analysis (CoDaWork2019): Terrassa, 3-8 June, 2019". 2019, p. 57-62. |
dc.identifier.isbn | 978-84-947240-2-2 |
dc.identifier.uri | http://hdl.handle.net/2117/174036 |
dc.description.abstract | International monitoring of access to drinking water, sanitation and hygiene (WASH) is essential to inform policy planning, implementation and delivery of services. The Joint Monitoring Programme for Water Supply and Sanitation (JMP) is the recognized mechanism for tracking access and progress, and it is based on household surveys and linear regression modelling over time. However, the methods employed have two substantial limitations: they address neither the compositional nature of the data nor its statistical uncertainty (Ezbakhe & Pérez-Foguet 2018). While the first issue has been tackled previously in the literature (Pérez-Foguet et al. 2017), the effect of non-uniform sampling errors on the regressions remains ignored. This article aims to address these shortcomings in order to produce a more truthful interpretation of JMP data. The main challenge we try to overcome is how to translate the sampling errors provided in household surveys to the space of compositional data. A Normal distribution, described by a mean and its standard deviation, is commonly assumed for estimates in household surveys. However, when working with binary data on households - the proportions of households that have access to WASH services - the errors cannot follow Normal distributions because proportions are restricted to the range 0 to 1. Thus, the Beta distribution seems a better option to characterize the uncertainty around mean access coverage. Yet, as the Beta distribution is defined on the [0,1] interval, the zero values must be dealt with in order to employ the isometric log-ratio (ilr) transformation designed for compositional data. In this article, we investigate the use of two probability distributions (Pearson Type I and Truncated Normal) and Monte Carlo simulations to reinterpret the error in the JMP data so that compositional data analysis is possible. With a specific focus on the WASH sector, our article shows the importance of including the survey errors of the data - and its compositional nature - when using this information to support evidence-based policy-making. Indeed, given the current levels of statistical uncertainty in WASH, data may lead to misleading results if errors are not acknowledged (or minimized). |
dc.format.extent | 6 p. |
dc.language.iso | eng |
dc.subject | Àrees temàtiques de la UPC::Matemàtiques i estadística::Estadística matemàtica::Modelització estadística |
dc.subject | Àrees temàtiques de la UPC::Desenvolupament humà i sostenible::Desenvolupament humà::Aigua i sanejament |
dc.subject.lcsh | Sanitary engineering--Developing countries |
dc.subject.lcsh | Mathematical statistics |
dc.subject.other | Demographic Data |
dc.subject.other | Statistical Uncertainty |
dc.subject.other | Compositional Data |
dc.subject.other | Joint Monitoring Programme (JMP) |
dc.subject.other | WASH |
dc.title | WASH your data off: navigating statistical uncertainty in compositional data analysis |
dc.type | Conference report |
dc.subject.lemac | Sanejament -- Països en vies de desenvolupament |
dc.subject.lemac | Estadística matemàtica |
dc.contributor.group | Universitat Politècnica de Catalunya. EScGD - Engineering Sciences and Global Development |
dc.relation.publisherversion | https://webs.camins.upc.edu/codawork2019/proceedings/book-proceedings-CoDaWork2019-correctedv.pdf |
dc.rights.access | Open Access |
local.identifier.drac | 25979437 |
dc.description.version | Postprint (published version) |
local.citation.author | Ezbakhe, F.; Pérez-Foguet, A. |
local.citation.contributor | International Workshop on Compositional Data Analysis |
local.citation.publicationName | Proceedings of the 8th International Workshop on Compositional Data Analysis (CoDaWork2019): Terrassa, 3-8 June, 2019 |
local.citation.startingPage | 57 |
local.citation.endingPage | 62 |