Statistical validation of synthetic data for lung cancer patients generated by using generative adversarial networks

González Abril, Luis; Angulo Bahón, Cecilio; Antonio Ortega, Juan; López Guerra, José Luis

doi:10.3390/electronics11203277

Visualitza/Obre

electronics-11-03277.pdf (867,9Kb)

Veure estadístiques d'ús d'UPCommons

Estadístiques de LA Referencia / Recolecta

Cita com:

Mostra el registre d'ítem complet

González Abril, Luis

Angulo Bahón, Cecilio

Antonio Ortega, Juan

López Guerra, José Luis

Tipus de documentArticle

Data publicació2022-10-01

Condicions d'accésAccés obert

Llevat que s'hi indiqui el contrari, els continguts d'aquesta obra estan subjectes a la llicència de Creative Commons : Reconeixement 4.0 Internacional

Abstract

The development of healthcare patient digital twins in combination with machine learning technologies helps doctors in therapeutic prescription and in minimally invasive intervention procedures. The confidentiality of medical records or limited data availability in many health domains are drawbacks that can be overcome with the generation of synthetic data conformed to real data. The use of generative adversarial networks (GAN) for the generation of synthetic data of lung cancer patients has been previously introduced as a tool to solve this problem in the form of anonymized synthetic patients. However, generated synthetic data are mainly validated from the machine learning domain (loss functions) or expert domain (oncologists). In this paper, we propose statistical decision making as a validation tool: Is the model good enough to be used? Does the model pass rigorous hypothesis testing criteria? We show for the case at hand how loss functions and hypothesis validation are not always well aligned.

CitacióGonzález Abril, L. [et al.]. Statistical validation of synthetic data for lung cancer patients generated by using generative adversarial networks. "Electronics (Switzerland)", 1 Octubre 2022, vol. 11, núm. 20, article 3277, p. 1-15.

URIhttp://hdl.handle.net/2117/382001

DOI10.3390/electronics11203277

ISSN2079-9292

Versió de l'editorhttps://www.mdpi.com/2079-9292/11/20/3277

Col·leccions

Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial - Articles de revista [1.391]

Veure estadístiques d'ús d'UPCommons

Mostra el registre d'ítem complet

Fitxers	Descripció	Mida	Format	Visualitza
electronics-11-03277.pdf		867,9Kb	PDF	Visualitza/Obre

UPCommons. Portal del coneixement obert de la UPC

Statistical validation of synthetic data for lung cancer patients generated by using generative adversarial networks

Visualitza/Obre

Explora