HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Medical Time-Series Data Generation using Generative Adversarial Networks

Abstract : Medical data is rarely made publicly available due to high deidentification costs and risks. Access to such data is highly regulated due to it's sensitive nature. These factors impede the development of data-driven advancements in the healthcare domain. Synthetic medical data which can maintain the utility of the real data while simultaneously preserving privacy can be an ideal substitute for advancing research. Medical data is longitudinal in nature, with a single patient having multiple temporal events, influenced by static covariates like age, gender, comorbidities, etc. Extending existing time-series generative models to generate medical data can be challenging due to this influence of patient covariates. We propose a workflow wherein we leverage existing generative models to generate such data. We demonstrate this approach by generating synthetic versions of several time-series datasets where static covariates influence the temporal values. We use a state-of-the-art benchmark as a comparative baseline. Our methodology for empirically evaluating synthetic timeseries data shows that the synthetic data generated with our workflow has higher resemblance and utility. We also demonstrate how stratification by covariates is required to gain a deeper understanding of synthetic data quality and underscore the importance of including this analysis in evaluation of synthetic medical data quality.
Document type :
Conference papers
Complete list of metadata

Contributor : Marc Schoenauer Connect in order to contact the contributor
Submitted on : Thursday, March 4, 2021 - 8:13:29 AM
Last modification on : Thursday, February 3, 2022 - 11:15:51 AM
Long-term archiving on: : Saturday, June 5, 2021 - 6:13:44 PM


Files produced by the author(s)


  • HAL Id : hal-03158549, version 1


Saloni Dash, Andrew Yale, Isabelle Guyon, Kristin Bennett. Medical Time-Series Data Generation using Generative Adversarial Networks. AIME 2020 - International Conference on Artificial Intelligence in Medicine, Aug 2020, Minneapolis, United States. pp.382-391. ⟨hal-03158549⟩



Record views


Files downloads