Hugues Annoye
Cédric Heuchenne
(2025).
Generating survey databases with Wasserstein Generative Adversarial Networks.
Appl Intell 55, 1095 (2025).
In a world increasingly surrounded by data, data privacy and anonymization are becoming more and more important. Under these circumstances, the need for synthetic databases that replicate the characteristics of the population while preserving privacy is arising. In this article, we investigate how we can use Wasserstein Generative Adversarial Networks (WGANs), developed by Arjovsky et al. [1] in the context of image generation, to create synthetic survey databases. Survey data have both categorical and continuous variables and especially contain sampling weights that will be introduced in the proposed procedures. We evaluate the quality of the (generated) synthetic data through different indicators and especially a new practical and intuitive measure based on Support Vector Data Description (SVDD). All our analyses are achieved with the Labour Force Survey (LFS) data for Belgium.
Field : scientific publication