Interpretation for Variational Autoencoder Used to Generate Financial Synthetic Tabular Data - Publication

Synthetic data, artificially generated by computer programs, has become more widely used in the financial domain to mitigate privacy concerns. Variational Autoencoder (VAE) is one of the most popular deep-learning models for generating synthetic data. However, VAE is often considered a “black box” due to its opaqueness. Although some studies have been conducted to provide explanatory insights into VAE, research focusing on explaining how the input data could influence VAE to create synthetic data, especially for tabular data, is still lacking. However, in the financial industry, most data are stored in a tabular format. This paper proposes a sensitivity-based method to assess the impact of inputted tabular data on how VAE synthesizes data. This sensitivity-based method can provide both global and local interpretations efficiently and intuitively. To test this method, a simulated dataset and three Kaggle banking tabular datasets were employed. The results confirmed the applicability of this proposed method

Related Research

Detecting Mule Account Fraud with Federated Learning

Detecting Mule Account Fraud with Federated Learning

Responsible AI

Research
ATOM: Attention Mixer for Efficient Dataset Distillation

ATOM: Attention Mixer for Efficient Dataset Distillation

*S. Khaki, *A. Sajedi, K. Wang, L. Z. Liu, Y. A. Lawryshyn, and K. N. Plataniotis. Oral presentation at The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR)

Responsible AI

Publications
DataDAM: Efficient Dataset Distillation with Attention Matching

DataDAM: Efficient Dataset Distillation with Attention Matching

*A. Sajedi, *S. Khaki, E. Amjadian, L. Z. Liu, Y. A. Lawryshyn, and K. N. Plataniotis. International Conference in Computer Vision (ICCV)

Responsible AI

Publications

Cookies Settings

Related Research