A concise explanation of synthetic data generation methods, such as copulas and GANs, within AGENT.
Synthetic Data
Overview
Explore synthetic data use in AGENT with this tutorial, where we guide you through generating and using synthetic datasets for secure data analysis. Synthetic data is a pivotal resource for secure data analysis when original data cannot be used directly. This video walks you through generating synthetic data using different methods, including marginal distributions and neural network-based approaches like GANs.
Included in this video
1
2
Step-by-step instructions on importing datasets into AGENT and converting them into private data frames.
3
A demonstration of using the Smart Noise Synthetic Data Toolbox for creating differentially private synthetic datasets.
4
Comparison of summary statistics and correlation matrices between original and synthetic datasets to assess data fidelity.