Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
- 
            Updated
            Aug 30, 2025 
- Go
Open Source Data Security Platform for Developers to Monitor and Detect PII, Anonymize Production Data and Sync it across environments.
Synthetic data generation for tabular data
Conditional GAN for generating synthetic tabular data.
Synthetic Data SDK ✨
A library to model multivariate data using copulas.
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
Unity's privacy-preserving human-centric synthetic data generator
[IMC 2020 (Best Paper Finalist)] Using GANs for Sharing Networked Time Series Data: Challenges, Initial Promise, and Open Questions
INGenious Playwright Studio
Synthetic Data Generation for mixed-type, multivariate time series.
(SIGCOMM '22) Practical GAN-based Synthetic IP Header Trace Generation using NetShare
Flow Matching implemented in PyTorch
Synthetic Data Engine 💎
[TMLR] GraphMaker: Can Diffusion Models Generate Large Attributed Graphs?
NeurIPS 2025: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs
This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.
A toolset to test data classification engines that generates mock data in various file formats, sizes and data profiles.
[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models".
Unity's Privacy-Preserving Novel Human Body Model Trained Solely on Synthetic Data and Corresponding Dense Anthropometric Measurements
Codebase for "Generating multivariate time series with COmmon Source CoordInated GAN (COSCI-GAN)"
Add a description, image, and links to the synthetic-data-generation topic page so that developers can more easily learn about it.
To associate your repository with the synthetic-data-generation topic, visit your repo's landing page and select "manage topics."