mostly-ai/mostly-tutorials

Synthetic Data Tutorials - by MOSTLY AI

Welcome! We're excited to share our repository of tutorials with you, which will help you explore and validate the benefits of synthetic data. Simply clone the repository to your own environment and run the tutorials locally via JupyterLab, or make it even easier and run each tutorial directly on Google's cloud resources via Colab. Let's get started!

  • Validate synthetic data via Train-Synthetic-Test-Real [run on Colab]
  • Explore the size vs. accuracy trade-off for synthetic data [run on Colab]
  • Rebalance synthetic datasets for data augmentation [run on Colab]
  • Conditionally generate synthetic (geo) data [run on Colab]
  • Explain AI with synthetic data [run on Colab]
  • Generate synthetic text [run on Colab]
  • Perform multi-table synthesis [run on Colab]
  • Develop a fake-vs-real discriminator with synthetic data [run on Colab]
  • Close gaps in your data with Smart Imputation [run on Colab]
  • Calculate accuracy and privacy metrics for Quality Assurance [run on Colab]