• Skip to main content
  • Skip to header right navigation
  • Skip to site footer
CCAIM

CCAIM

Novel AI to transform healthcare

  • Home
  • About
    • Our Aims
    • ISAB Report
  • People
    • Leadership
    • Faculty
    • Associate Faculty
    • Joint Steering Committee
    • Independent Scientific Advisory Board
    • Staff
    • Our students
    • Affiliated clinicians
    • Visitors
  • Research
    • Papers
    • Breakthroughs
    • Software
      • AutoPrognosis
      • HyperImpute
      • Interpretability Suite
      • Synthcity
      • TemporAI
    • Demonstrators
    • Research Update: COVID-19
    • Blog
  • News
    • Latest News
    • COVID-19 News
  • Events
    • Seminar Series
    • WeCREATE
    • Inaugural Event
    • AI Clinic 2023
    • AI Clinic 2022
  • Summer School
    • Summer School 2023
      • Participate
      • Program
      • Speakers
      • Exhibition
      • FAQ
    • Summer School 2022
  • Get involved
    • PhD Programmes
    • Clinical PhD Position
    • Partners
    • Connect

Synthcity

24 April 2023 by Andreas Bedorf

Synthcity is an open-source synthetic data generation library that outperforms rivals (YData, Gretel, SDV, etc.) in terms of compatible use cases and data modalities, offering solutions for privacy, data scarcity, and fairness across various data types.

Functional pipeline of Synthcity

How is it unique?

Synthcity gathers state-of-the-art generative models into one user-friendly platform, supporting a wide range of data modalities, such as tabular, time series, censored datasets, and images. It combines cutting-edge techniques from Generative Adversarial Networks (GANs), Variational Auto-Encoders (VAEs), Normalizing Flows, Graphical Neural Networks (GNNs), and Diffusion Models. It is our biggest open-source project to date.

How is it useful?

Synthcity can, among other use cases:

1. Address data privacy concerns by generating synthetic datasets that preserve the original data’s patterns while protecting sensitive information.

2. Combat data scarcity by generating realistic, high-quality synthetic data to improve model training, validation, and performance.

3. Ensure fairness in ML models by generating balanced datasets that mitigate biases, leading to more equitable treatment and drug development outcomes.

4. Facilitate rapid experimentation, prototyping, and benchmarking with a comprehensive suite of evaluation metrics, such as inverse KL divergence, Jensen-Shannon distance, survival KM distance, and many more.

Synthcity‘s versatile models and evaluation metrics make it an invaluable tool for the research community and industry alike, facilitating innovation, safeguarding of privacy, and ensuring of fairness in many data-driven initiatives. We believe that by leveraging Synthcity‘s capabilities, the impact of data science and AI in healthcare can be greatly sped up and enhanced.

GitHub
PyPI
White Paper
Documentation
Tutorials
Category: Impact, News, Research, Software
Previous Post:Interpretability Suite
Next Post:HyperImpute

Navigation

Home

News

About

University of Cambridge

  • University A-Z
  • Contact the University
  • Accessibility
  • Data Protection
  • Terms and conditions

Newsletter

Sign-up for updates on our research.

Follow us

  • Twitter
  • LinkedIn
  • YouTube

Copyright © 2023 CCAIM

Return to top

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

CCAIM
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

If you disable this cookie, we will not be able to save your preferences. This means that every time you visit this website you will need to enable or disable cookies again.

3rd Party Cookies

This website uses Google Analytics to collect anonymous information such as the number of visitors to the site, and the most popular pages.

Keeping this cookie enabled helps us to improve our website.

Please enable Strictly Necessary Cookies first so that we can save your preferences!