You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The TabDDPM paper evaluates a wide set of benchmarks extensively. It demonstrates its superiority over existing SDV GAN/VAE alternatives, which is consistent with the advantage of diffusion models in other fields. Additionally, it shows that TabDDPM is eligible for privacy-oriented setups, where the original data points cannot be publicly shared.
The text was updated successfully, but these errors were encountered:
Hi @celsofranssa, nice to meet you and thank you for your request. It is always great to see that our usage and documentation is working well for users :)
One of the reasons it has worked well is because have developed a framework that all our synthesizers must follow. This includes distinct step for data-preprocessing, handling logical constraints, sampling (including conditional sampling), etc. (More on this in our blog post.) As such, it is not always trivial for us to support externally-developed synthesizers, particularly if they differ from our expected framework in some way.
We can certainly keep this issue open as we decide when/how to prioritize.
It demonstrates its superiority over existing SDV GAN/VAE alternatives, which is consistent with the advantage of diffusion models in other fields.
It would be interesting to look into our SDGym library, which provides an easy way to incorporate a custom synthesizer for the purposes of benchmarking. I understand the original paper provided some results too. Our SDGym library is designed to provide a comparison that standardizes the datasets and metrics across all synthesizers.
Problem Description
TabDDPM: Modelling Tabular Data with Diffusion Models
Expected behavior
Same as the other synthesizer
Additional context
The TabDDPM paper evaluates a wide set of benchmarks extensively. It demonstrates its superiority over existing SDV GAN/VAE alternatives, which is consistent with the advantage of diffusion models in other fields. Additionally, it shows that TabDDPM is eligible for privacy-oriented setups, where the original data points cannot be publicly shared.
The text was updated successfully, but these errors were encountered: