The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.
synthpop excels at disclosure-controlled
individual-level microdata but lacks joint distribution modeling via
copulas. rsdv uses a Gaussian copula to preserve
inter-column correlations.
library(rsdv)
set.seed(42)
meta <- metadata(adult_income) |>
set_column_type("age", "numerical") |>
set_column_type("occupation", "categorical") |>
set_column_type("income", "categorical")
syn <- gaussian_copula_synthesizer(meta)
syn <- fit(syn, adult_income)
synthetic_data <- sample(syn, n = nrow(adult_income))| Feature | synthpop | rsdv |
|---|---|---|
| Correlation modeling | CART-based sequential | Gaussian copula over all column types |
| Column constraints | Limited | Equality, inequality, fixed combos, custom |
| Conditional sampling | Via predictor order | sample_conditions() on categorical values |
| Quality metrics | Built-in utility measures | KS, TVD, correlation & contingency similarity, ML efficacy |
| Diagnostics | None | Validity report (ranges, categories, key uniqueness) |
| Privacy metrics | None | NNDR, attribute disclosure risk |
| Python interop | No | API-compatible with SDV |
These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.