Over the last ~1 year, we've thinking about how to make privacy-preserving synthetic data useful for LLM training.
@_peihanliu's intern project @GoogleResearch takes a step back to measure usefulness.
ContinuousBench is a new benchmark for differentially private synthetic data. We show current methods cannot transfer knowledge effectively, even at 蔚=100.
1/n
馃УDoes DP synth text transfer useful knowledge or just superficial style mimicking?馃
Existing benchmarks: saturated馃槙
Introducing ContinuousBench: a hard (curr methods fail at 蔚=100! 馃く) & leakage-proof benchmark for DP synth text!
Followup to our #ICML2024 best paper馃憖 1/n
