I'm releasing the slides on how we generated 486B tokens of high-quality synthetic data at @HuggingFace
I presented this today at the @UZH_en in the lecture Text Generation with Language Models
Special thanks to @j_vamvas for the invitation!
Btw. the University of Zurich is one of the first universities to join the @HuggingFace academia hub, reach out to learn more!