8h ago

Josef Chen releases EPICURE, a 2MB factor decomposition model mapping 1,790 ingredients from 4.1 million recipes

It identified five emergent culinary clusters without prior labels.

0
Original post

Launching our new paper on arXiv: we trained the largest multilingual food model ever built. 4.1M recipes. 7 languages. 1,790 ingredients. 300 dimensions. All of human cooking compressed into 2 megabytes.

12:08 PM · May 26, 2026 View on X

Tell me why does the umap projection of food in neural nets look like a real map 😭

Josef ChenJosef Chen@josefchen

Launching our new paper on arXiv: we trained the largest multilingual food model ever built. 4.1M recipes. 7 languages. 1,790 ingredients. 300 dimensions. All of human cooking compressed into 2 megabytes.

7:08 PM · May 26, 2026 · 146.9K Views
2:46 AM · May 27, 2026 · 354 Views

Me: Honey, I've got bad news and good news. Wife: ...what. Me: Bad news - I spent $1M of our savings on compute. Wife: AND THE GOOD NEWS?? Me: I found the vector between Chinese and Ethiopian cuisine Wife: Me: What? ethiopian - chinese is a surprisingly interpretable direction!!!

Josef ChenJosef Chen@josefchen

Launching our new paper on arXiv: we trained the largest multilingual food model ever built. 4.1M recipes. 7 languages. 1,790 ingredients. 300 dimensions. All of human cooking compressed into 2 megabytes.

7:08 PM · May 26, 2026 · 146.9K Views
1:18 AM · May 27, 2026 · 2.8K Views
Josef Chen releases EPICURE, a 2MB factor decomposition model mapping 1,790 ingredients from 4.1 million recipes · Digg