Greg Kamradt@GregKamradt
Do n-gram models also require an understanding of the underlying reality?
If not, which architecture level of next-word predictors does?
If so, then imo the “understanding” is inherit in the data, not the transformation of it
6:26 AM · Jun 1, 2025 · 6.2K Views