Gavin Leech, UK-based AI researcher and co-founder of the consultancy Arb, notes vision models are roughly 1000 times smaller than text models owing to language's data compression through compositional semantics and abstractions · Digg