AI researchers discuss the size gap between vision models and language models, noting that vision models are roughly 1000 times smaller than text models, a gap they attribute to language's high-density compression of information via compositional semantics

The size comparison came up in a discussion of how long intelligible chain-of-thought (CoT) reasoning is likely to last, which prompted a suggestion of CoT expressed in pictures rather than words.

Original post

#1480 gavin leech (Non-Reasoning) @gleech

one of the major failures of my life was being so surprised to find out that vision models were ~1000x smaller than text models. Just total failure to understand language's god-tier data compression

2:00 AM · May 17, 2026

#1220 rohit @krishnanrohit

@1a3orn CoT in pictures but not words would be quite neat

1a3orn @1a3orn

this is a relevant consideration for projecting how Lindy intelligible CoT is likely to be

2:31 PM · May 17, 2026 · 4.3K Views

4:15 PM · May 17, 2026 · 257 Views

QUOTE POST

#1380 1a3orn @1a3orn

this is a relevant consideration for projecting how Lindy intelligible CoT is likely to be

gavin leech (Non-Reasoning) @gleech

one of the major failures of my life was being so surprised to find out that vision models were ~1000x smaller than text models. Just total failure to understand language's god-tier data compression

9:00 AM · May 17, 2026 · 189.2K Views

2:31 PM · May 17, 2026 · 4.3K Views
