Frontier AI labs are reportedly spending $10 billion to $15 billion each on training data acquisition
Complex, high-quality task datasets can cost up to $500,000.
——0——
REPLY
#167Emad@EMOSTAQUE
@xlr8harder It is not as useful or valuable as one may think especially as it gets more and more specialised
It is just “easy” to spend money on this vs say GPUs
So, how does open source match this?
7:44 AM · May 30, 2026 · 9.4K Views
1:43 PM · May 30, 2026 · 848 Views
@EMostaque Maybe, but open datasets are still quite far behind.
@xlr8harder It is not as useful or valuable as one may think especially as it gets more and more specialised It is just “easy” to spend money on this vs say GPUs
1:43 PM · May 30, 2026 · 848 Views
1:45 PM · May 30, 2026 · 349 Views