11h ago

Frontier AI labs are reportedly spending $10 billion to $15 billion each on training data acquisition

Complex, high-quality task datasets can cost up to $500,000.

0
Original post

@xlr8harder It is not as useful or valuable as one may think especially as it gets more and more specialised

It is just “easy” to spend money on this vs say GPUs

xlr8harderxlr8harder@xlr8harder

So, how does open source match this?

7:44 AM · May 30, 2026 · 9.4K Views
1:43 PM · May 30, 2026 · 848 Views

@EMostaque Maybe, but open datasets are still quite far behind.

EmadEmad@EMostaque

@xlr8harder It is not as useful or valuable as one may think especially as it gets more and more specialised It is just “easy” to spend money on this vs say GPUs

1:43 PM · May 30, 2026 · 848 Views
1:45 PM · May 30, 2026 · 349 Views