They seem to be focused on coding rn. I'm pretty sure this is the largest coding training data generation effort in the world rn. They have the compute.
How many hundred billions will $META end up raising?
The reassigned engineers now review AI-generated code repositories.
Many users reacted positively to Meta reassigning engineers to AI data labeling and RLHF roles because the effort could create a hard-to-replicate infrastructure moat through scaled data generation.
what in the scale ai is happening here?
I thought this was a joke. Meta has now made 30-50% of software engineers on core teams become data labelers.
Their job is "giving human feedback on AI-generated Github repos" in an org called Agent Data Optimization.
Maybe we are all training data generators after all.

@zephyr_z9 Meta Hyperion soon too
https://epoch.ai/data/data-centers?view=graph&tab=power&mode=top-1

@zephyr_z9 If we are on the brink of recursive improvements, you have to focus on coding right now.

@zephyr_z9 My strategy plan. ....
🔻↩️↩️

@zephyr_z9 I wish we would see Bytedance putting in the effort. They have more compute than Meta.

@WinterCawfie @zephyr_z9 pretty cool

@WinterCawfie @zephyr_z9 2GW is fucking crazy

@zephyr_z9 Humans give unwanted feedback! Meta's best engineers now give feedback on AI code. Largest coding RLHF loop yet.

@zephyr_z9 scaling coding data generation at this level
creates an infrastructure moat others can't easily replicate