/Tech5h ago

Tinkerapi Seeks Post-Training Hackers To Advance Model Fine-Tuning

144333016181.3K

Original post unavailable.

Sentiment

Many users expressed excitement about John Schulman hiring post-training hackers for Tinkerapi because they view the effort to advance model fine-tuning as promising and want to contribute.

Pos

100.0%

Neg

0.0%

5 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS959BOOKMARKS1

Andrew Carr 🤸@andrew_n_carr

@johnschulman2 Very exciting!! Congrats on the success so far

1d95911

LIKES1

Amey Muke@7613Perfect

@johnschulman2 dmed :))

1d2471

Daniel Lobaton@dlobaton2

@johnschulman2 lets go!!!

1d5391

AI Systems DevOps@solutions

@johnschulman2 @miramurati Aye

1d535

Vensen@vensen6521

Inspired by Horace He's @thinkymachines post on LLM Nondeterminism, we are building RL-Kernel for scalable post-training.

Single-GPU batch-invariant ops are nearly locked. Now tackling the real beast: cross-TP determinism (vLLM TP>1 vs FSDP TP=1) & the memory wall.

Would love your harsh feedback on our roadmap! https://github.com/RL-Align/RL-Kernel/issues/83

20h293

Ali Ansari@aliansarinik

@johnschulman2 @miramurati 👀

20h292

interstellar travel.@TravelDeepspace

@johnschulman2 @miramurati Hackers the word is so hyper and underrated!

1d208

Sebastian Buzdugan@sebuzdugan

@johnschulman2 will tinker make bad fine-tune runs inspectable since most teams fail there

22h200

Sahil Raut@sahilraut050

@johnschulman2 I have some good red teaming experience on frontier models, Dmed!

14h95

Chen Qian@ChenMoneyQ

@johnschulman2 Tinker is so much underrated IMO, would love to help make it better😄 (it's a bit too expensive for part-time researcher without grant tho....

8h54