/AI1d ago

Meta's Lucas Beyer argues NVIDIA's Nemotron could be the last fully open, from-scratch LLM project if OLMo ends

AI Judge changed title after evaluation, original title: "Meta's Lucas Beyer questions if open-source LLM pretraining is ending, prompting details of Marin's 6x speedup"

Stanford's Marin project remains another fully open training framework

5476725240101.9K
Original post
Lucas Beyer (bl16)@giffmana#55inAI

Do I understand it correctly that the OLMo from-scratch series is coming to an end?

If so, looks like NVIDIA stepped up just in time with Nemotron models as the only remaining fully-open (ie not just weight drop) from-scratch LLM team.

7:24 AM · Jun 7, 2026 · 72.4K Views
Sentiment

Users praise NVIDIA Nemotron and Stanford Marin for fully open-sourcing LLM training while others worry about OLMo vanishing along with its shared training data.

Pos
79.1%
Neg
20.9%
17 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS28.5KBOOKMARKS133LIKES273RETWEETS13REPLIES5
elie@eliebakouch

one of my favorite projects is Marin from the stanford folks, they have a scientific approach to training, are ready to take risks and are fully open (even open development where you can follow everything on github!)

https://github.com/marin-community/marin

Do I understand it correctly that the OLMo from-scratch series is coming to an end?

If so, looks like NVIDIA stepped up just in time with Nemotron models as the only remaining fully-open (ie not just weight drop) from-scratch LLM team.

23hViews 28.5KLikes 273Bookmarks 133

@giffmana Great fully open pretraining efforts going on at Marin!

Percy Liang@percyliang

There are two types of advances: (i) a singular change that provides 3x and (ii) a series of micro changes that each provide 20%. It is easy to celebrate (i), but (ii) is just as important, and the hard part is making sure the improvements stack. We care about both in Marin.

1dViews 2.4KLikes 31Bookmarks 4

@eliebakouch Yeah they are interesting, but my understanding is that they haven't *finished* (as in also mid and post) anything yet?

elie@eliebakouch

one of my favorite projects is Marin from the stanford folks, they have a scientific approach to training, are ready to take risks and are fully open (even open development where you can follow everything on github!)

https://github.com/marin-community/marin

21hViews 2.5KLikes 24Bookmarks 3
Kyle Wiggers@Kyle_L_Wiggers

@giffmana i'd say don't count olmo out just yet :)

1dViews 1.8KLikes 25Bookmarks 3

Two things make me think that:

1) I remember not too long ago some comm's about the new strategy, which sounded much more application/post-train focussed to me (sorry don't remember where, but from ai2)

2) large portion of the team gone

That being said, I'm an OLMo fan for the openness and reasonable quality, so I would be happy if my impression is proven wrong!

1dViews 1.9KLikes 33

@eliebakouch they just need a twitter account, it’s so hard to link/find to their work 🥲

elie@eliebakouch

one of my favorite projects is Marin from the stanford folks, they have a scientific approach to training, are ready to take risks and are fully open (even open development where you can follow everything on github!)

https://github.com/marin-community/marin

19hViews 732Likes 13Bookmarks 1
elie@eliebakouch

@giffmana yes true, quite excited for them to tackle post training the same way they do for pre training!

@eliebakouch Yeah they are interesting, but my understanding is that they haven't *finished* (as in also mid and post) anything yet?

18hViews 585Likes 10Bookmarks 1

Do I understand it correctly that the OLMo from-scratch series is coming to an end?

If so, looks like NVIDIA stepped up just in time with Nemotron models as the only remaining fully-open (ie not just weight drop) from-scratch LLM team.

1dViews 72.4KLikes 438Bookmarks 107
Maziyar PANAHI@MaziyarPanahi

@giffmana I hope not! I really appreciate NVIDIA’s recent efforts across several areas of open-source AI, but I would hate to see OLMo disappear. It has been such a valuable open-source project, built by an incredibly talented group of people.

22hViews 366Likes 6Bookmarks 1
Lazarz@Laz4rz

@giffmana Arcee?

23hViews 591Likes 8
Damien Teney@DamienTeney

@giffmana Apertus?🇨🇭 I heard there will be a v2.

1dViews 605Likes 6
Kyle Wiggers@Kyle_L_Wiggers

@giffmana stay tuned. lots of exciting work underway is all i can say rn, and I think the community will be pleased with the outputs

1dViews 550Likes 10
Yehuda Cohen@FunWithTheCloud

@giffmana Haven't looked into it too deeply but isn't stepfun also fully open?

1dViews 492Likes 1

@giffmana Did you see project Marin?

1dViews 1.3KLikes 8
Lazarz@Laz4rz

@giffmana Apertus?

23hViews 424Likes 2
Kerem Zaman@KeremZaman3

@giffmana K2 from @IFM_MBZUAI

1dViews 270Likes 2

@giffmana They should get Nathan on board

1dViews 911Likes 4
Eric W. Tramel@fujikanaeda

@bygregorr @giffmana Pre and post Data, software, recipes, models (base, post, reward), full technical report

Hard to be more open and Nemotron

1dViews 44Likes 2
Gregor@bygregorr

@giffmana not sure nemotron clears that bar the same way olmo did. olmo dropped the full pretraining dataset too, does nemotron actually publish the training data or just weights + recipes?

1dViews 137Likes 1
Load more posts