NVIDIA's Bryan Catanzaro explains why the hardware giant builds and open-sources Nemotron AI models

VIEWS6.5KBOOKMARKS32LIKES30RETWEETS6REPLIES6

Inside Nemotron and NVIDIA's AI lab: my conversation with Bryan Catanzaro (@ctnzr).

@nvidia is a chip company. So why does it put hundreds of researchers on building AI models - and then give them away for free? We go deep into the Nemotron models, what it takes to build a top AI lab, and the future of frontier AI.

01:33 - Is open source AI catching the frontier?

05:29 - Do closed labs blocking distillation slow open source down?

07:42 - Is the US falling behind China?

10:30 - Why companies actually choose open models

12:39 - A "crazy" 2008 bet: machine learning on GPUs

15:33 - Working with Andrew Ng and Dario Amodei at Baidu

17:41 - Coming back to NVIDIA: DLSS and the birth of Megatron

21:55 - The real reason NVIDIA builds its own models

24:28 - Is Moore's Law really dead?

33:37 - The Nemotron family: Nano, Super, Ultra

35:09 - Built for agents: why NVIDIA bets on speed

36:02 - How you train a 550B model in 4 bits

39:25 - Hybrid Mamba-Transformer, explained simply

42:31 - Mixture of experts, and why NVIDIA built NVL72 around it

47:26 - Why a 1-million-token context window matters

49:26 - Multi-token prediction: how the model predicts 5 tokens at once

52:47 - Multi-teacher distillation: teaching one model from many

58:01 - Where reinforcement learning goes next

01:00:16 - Inside NVIDIA's research org: "the mission is the boss"

01:04:03 - How NVIDIA decides who gets the GPUs

01:10:53 - Why NVIDIA still feels entrepreneurial after 33 years

01:12:58 - Why Bryan doesn't believe in the singularity

01:17:50 - The AI backlash

01:19:18 - The controversial case: open AI is safer than closed

2h6.5K3032

Turner Novak 🍌🧢@TurnerNovak

@mattturck @ctnzr @nvidia Bro has the most pianos in an office I've ever seen

Matt Turck@mattturck

Inside Nemotron and NVIDIA's AI lab: my conversation with Bryan Catanzaro (@ctnzr).

@nvidia is a chip company. So why does it put hundreds of researchers on building AI models - and then give them away for free? We go deep into the Nemotron models, what it takes to build a top AI lab, and the future of frontier AI.

01:33 - Is open source AI catching the frontier?

05:29 - Do closed labs blocking distillation slow open source down?

07:42 - Is the US falling behind China?

10:30 - Why companies actually choose open models

12:39 - A "crazy" 2008 bet: machine learning on GPUs

15:33 - Working with Andrew Ng and Dario Amodei at Baidu

17:41 - Coming back to NVIDIA: DLSS and the birth of Megatron

21:55 - The real reason NVIDIA builds its own models

24:28 - Is Moore's Law really dead?

33:37 - The Nemotron family: Nano, Super, Ultra

35:09 - Built for agents: why NVIDIA bets on speed

36:02 - How you train a 550B model in 4 bits

39:25 - Hybrid Mamba-Transformer, explained simply

42:31 - Mixture of experts, and why NVIDIA built NVL72 around it

47:26 - Why a 1-million-token context window matters

49:26 - Multi-token prediction: how the model predicts 5 tokens at once

52:47 - Multi-teacher distillation: teaching one model from many

58:01 - Where reinforcement learning goes next

01:00:16 - Inside NVIDIA's research org: "the mission is the boss"

01:04:03 - How NVIDIA decides who gets the GPUs

01:10:53 - Why NVIDIA still feels entrepreneurial after 33 years

01:12:58 - Why Bryan doesn't believe in the singularity

01:17:50 - The AI backlash

01:19:18 - The controversial case: open AI is safer than closed

1h15820

Matt Turck@mattturck

This fascinating conversation with @ctnzr of @NVIDIAAI is also available on Spotify, Apple Podcasts and here on YouTube:

https://youtu.be/Oojrfdl42LI?si=K0XQA_23alPXViy4

2h6124

Matt Turck@mattturck

@ctnzr @TurnerNovak @nvidia I certainly was inspired! Is this one we chatted about?

https://www.youtube.com/watch?v=naFaMi0gQlY

Bryan Catanzaro@ctnzr

@TurnerNovak @mattturck @nvidia good for inspiration!

1h3220

Mikhail Rogov@i_mika_el

@mattturck @ctnzr @nvidia yeah, free models make sense if they pull more workloads onto NVIDIA infra. did Bryan talk about where Nemotron fits vs open models?

2h181

Ferbin@Ferbin08

@mattturck @ctnzr @nvidia Models are free. Chips aren't. Everyone building on them needs NVIDIA chips.

That's the play.

2h16

Radar IA@SoyRadarIA

@ctnzr @NVIDIAAI Si la IA se vuelve más abierta, ¿la innovación acelerará más rápido o aumentarán los problemas de seguridad y control?

3h6