Why are open technologies for AI so important? How and why is NVIDIA building Nemotron? What can we learn from China鈥檚 AI efforts?
Great conversation with Matt Turck.
https://youtu.be/Oojrfdl42LI
The interview detailed hybrid Mamba-Transformer architectures and multi-token prediction.
Why are open technologies for AI so important? How and why is NVIDIA building Nemotron? What can we learn from China鈥檚 AI efforts?
Great conversation with Matt Turck.
https://youtu.be/Oojrfdl42LI
Users praise the NVIDIA interview on Nemotron models and open AI strategy as a fascinating conversation and share links to Spotify, Apple Podcasts, and YouTube for easy access.
No Digg Deeper questions have been answered for this story yet.
Inside Nemotron and NVIDIA's AI lab: my conversation with Bryan Catanzaro (@ctnzr).
@nvidia is a chip company. So why does it put hundreds of researchers on building AI models - and then give them away for free? We go deep into the Nemotron models, what it takes to build a top AI lab, and the future of frontier AI.
01:33 - Is open source AI catching the frontier?
05:29 - Do closed labs blocking distillation slow open source down?
07:42 - Is the US falling behind China?
10:30 - Why companies actually choose open models
12:39 - A "crazy" 2008 bet: machine learning on GPUs
15:33 - Working with Andrew Ng and Dario Amodei at Baidu
17:41 - Coming back to NVIDIA: DLSS and the birth of Megatron
21:55 - The real reason NVIDIA builds its own models
24:28 - Is Moore's Law really dead?
33:37 - The Nemotron family: Nano, Super, Ultra
35:09 - Built for agents: why NVIDIA bets on speed
36:02 - How you train a 550B model in 4 bits
39:25 - Hybrid Mamba-Transformer, explained simply
42:31 - Mixture of experts, and why NVIDIA built NVL72 around it
47:26 - Why a 1-million-token context window matters
49:26 - Multi-token prediction: how the model predicts 5 tokens at once
52:47 - Multi-teacher distillation: teaching one model from many
58:01 - Where reinforcement learning goes next
01:00:16 - Inside NVIDIA's research org: "the mission is the boss"
01:04:03 - How NVIDIA decides who gets the GPUs
01:10:53 - Why NVIDIA still feels entrepreneurial after 33 years
01:12:58 - Why Bryan doesn't believe in the singularity
01:17:50 - The AI backlash
01:19:18 - The controversial case: open AI is safer than closed
@mattturck @ctnzr @nvidia Bro has the most pianos in an office I've ever seen
Inside Nemotron and NVIDIA's AI lab: my conversation with Bryan Catanzaro (@ctnzr).
@nvidia is a chip company. So why does it put hundreds of researchers on building AI models - and then give them away for free? We go deep into the Nemotron models, what it takes to build a top AI lab, and the future of frontier AI.
01:33 - Is open source AI catching the frontier?
05:29 - Do closed labs blocking distillation slow open source down?
07:42 - Is the US falling behind China?
10:30 - Why companies actually choose open models
12:39 - A "crazy" 2008 bet: machine learning on GPUs
15:33 - Working with Andrew Ng and Dario Amodei at Baidu
17:41 - Coming back to NVIDIA: DLSS and the birth of Megatron
21:55 - The real reason NVIDIA builds its own models
24:28 - Is Moore's Law really dead?
33:37 - The Nemotron family: Nano, Super, Ultra
35:09 - Built for agents: why NVIDIA bets on speed
36:02 - How you train a 550B model in 4 bits
39:25 - Hybrid Mamba-Transformer, explained simply
42:31 - Mixture of experts, and why NVIDIA built NVL72 around it
47:26 - Why a 1-million-token context window matters
49:26 - Multi-token prediction: how the model predicts 5 tokens at once
52:47 - Multi-teacher distillation: teaching one model from many
58:01 - Where reinforcement learning goes next
01:00:16 - Inside NVIDIA's research org: "the mission is the boss"
01:04:03 - How NVIDIA decides who gets the GPUs
01:10:53 - Why NVIDIA still feels entrepreneurial after 33 years
01:12:58 - Why Bryan doesn't believe in the singularity
01:17:50 - The AI backlash
01:19:18 - The controversial case: open AI is safer than closed

This fascinating conversation with @ctnzr of @NVIDIAAI is also available on Spotify, Apple Podcasts and here on YouTube:
https://youtu.be/Oojrfdl42LI?si=K0XQA_23alPXViy4
@ctnzr @TurnerNovak @nvidia I certainly was inspired! Is this one we chatted about?
https://www.youtube.com/watch?v=naFaMi0gQlY
@TurnerNovak @mattturck @nvidia good for inspiration!

@mattturck @ctnzr @nvidia yeah, free models make sense if they pull more workloads onto NVIDIA infra. did Bryan talk about where Nemotron fits vs open models?

@mattturck @ctnzr @nvidia Models are free. Chips aren't. Everyone building on them needs NVIDIA chips.
That's the play.

@ctnzr @NVIDIAAI Si la IA se vuelve m谩s abierta, 驴la innovaci贸n acelerar谩 m谩s r谩pido o aumentar谩n los problemas de seguridad y control?