This isn't private information, but with the first Vera Rubin systems rolling out while many people are still using Hoppers (at record prices), it's crazy to think how big of an upgrade moving from Hopper -> Vera Rubin is.
48x memory bandwidth?! 31x NVLink bandwidth?! 3.5x more memory?!
At face value, this means it's actually faster to read memory from another GPU inside a Rubin NVL8 node than it is to just read the local memory on an H100.
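Here's a rough back-of-the-envelope sketch of that face-value comparison. The two bandwidth numbers are my assumptions based on commonly quoted vendor peak figures (about 3.35 TB/s for H100 SXM HBM3, about 3.6 TB/s per GPU for Rubin NVLink), not measurements, and the helper name `transfer_ms` is made up for illustration. NVLink figures are typically quoted as total bidirectional bandwidth, so a one-way read may well see less, and none of this accounts for latency or protocol overhead.

```python
# Peak bandwidth figures in TB/s. Both are ASSUMED, approximate vendor peak
# numbers, not measurements; swap in datasheet values before trusting them.
H100_HBM_TBPS = 3.35      # H100 SXM local HBM3 bandwidth (assumed)
RUBIN_NVLINK_TBPS = 3.6   # Rubin per-GPU NVLink bandwidth (assumed)


def transfer_ms(num_gb: float, tbps: float) -> float:
    """Idealized time in milliseconds to move num_gb gigabytes at tbps TB/s."""
    seconds = num_gb / (tbps * 1000.0)  # 1 TB/s = 1000 GB/s
    return seconds * 1000.0


payload_gb = 80.0  # hypothetical payload, roughly one H100's worth of memory

local_ms = transfer_ms(payload_gb, H100_HBM_TBPS)
remote_ms = transfer_ms(payload_gb, RUBIN_NVLINK_TBPS)

print(f"H100 local HBM read:    {local_ms:.2f} ms")
print(f"Rubin NVLink peer read: {remote_ms:.2f} ms")
```

With those assumed numbers the peer read comes out slightly ahead, which is all the claim needs. If you plug in a per-direction NVLink figure instead, the result can flip.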
Shit is going to be actually wild in the next couple of years. The cost of training monster models is about to go way down, and so is the cost of tokens.