5h ago
NVIDIA details Nemotron 3 Ultra 550B pre-training using NVFP4, LatentMoE, and 20 trillion tokens
NVFP4 training achieved a 0.4% relative loss gap vs BF16.
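
For readers unfamiliar with the metric, here is a minimal sketch of how a relative loss gap could be computed. The item does not give the formula, so this assumes the common definition: the difference between the low-precision loss and the BF16 baseline loss, divided by the baseline. The function name `relative_loss_gap` and both loss values are hypothetical, chosen only so the arithmetic lands on 0.4%. They are not NVIDIA's reported numbers.

```python
def relative_loss_gap(loss_low_precision: float, loss_baseline: float) -> float:
    """Return (low_precision - baseline) / baseline as a fraction.

    Assumed definition; the source does not state how the gap is measured.
    """
    if loss_baseline <= 0:
        raise ValueError("baseline loss must be positive")
    return (loss_low_precision - loss_baseline) / loss_baseline


# Hypothetical final training losses, for illustration only.
bf16_loss = 2.000
nvfp4_loss = 2.008

gap = relative_loss_gap(nvfp4_loss, bf16_loss)
print(f"relative loss gap: {gap:.2%}")
```

Under this definition, a 0.4% gap means the NVFP4 run's loss sits 0.4% above the BF16 run's loss at the comparison point. It does not mean a 0.4 difference in absolute loss.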