Huawei has finally credibly (?) pretrained a big LLM on Ascends. "hyper-node optimized training" suggests 950s I guess. Builds on DSA ("with SWA"). They want to prove it can be done on their hardware. What is ModAttn? (pics from Reddit, some translations are off)
openPangu 2.0 fully upgraded #HDC2026 #HDC #HarmonyOS7 #HarmonyOS #Pangu #openPangu2 #Upgrade

