Elie Bakouch says Mistral AI co-founder Guillaume Lample's FP8 training run matches the learning rate schedule of DeepSeek-V2 · Digg