Meta FAIR's François Fleuret jokes about new model architectures failing against vanilla transformers once normalized for FLOPs and memory · Digg