Meta FAIR researchers release paper on AIRA-Compose and AIRA-Design, a two-agent system that autonomously discovers neural architectures beyond Transformers and surpasses Llama 3.2 at 350M, 1B, and 3B scales
Models posted May 15 show lower validation loss on MAD, BabiStories, and DCLM.
——0——
How can we help AI scientists train up their own LLM engine? I’m pleased to share our work on AI Research Agents discovering novel language modeling architectures, showing competitive performance when scaled up at the 1B parameter size: https://arxiv.org/abs/2605.15871

10:08 AM · May 19, 2026 · 1.2K Views