Memory might be the most important outstanding problem for modeling + learning alone; there are other key issues like tactile/multimodal but those require hardware and data collection innovation. We should be able to solve memory *now.*
Cool to see a benchmark targeting it!
🎉 We released MIKASA-Robo-VLA v1.0.0 — a benchmark suite for studying memory in Vision-Language-Action (VLA) policies for tabletop robotic manipulation.
https://mikasarobo.github.io/
🧠 The goal is simple: make memory evaluation in robotic manipulation more systematic. 👇


