15h agoQwen Team releases Qwen-VLA, a vision-language-action model achieving a 97.9% success rate on the LIBERO robotic benchmarkThe model uses a Diffusion Transformer-based action decoder