2h ago

AXPO Method Improves Multimodal Agent Reasoning in Vision-Language Models

0
Original post

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

8:29 AM · May 28, 2026 View on X

paper: https://huggingface.co/papers/2605.28774

AKAK@_akhaliq

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

3:29 PM · May 28, 2026 · 3.6K Views
3:29 PM · May 28, 2026 · 2.9K Views