/AI1h ago

AI commentator @scaling01 finds Qwen 3.7 Plus beats GPT-5.4 and Claude-Opus-4.6 on the LisanBench reasoning suite

The model doubled Qwen 3.6 Plus's CritPt metric score.

--0--
Original post
Lisan al Gaib@scaling01#980inAI

Qwen 3.7 Plus Benchmark

10:32 AM · Jun 1, 2026 · 8.5K Views
Sentiment
Sentiment unavailable for this story.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
VIEWS49.9KBOOKMARKS181LIKES1.2KRETWEETS121REPLIES84
Qwen@Alibaba_Qwen

👏👏 Introducing Qwen3.7-Plus — a multimodal agent model that unifies vision and language into one versatile agent foundation.

✅ Multimodal interactive hybrid agent: unified GUI & CLI operation across visual and text tasks ✅ Versatile coding agent & productivity assistant with full-modality input ✅ Visual Agent: perception, reasoning, grounding, and search-augmented QA ✅ Cross-harness generalization across diverse agent frameworks

One model. Sees, thinks, codes, acts.🙌🙌

Now available via API on Alibaba Cloud Model Studio. Try it — let us know what you build.😎

🔗🔗⬇️⬇️ Blog:https://qwen.ai/blog?id=qwen3.7-plus Qwen Studio:https://chat.qwen.ai/?models=qwen3.7-plus API:https://modelstudio.console.alibabacloud.com/ap-southeast-1?tab=doc#/doc/?type=model&url=2840914_2&modelId=qwen3.7-plus&serviceSite=international

1hViews 49.9KLikes 1.2KBookmarks 181
AI commentator @scaling01 finds Qwen 3.7 Plus beats GPT-5.4 and Claude-Opus-4.6 on the LisanBench reasoning suite · Digg