/AI · 1h ago

AI commentator @scaling01 finds Qwen 3.7 Plus beats GPT-5.4 and Claude-Opus-4.6 on the LisanBench reasoning suite

The model doubled Qwen 3.6 Plus's CritPt metric score.

Original post

Lisan al Gaib (@scaling01) · #980 in AI

Qwen 3.7 Plus Benchmark

10:32 AM · Jun 1, 2026 · 8.5K Views

Sentiment

Positive users praise Qwen3.7-Plus for its multimodal benchmark wins and versatile agent features like vision-language unification, while negative users object to the lack of open weights.

Positive: 83.3%
Negative: 16.7%

8 comments with sentiment.

Posts from X

Most Activity

Views: 49.9K · Bookmarks: 181 · Likes: 1.2K · Retweets: 121 · Replies: 84

Qwen (@Alibaba_Qwen)

👏👏 Introducing Qwen3.7-Plus — a multimodal agent model that unifies vision and language into one versatile agent foundation.

✅ Multimodal interactive hybrid agent: unified GUI & CLI operation across visual and text tasks
✅ Versatile coding agent & productivity assistant with full-modality input
✅ Visual Agent: perception, reasoning, grounding, and search-augmented QA
✅ Cross-harness generalization across diverse agent frameworks

One model. Sees, thinks, codes, acts.🙌🙌

Now available via API on Alibaba Cloud Model Studio. Try it — let us know what you build.😎

🔗🔗⬇️⬇️
Blog: https://qwen.ai/blog?id=qwen3.7-plus
Qwen Studio: https://chat.qwen.ai/?models=qwen3.7-plus
API: https://modelstudio.console.alibabacloud.com/ap-southeast-1?tab=doc#/doc/?type=model&url=2840914_2&modelId=qwen3.7-plus&serviceSite=international
