
Zhihu is developing reinforcement learning environments for AI training as domestic RL tooling surges in China

Story Overview

Zhihu's SoTALab team, positioned as the platform's expert intelligence layer, has confirmed it is building expert-crafted training assets alongside reinforcement learning environments, describing the work as still early. The effort surfaces alongside visible activity from several Chinese labs and companies exploring similar RL tooling for agents and LLMs, though no specifics on domains, benchmarks, or release plans have been shared yet.


Original post

Zephyr @zephyr_z9

Lots of RL environment providers are popping up in China now

Xiangyi Li @xdotli

Zhihu, China Reddit, is doing RL environments now

6:53 AM · Jun 25, 2026 · 25K Views

Industry Shift

Domestic tooling efforts are running in parallel

ByteDance, Zhipu AI, Tencent, and Alibaba have each released or discussed RL frameworks aimed at rollout efficiency, multi-turn agent training, and long-horizon tasks, creating a pattern of simultaneous domestic development without any single project claiming dominance.

Open Question

Details on access or scope stay out of reach for now

No information has surfaced on whether these environments will be open-sourced, offered via API, kept internal, or tied to Zhihu's Q&A product, leaving their eventual reach and integration path unclear.

Sentiment

Commenters with positive sentiment welcomed the news as overdue. Those with negative sentiment either dismissed the effort as an unoriginal copy of American tech or called the comparison of Zhihu to Reddit "insane."

Pos: 33.3% · Neg: 66.7%

3 comments with sentiment.


Posts from X

Most Activity

Views: 565 · Likes: 6 · Replies: 1

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) @teortaxesTex

@zephyr_z9 at last


Babyzitong @babyzitong

@zephyr_z9 My current market perspective.

👇 👇


gg @GG_XD_31

@zephyr_z9 Are these like gyms for AIs?


Maximus @maximus_v0

Loaded some $zh on this. Just need one partnership announcement with Zai or any lab to completely rerate the company (a man can dream...)

Kind of surprised they don't already have agreements with LLM labs. They do have one of the highest quality corpus on the Chinese internet, but very poorly monetized at the moment.


pete @ProfitFry

@teortaxesTex @zephyr_z9 needed to copy the American stuff first


Niao Dan @NiaoDan6

@zephyr_z9 calling zhihu china reddit is insane


Cobalt @cobalt661

@zephyr_z9 Are you bullish $ZH?
