Alibaba's Qwen team disputes researcher Tianhang Zhu's claims of leading its reinforcement learning development · Digg


Posts from X


Views: 1.9K · Bookmarks: 4 · Likes: 20 · Replies: 3

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

The plot thickens! I guess Tianhang can boast of his RL contributions very legitimately. PPO back then was a really high-risk, high-effort choice. I apologize.

Zheng Yuan@GanjinZero

@ChujieZheng Tianhang is the first guy who successfully trained first-gen Qwen with PPO and showed decent performance improvement.



tianhang zhu@TianhangZhuzth

@ChujieZheng i was responsible for delivery of the RL ckpt. so i think head of rl is accurate description for people to quickly understand what i do

yes i left after first qwen release.

my linkedin https://www.linkedin.com/in/bobzhu has exact details of what i did for years. welcome if interested


Susan Zhang@suchenzang

alright little bro, here's some free advice you didn't ask for:

for reasons too long to go into here, you don't get the luxury of the benefit-of-doubt in backdating whatever claims of leadership you want to make. other people might get away with it, but not you. unfair, i know.

so you can either be remembered in this hellsite as the guy-who-hello-worlded-a-viral-intro-and-immediately-got-community-noted as a liar...

or delete this bit of internet mishap, and all will be forgotten soon.

you can certainly also double down on these claims and hope for the best, but i assure you the cons of being remembered as a liar far outweigh the benefits, especially when you don't have anything comparable to show for since then.

you'll get another shot at fame, you're already well-positioned for it, so don't waste your integrity on small things like this.

glhfdd


Zac@Zac_labs

@TianhangZhuzth @ChujieZheng Responsible for and “head of” r two different things


Alex@margincalm

@TianhangZhuzth @ChujieZheng bro giving himself titles


old school degen@notyetadegen

@Zac_labs @TianhangZhuzth @ChujieZheng Probably a cultural/language issue. Boss is responsible for the post he guards and taking care of / disciplining his men


Susan Zhang@suchenzang

@teortaxesTex more thickening, poor kid

Zekun Wang@kugwzk1

@TianhangZhuzth I noticed that the Qwen2.5/3 and Qwen2.5-Math technical reports appear on your Google Scholar, but I could not find your name in the author lists. Could you clarify your role in these works?


Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

@suchenzang but consider also

Zekun Wang@kugwzk1

I just reviewed the Qwen & Qwen2 tech reports. Qwen1 definitely used PPO, but I’m curious why Qwen2 switched to DPO instead of sticking with it. Because Tianhang left? PPO is a technique used in GLM 5.2! btw I’d really love to know what he actually worked on for Qwen2.5 and Qwen3. 👀


Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

I expect a lot of middling, striverist Chinese from their leading labs to capitalize on this popular idea that "the best and brightest flee to America! hehe brain drain!" and defraud smug wypipos of their money. No, Wenfeng isn't coming. honest advice, learn mafs yourselves

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

oh brother


Louis@Louis9687221579

@teortaxesTex he is on the qwen 1 and 2 technical reports and left before 2.5 to join yi 01ai. Google Scholar automatically updates author profiles, so it could just be a faulty update 🤷‍♂️. Not to say he's clean though 👀


あいしてる@huangfeihong199

@TianhangZhuzth @ChujieZheng Classic, he even gets to give himself the title


PeterSullivanish@SpaceGrenadier

@teortaxesTex Did the RL bro get a $100 million comp package? 🤣😅


Arjun@arjunkocher

@teortaxesTex he did


JGS_2016@JGS_2016

@TianhangZhuzth @ChujieZheng Staying humble and careful matters; don't let this blow up on you. Best wishes


Bitmart92 - 返佣沟通 · 陆青丨92返利直开 ◆@memoscopio

@teortaxesTex A stunt like this really is hard to keep a straight face at


呂OKXx蝙奊50饭@regnaer

@teortaxesTex In the end it has to be the hardcore technical crowd, they really deliver


Max For AI@MaxForAI

@teortaxesTex Lmao🤣


Sunny•开门通道bitmart92返佣 🪙@bhamandcheese

@teortaxesTex This one really is pure gold. Have to admit the big shot has a sharp eye


Sufyan@sfxnz

@teortaxesTex And there was I getting excited


androolloyd.hl@androolloyd

@suchenzang @TianhangZhuzth @ChujieZheng Sir this is a Wendy’s.
