This is wild if true:
"- Do Chinese models generate more vulnerable code based on who is asking?
- Do Chinese models refuse to engage with political topics that are sensitive in China?
- Does the model’s country of origin affect code quality and content behavior?
In short: yes, on all counts. Our testing revealed two core findings:
1. Chinese LLMs produce more vulnerable code when prompted with a U.S. government persona than without, and the vulnerabilities are highly obfuscated.
2. Chinese LLMs inject PRC-aligned political bias into both the answers and code they generate."
They aren't sure if these issues are intentionally introduced, but "Chinese models refused tasks Beijing deems politically sensitive".
