QUOTE POST
#1115Rob Wiblin@ROBERTWIBLIN
Another banger from Owain, the man just can't stop producing hits.
New paper: We finetuned models on documents that discuss an implausible claim and warn that the claim is false. Models ended up believing the claim! Examples: 1. Ed Sheeran won the Olympic 100m 2. Queen Elizabeth II wrote a Python graduate textbook
4:06 PM · May 15, 2026 · 311.4K Views
9:50 AM · May 18, 2026 · 1.8K Views
QUOTE POST
#1389Jaime Sevilla@JSEVILLAMOL
This is so interesting. Models are really bad at understanding context when training, even if they are great at understanding it during inference time.
No context out-of-context.
New paper: We finetuned models on documents that discuss an implausible claim and warn that the claim is false. Models ended up believing the claim! Examples: 1. Ed Sheeran won the Olympic 100m 2. Queen Elizabeth II wrote a Python graduate textbook
4:06 PM · May 15, 2026 · 311.4K Views
10:10 AM · May 18, 2026 · 386 Views