in principle, if you can harness this on purpose for world knowledge, you could RL models to be good at first-principles prediction of stuff *from premises* rather than from *memorized knowledge*. you'd have a massive, perfectly verifiable set of things that really happened to check predictions against
by this i mean: the model can end up retaining hillary clinton by accident while not knowing of bill clinton, and then it does shit like "hm... perhaps bill is related or married to her...? or the user is confused and was remembering hillary...?"
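rough sketch of what the "perfectly verifiable" part could look like as a reward, purely illustrative. `HeldOutFact`, `normalize` and `reward` are names i made up, and the substring match is a crude stand-in for a real grader, not how anyone actually does this:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HeldOutFact:
    """Hypothetical container: what the model sees vs. what really happened."""
    premises: str  # knowledge the model is allowed to keep
    question: str  # what it has to predict from those premises
    answer: str    # the real-world fact, hidden from the model


def normalize(text: str) -> str:
    # Lowercase and collapse whitespace so matching is not tripped up by formatting.
    return " ".join(text.lower().split())


def reward(prediction: str, fact: HeldOutFact) -> float:
    # Assumption: a binary reward, 1.0 if the held-out answer shows up
    # anywhere in the prediction, 0.0 otherwise.
    return 1.0 if normalize(fact.answer) in normalize(prediction) else 0.0


fact = HeldOutFact(
    premises="hillary clinton is a US politician.",
    question="who is bill clinton, most likely?",
    answer="married",
)

good = "hm... perhaps bill is related or married to her...?"
bad = "bill clinton is probably a brand of stationery"

print(reward(good, fact))  # 1.0
print(reward(bad, fact))   # 0.0
```

the point is just that the answer is a thing that really happened, so the check is mechanical and you never need a human or a judge model to decide who was right.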

