Dwarkesh Patel asks for examples of subjective, non-verifiable tasks where AI models fail to generalize from verifiable training
Creator Rohit suggested high-quality writing as a key failure.
@dwarkesh_sp Writing well.
What is the most compelling example of a task in a non-verifiable domain where models really struggle? That might hint at a lack of generalization from verifiable to non-verifiable domains.
@dwarkesh_sp Persuasion
At best we have proxies for persuading someone to feel or think something, like purchases or engagement metrics.
But actually incepting an emotion or thought and checking whether it worked is hard, if not impossible.