@davidmanheim I look at the slopfest in popular media and wonder
@krishnanrohit Yeah, but this goes a step beyond blending the pretraining with the average of layperson-provided RLHF, to the point where experts like it. If it's interpolating to find an average, you'd expect it to do worse. (Or is literary beauty like human faces, where averages are the ideal?)
