Research engineer Hamel Husain argues that difficult-to-evaluate LLM outputs are a product design flaw rather than an evaluation methodology problem · Digg