BINEVAL framework evaluates LLMs using atomic binary questions to outperform G-Eval and UniEval