DTU researchers challenge Stanford NLP's AXBENCH findings, claiming sparse autoencoders can outperform simple baselines for steering LLMs · Digg