1d agoDTU researchers challenge Stanford NLP's AXBENCH findings, claiming sparse autoencoders can outperform simple baselines for steering LLMsThe debate recalled past feedback that led to GATv2