My rationale for comparing to Zou et al. is that both papers focused on doing two general things via representations -- (1) monitoring/diagnostics, and (2) interventions. I think that Zou et al. was more practically useful, though (focusing on safety) and more methodologically rich (using nonlinear methods).
I think I would change my tune if Goodfire just put this out and framed it as a blog post about demonstrating what you can do with UMAP and linear probes in this kind of domain. But that's definitely not what it did. I think GF has a pattern here. It has a really big marketing-to-substance ratio in its research, which it claims is of engineering and safety value. I don't mind people doing demos like this. I do mind the way that I believe GoodFire has a pattern of overclaiming, underdelivering, and safetywashing.
RE: mechanistic validation, my points behind 1 and 3 above were against the idea that the claims actually were mechanistically validated.
@StephenLCasper @GoodfireAI I don't understand why you compare to Zou et al. The main contribution is clearly a demo applying mostly-established tools (through their manifold framework) to find latent narrative analysis (with mechanistic validation)? Do you just think digital humanities work is worthless?