Did you know?
Pangram learns to tell Claude, ChatGPT, and Gemini apart in its internal representations, even though it was never trained to do so!
This signal becomes increasingly recoverable deeper into the network, where a simple linear probe reaches 91% accuracy!
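
Curious what a linear probe looks like? Here is a minimal sketch, not Pangram's actual code or data. It trains a plain softmax classifier on frozen "activations" from each layer and checks how well the source model can be read out. Everything in it is an assumption made for illustration: the activations are synthetic Gaussians, the `separation` knob stands in for how strongly a layer encodes the source, and the function names are hypothetical.

```python
import math
import random

SOURCES = ["Claude", "ChatGPT", "Gemini"]  # class labels the probe tries to recover


def make_layer_activations(separation, rng, n_per_class=60, dim=6):
    """Synthetic stand-in for one layer's hidden states (not real model data)."""
    data = []
    for label in range(len(SOURCES)):
        for _ in range(n_per_class):
            x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
            x[label] += separation  # assumed: class signal lies along one axis
            data.append((x, label))
    rng.shuffle(data)  # avoid feeding SGD one class at a time
    return [x for x, _ in data], [y for _, y in data]


def logits(W, b, x):
    return [sum(w_j * x_j for w_j, x_j in zip(w, x)) + b_k for w, b_k in zip(W, b)]


def softmax(zs):
    m = max(zs)  # subtract the max for numerical stability
    es = [math.exp(z - m) for z in zs]
    total = sum(es)
    return [e / total for e in es]


def train_linear_probe(xs, ys, n_classes, epochs=30, lr=0.1):
    """Fit a softmax (multinomial logistic) classifier with plain SGD."""
    dim = len(xs[0])
    W = [[0.0] * dim for _ in range(n_classes)]
    b = [0.0] * n_classes
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            p = softmax(logits(W, b, x))
            for k in range(n_classes):
                grad = p[k] - (1.0 if k == y else 0.0)  # d(cross-entropy)/d(logit_k)
                b[k] -= lr * grad
                for j in range(dim):
                    W[k][j] -= lr * grad * x[j]
    return W, b


def accuracy(W, b, xs, ys):
    correct = 0
    for x, y in zip(xs, ys):
        z = logits(W, b, x)
        correct += int(z.index(max(z)) == y)
    return correct / len(ys)


def probe_by_layer(separations, seed=0):
    """Train one probe per simulated layer and return held-out accuracies."""
    rng = random.Random(seed)
    results = []
    for sep in separations:
        train_x, train_y = make_layer_activations(sep, rng)
        test_x, test_y = make_layer_activations(sep, rng)
        W, b = train_linear_probe(train_x, train_y, len(SOURCES))
        results.append(accuracy(W, b, test_x, test_y))
    return results


# Assumed separations for an early, a middle, and a late layer.
layer_accuracies = probe_by_layer([0.0, 1.0, 3.0])
for depth, acc in enumerate(layer_accuracies):
    print(f"layer {depth}: probe accuracy = {acc:.2f}")
```

The probe itself has no hidden layers, so any accuracy above chance has to come from information that is already linearly readable in the activations. That is what makes it a useful test of what a network has picked up along the way.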









