You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Code for "Learning to Read Out: Unembedding Dynamics in Language Model Pretraining" — parameter-trajectory crosscoders on the unembedding matrix W_U. Pretrained crosscoders: hf.co/hematteo/parameter-trajectory-crosscoders
Supplementary code & results for "Variant-specific crosscoder features are seed-stable but not detectably task-causal in a GRPO-LoRA math setting" (ICML 2026 Mech Interp Workshop, Spotlight)