NearID: Identity Representation Learning via Near-identity Distractors • Paper • 2604.01973 • Published Apr 2 • 32
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers • Paper • 2601.04890 • Published Jan 8 • 44