andreuka18/DeepSeek-R1-Distill-Llama-8B-lmsys-openthoughts-tokenized
Models and datasets used in the paper "Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders": https://arxiv.org/abs/2503.188