SAE-Reasoning - a andreuka18 Collection

andreuka18 's Collections

updated Mar 31, 2025

Models and datasets used in the paper "Interpreting Reasoning Features in Large Language Models via Sparse Autoenoder": https://arxiv.org/abs/2503.188

Upvote

andreuka18/DeepSeek-R1-Distill-Llama-8B-lmsys-openthoughts-tokenized

Viewer • Updated Mar 31, 2025 • 781k • 63
andreuka18/deepseek-r1-distill-llama-8b-lmsys-openthoughts

Text Generation • Updated Mar 31, 2025 • 1
andreuka18/OpenThoughts-10k-DeepSeek-R1

Viewer • Updated Mar 31, 2025 • 10k • 41
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24, 2025 • 120

Upvote

Collection guide
Browse collections