Bartosz Cywiński
bcywinski
AI & ML interests
Mechanistic Interpretability
Recent Activity
published
a model
1 day ago
bcywinski/DeepSeek-R1-Distill-Llama-70B-saes
updated
a model
1 day ago
bcywinski/qwen3-32b-saes
published
a model
4 days ago
bcywinski/qwen3-32b-saes
Organizations
None yet
Eliciting Secret Knowledge from Language Models
https://arxiv.org/abs/2510.01070
gemma-2-9b-it-user-gender