Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning Paper • 2305.18459 • Published May 29, 2023
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems Paper • 2312.01127 • Published Dec 2, 2023
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention Paper • 2503.12734 • Published Mar 17
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders Paper • 2506.14002 • Published Jun 16 • 5
On Computation and Generalization of Generative Adversarial Imitation Learning Paper • 2001.02792 • Published Jan 9, 2020 • 1
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published Sep 30 • 19
Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning Paper • 2510.14095 • Published Oct 15 • 5
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation Paper • 2510.15624 • Published Oct 17 • 14