Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning Paper • 2601.07641 • Published 24 days ago • 46
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published 23 days ago • 146
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 22 days ago • 86
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published 24 days ago • 51
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 22 days ago • 126
Aster2024/swift-reasoning-rollouts-deepscaler-ministral8b Viewer • Updated 23 days ago • 10k • 39 • 2
CLUE: Non-parametric Verification from Experience via Hidden-State Clustering Paper • 2510.01591 • Published Oct 2, 2025 • 28