SkillGrad: Optimizing Agent Skills Like Gradient Descent • Paper • 2605.27760 • Published 6 days ago • 24
The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation • Paper • 2605.21856 • Published 11 days ago • 8
Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time • Paper • 2509.12521 • Published Sep 15, 2025 • 5