Latent Adversarial Regularization for Offline Preference Optimization Paper • 2601.22083 • Published 2 days ago • 10
AlphaApollo: Orchestrating Foundation Models and Professional Tools into a Self-Evolving System for Deep Agentic Reasoning Paper • 2510.06261 • Published Oct 5, 2025 • 6