Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing Paper • 2602.03845 • Published 6 days ago • 24
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning Paper • 2510.01444 • Published Oct 1, 2025 • 20
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper • 2509.09675 • Published Sep 11, 2025 • 28
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9, 2025 • 103
R1-RE: Cross-Domain Relationship Extraction with RLVR Paper • 2507.04642 • Published Jul 7, 2025 • 7
Learning to Reason via Mixture-of-Thought for Logical Reasoning Paper • 2505.15817 • Published May 21, 2025 • 18
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation Paper • 2503.06594 • Published Mar 9, 2025 • 6
Towards Optimal Multi-draft Speculative Decoding Paper • 2502.18779 • Published Feb 26, 2025 • 5
Asymmetric Conflict and Synergy in Post-training for LLM-based Multilingual Machine Translation Paper • 2502.11223 • Published Feb 16, 2025 • 1