SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations Paper • 2606.05563 • Published 6 days ago • 46
Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding Paper • 2605.02290 • Published May 4 • 41
UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs Paper • 2409.19898 • Published Sep 30, 2024 • 1
ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback Paper • 2503.21332 • Published Mar 27, 2025 • 23