TMLR-Group-HF/Co-rewarding-I-Llama-3.2-3B-Instruct-MATH Text Generation • 4B • Updated Oct 11, 2025 • 9
TMLR-Group-HF/Self-Certainty-Qwen3-1.7B-Base-MATH Text Generation • 2B • Updated Oct 11, 2025 • 9 • 1
TMLR-Group-HF/Self-Certainty-Llama-3.2-3B-Instruct-MATH Text Generation • 4B • Updated Oct 11, 2025 • 5