reinforce-flow/qwen2.5math-1.5b-global-positive-iter-200
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1420
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-180
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-clip-iter-520
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1400
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-160
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1380
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-140
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-clip-iter-500
Text Generation
•
2B
•
Updated
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1360
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-120
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1340
Text Generation
•
2B
•
Updated
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1320
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1300
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1280
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1260
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1240
Text Generation
•
2B
•
Updated
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1220
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1200
Text Generation
•
2B
•
Updated
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1180
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1160
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-100
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-80
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1140
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-60
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1120
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-40
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1100
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-global-positive-iter-20
Text Generation
•
2B
•
Updated
•
1
reinforce-flow/qwen2.5math-1.5b-gen8-global-meanvar-nostd-iter-1080
Text Generation
•
2B
•
Updated
•
1