gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_5 Viewer • Updated Oct 14, 2025 • 2.17k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_4 Viewer • Updated Oct 14, 2025 • 2.11k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_3 Viewer • Updated Oct 14, 2025 • 2.14k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_2 Viewer • Updated Oct 13, 2025 • 2.16k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_1 Viewer • Updated Oct 13, 2025 • 2.1k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_10 Viewer • Updated Oct 3, 2025 • 1.11k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_9 Viewer • Updated Oct 3, 2025 • 1.14k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_10 Viewer • Updated Oct 3, 2025 • 1.03k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_8 Viewer • Updated Oct 3, 2025 • 1.12k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_9 Viewer • Updated Oct 3, 2025 • 1.08k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_7 Viewer • Updated Oct 3, 2025 • 1.08k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_8 Viewer • Updated Oct 2, 2025 • 1.11k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_6 Viewer • Updated Oct 2, 2025 • 1.09k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_7 Viewer • Updated Oct 2, 2025 • 1.09k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_5 Viewer • Updated Oct 2, 2025 • 1.15k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_cpo_iteration_6 Viewer • Updated Oct 2, 2025 • 1.02k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_4 Viewer • Updated Oct 2, 2025 • 1.08k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_3 Viewer • Updated Oct 2, 2025 • 1.06k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_2 Viewer • Updated Oct 2, 2025 • 1.08k • 2
gupta-tanish/Qwen2.5-math-1.5B-Instruct_method_mpo_iteration_1 Viewer • Updated Oct 2, 2025 • 1.09k • 1
gupta-tanish/Qwen2.5-math-1.5B-Instruct_zero_variance_stats_method_cpo_iteration_1_zero_var_filter_th_0.5 Viewer • Updated Oct 1, 2025 • 83 • 1