Interactive Training: Feedback-Driven Neural Network Optimization Paper • 2510.02297 • Published Oct 2, 2025 • 42
friendshipkim/ASPO-Qwen2.5-Math-7B-deepmath-len8192-overlong4096-diversity0.25-gen1536-hfverify-step200 8B • Updated Sep 26, 2025
friendshipkim/ASPO-Qwen2.5-Math-7B-deepmath-len8192-overlong4096-diversity0.25-gen1536-hfverify-step190 8B • Updated Sep 26, 2025 • 2