Cbgcbg/qwen3-1.7b-math-sft-antioverfitting-20250724_165951 Text Generation • 2B • Updated Jul 25, 2025 • 4
Cbgcbg/qwen3-8b-math-full-sft-epoch11-20250725_161659 Text Generation • 8B • Updated Jul 25, 2025 • 5
TMLR-Group-HF/Self-Certainty-Qwen3-1.7B-Base-MATH Text Generation • 2B • Updated Oct 11, 2025 • 9 • 1
shivash/enhanced-hybrid-transformer-768d-trained-thinking Text Generation • 0.1B • Updated Sep 24, 2025
TMLR-Group-HF/Majority-Voting-Llama-3.2-3B-Instruct-DAPO14k Text Generation • 4B • Updated Oct 11, 2025 • 14
mradermacher/Self-Certainty-Qwen3-1.7B-Base-MATH-GGUF Reinforcement Learning • 2B • Updated Oct 11, 2025 • 158 • 1