Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr Text Generation • 2B • Updated Feb 4, 2025 • 1
AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx Text Generation • 2B • Updated Feb 23, 2025 • 62 • 3
AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx Text Generation • 1B • Updated Feb 23, 2025 • 13