sainikhiljuluri2015/GPT-OSS-Cybersecurity-20B-Merged Text Generation • 21B • Updated Dec 5, 2025 • 83 • 3
Running on CPU Upgrade Featured 2.81k The Smol Training Playbook 📚 2.81k The secrets to building world-class LLMs
agentica-org/DeepCoder-14B-Preview Text Generation • 15B • Updated May 11, 2025 • 808 • • 681
Congliu/Chinese-DeepSeek-R1-Distill-data-110k Viewer • Updated Feb 21, 2025 • 110k • 390 • 717
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12
Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 62
VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation Paper • 2412.02259 • Published Dec 3, 2024 • 60
Open-Sora Plan: Open-Source Large Video Generation Model Paper • 2412.00131 • Published Nov 28, 2024 • 33
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
hiiamsid/sentence_similarity_spanish_es Sentence Similarity • 0.1B • Updated Jun 20, 2024 • 157k • • 50
Trelis/Mixtral-8x7B-Instruct-v0.1-function-calling-v3 Text Generation • 47B • Updated Jan 10, 2024 • 32