- Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale
  📝 37 • Generate text using extremely small yet powerful language models
- Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
  Paper • 2601.04890 • Published • 41
- tiiuae/Falcon-H1-Tiny-90M-Instruct
  Text Generation • 91.1M • Updated • 115k • 26
- tiiuae/Falcon-H1-Tiny-90M-Instruct-GGUF
  91.1M • Updated • 3.81k • 11
Collections including paper arxiv:2601.04890
- lusxvr/nanoVLM-222M
  Image-Text-to-Text • 0.2B • Updated • 125 • 98
- Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
  Paper • 2503.09516 • Published • 38
- AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
  Paper • 2505.24863 • Published • 97
- QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
  Paper • 2505.17667 • Published • 88

- UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
  Paper • 2410.14059 • Published • 63
- Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
  Paper • 2503.05179 • Published • 46
- Token-Efficient Long Video Understanding for Multimodal LLMs
  Paper • 2503.04130 • Published • 96
- GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing
  Paper • 2503.10639 • Published • 53

- TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
  Paper • 2512.16093 • Published • 95
- Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
  Paper • 2511.22699 • Published • 237
- DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
  Paper • 2512.16676 • Published • 217
- Sharp Monocular View Synthesis in Less Than a Second
  Paper • 2512.10685 • Published • 28

- Falcon H1 Playground
  🦅 33 • Chat with different AI models and adjust their behavior
- Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance
  Paper • 2507.22448 • Published • 70
- Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
  Paper • 2601.04890 • Published • 41
- tiiuae/Falcon-H1-0.5B-Base
  Text Generation • 0.5B • Updated • 20.8k • 16

- facebook/w2v-bert-2.0
  Feature Extraction • 0.6B • Updated • 2.72M • 202
- facebook/metaclip-h14-fullcc2.5b
  Zero-Shot Image Classification • 1.0B • Updated • 22.5k • 49
- openai/clip-vit-large-patch14
  Zero-Shot Image Classification • 0.4B • Updated • 7.96M • 1.96k
- Salesforce/blip-image-captioning-large
  Image-to-Text • 0.5B • Updated • 681k • 1.45k