- When Does Reasoning Matter? A Controlled Study of Reasoning's Contribution to Model Performance
  Paper • 2509.22193 • Published • 37
- When-Does-Reasoning-Matter/general-reasoning-ift-pairs
  Viewer • Updated • 2.97M • 191 • 3
- When-Does-Reasoning-Matter/math-reasoning-ift-pairs
  Viewer • Updated • 458k • 1.47k • 7
Collections
Collections including paper arxiv:2509.22193

- Open Data Synthesis For Deep Research
  Paper • 2509.00375 • Published • 70
- Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
  Paper • 2509.03403 • Published • 22
- LMEnt: A Suite for Analyzing Knowledge in Language Models from Pretraining Data to Representations
  Paper • 2509.03405 • Published • 23
- SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs
  Paper • 2509.00930 • Published • 4

- Snowflake/Arctic-Text2SQL-R1-7B
  8B • Updated • 12k • 56
- Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
  Paper • 2505.24726 • Published • 277
- Reinforcement Pre-Training
  Paper • 2506.08007 • Published • 263
- Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
  Paper • 2506.16406 • Published • 127

- DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning
  Paper • 2504.07128 • Published • 86
- BM25S: Orders of magnitude faster lexical search via eager sparse scoring
  Paper • 2407.03618 • Published • 13
- Deep Think with Confidence
  Paper • 2508.15260 • Published • 88
- R-Zero: Self-Evolving Reasoning LLM from Zero Data
  Paper • 2508.05004 • Published • 130