-
SGI-Bench Leaderboard
🥇7Scientific General Intelligence of LLMs/vLLMs
-
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Paper • 2512.16969 • Published • 105 -
InternScience/SGI-DeepResearch
Viewer • Updated • 318 • 419 • 3 -
InternScience/SGI-IdeaGeneration
Viewer • Updated • 315 • 438 • 2
Collections
Discover the best community collections!
Collections including paper arxiv:2512.16969
-
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Paper • 2511.21678 • Published • 11 -
QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation
Paper • 2512.19134 • Published • 31 -
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Paper • 2512.16969 • Published • 105 -
CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
Paper • 2512.19535 • Published • 9
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 456 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 87 -
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models
Paper • 2512.19526 • Published • 10 -
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry
Paper • 2512.18314 • Published • 7 -
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 22
-
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Paper • 2508.13167 • Published • 129 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 227 -
PRewrite: Prompt Rewriting with Reinforcement Learning
Paper • 2401.08189 • Published -
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Paper • 2509.11543 • Published • 47
-
SGI-Bench Leaderboard
🥇7Scientific General Intelligence of LLMs/vLLMs
-
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Paper • 2512.16969 • Published • 105 -
InternScience/SGI-DeepResearch
Viewer • Updated • 318 • 419 • 3 -
InternScience/SGI-IdeaGeneration
Viewer • Updated • 315 • 438 • 2
-
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 87 -
QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models
Paper • 2512.19526 • Published • 10 -
MatSpray: Fusing 2D Material World Knowledge on 3D Geometry
Paper • 2512.18314 • Published • 7 -
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 22
-
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Paper • 2511.21678 • Published • 11 -
QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation
Paper • 2512.19134 • Published • 31 -
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Paper • 2512.16969 • Published • 105 -
CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
Paper • 2512.19535 • Published • 9
-
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Paper • 2508.13167 • Published • 129 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 227 -
PRewrite: Prompt Rewriting with Reinforcement Learning
Paper • 2401.08189 • Published -
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Paper • 2509.11543 • Published • 47
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 456 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88