Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries Paper • 2409.12640 • Published Sep 19, 2024 • 4
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published Dec 15, 2025 • 111
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces Paper • 2602.03442 • Published Feb 3, 2026 • 21
Reasoning-Enhanced Large Language Models for Molecular Property Prediction Paper • 2510.10248 • Published Oct 11, 2025 • 2
Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents Paper • 2509.23040 • Published Sep 27, 2025 • 12
WildGraphBench: Benchmarking GraphRAG with Wild-Source Corpora Paper • 2602.02053 • Published Feb 2, 2026 • 41
Wiki Live Challenge: Challenging Deep Research Agents with Expert-Level Wikipedia Articles Paper • 2602.01590 • Published Feb 2, 2026 • 33
FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents Paper • 2602.01566 • Published Feb 2, 2026 • 52
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning Paper • 2601.21468 • Published Jan 29, 2026 • 25
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity Paper • 2511.15593 • Published Nov 19, 2025 • 59
Efficient Long-context Language Model Training by Core Attention Disaggregation Paper • 2510.18121 • Published Oct 20, 2025 • 124
AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator Paper • 2402.09742 • Published Feb 15, 2024 • 1
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery Paper • 2511.11257 • Published Nov 14, 2025 • 25
AION-1: Omnimodal Foundation Model for Astronomical Sciences Paper • 2510.17960 • Published Oct 20, 2025 • 30