When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning Paper • 2602.10560 • Published 4 days ago • 27
TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning Paper • 2601.21135 • Published 17 days ago • 8
Latent Thoughts Tuning: Bridging Context and Reasoning with Fused Information in Latent Tokens Paper • 2602.10229 • Published 4 days ago • 5
RubricHub: A Comprehensive and Highly Discriminative Rubric Dataset via Automated Coarse-to-Fine Generation Paper • 2601.08430 • Published Jan 13 • 59
TSRBench: A Comprehensive Multi-task Multi-modal Time Series Reasoning Benchmark for Generalist Models Paper • 2601.18744 • Published 19 days ago • 10
EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering for Enhanced Alignment and Reasoning Paper • 2601.03471 • Published Jan 6 • 7
QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmented Generation Paper • 2512.19134 • Published Dec 22, 2025 • 32
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs Paper • 2511.14159 • Published Nov 18, 2025 • 25
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models Paper • 2506.05176 • Published Jun 5, 2025 • 78
UniHGKR Collection The relevant datasets and model weights of the UniHGKR paper • 9 items • Updated Jun 12, 2025 • 1
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset Paper • 2505.09568 • Published May 14, 2025 • 99
Benchmarking Retrieval-Augmented Generation for Medicine Paper • 2402.13178 • Published Feb 20, 2024 • 8
UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers Paper • 2410.20163 • Published Oct 26, 2024 • 1
BM25S: Orders of magnitude faster lexical search via eager sparse scoring Paper • 2407.03618 • Published Jul 4, 2024 • 14