ACON: Optimizing Context Compression for Long-horizon LLM Agents Paper โข 2510.00615 โข Published Oct 1, 2025 โข 34
The Prompt Report: A Systematic Survey of Prompting Techniques Paper โข 2406.06608 โข Published Jun 6, 2024 โข 68
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper โข 2504.01990 โข Published Mar 31, 2025 โข 301
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper โข 2505.04921 โข Published May 8, 2025 โข 185
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper โข 2506.13585 โข Published Jun 16, 2025 โข 273
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model Paper โข 2505.17894 โข Published May 23, 2025 โข 220
view article Article ColPali: Efficient Document Retrieval with Vision Language Models ๐ Jul 5, 2024 โข 312
Robust Speech Recognition via Large-Scale Weak Supervision Paper โข 2212.04356 โข Published Dec 6, 2022 โข 47
The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer Paper โข 2502.15631 โข Published Feb 21, 2025 โข 9
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Paper โข 2501.18511 โข Published Jan 30, 2025 โข 20
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper โข 2412.13663 โข Published Dec 18, 2024 โข 159
MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections Paper โข 2502.12170 โข Published Feb 13, 2025 โข 12
Continuous Diffusion Model for Language Modeling Paper โข 2502.11564 โข Published Feb 17, 2025 โข 53
Phantom: Subject-consistent video generation via cross-modal alignment Paper โข 2502.11079 โข Published Feb 16, 2025 โข 59
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper โข 2502.13145 โข Published Feb 18, 2025 โข 38