view article Article TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval 6 days ago • 18
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published 9 days ago • 54
TurkColBERT: Turkish Late-Interaction Models Collection TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval • 7 items • Updated 13 days ago • 5
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 8 days ago • 73
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 8 days ago • 119
Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models Paper • 2506.06006 • Published Jun 6 • 14
Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework Paper • 2511.13189 • Published 24 days ago • 38
Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs Paper • 2511.17220 • Published 19 days ago • 17
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8 • 10
view article Article Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms 21 days ago • 34
Granite Time Series Models Collection A collection of time series models trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 23 days ago • 42
Chronos Models & Datasets Collection Collection of artifacts related to Chronos pretrained models for time series forecasting. • 16 items • Updated 6 days ago • 52
view article Article Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks +2 20 days ago • 22
AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models Paper • 2511.14295 • Published 23 days ago • 71