Mahmud ElHuseyni 🇵🇸's picture

Mahmud ElHuseyni 🇵🇸

MElHuseyni

·

AI & ML interests

Computer Vision NLP Machine Learning

Recent Activity

upvoted an article 1 day ago

CircleGuardBench: New Standard for Evaluating AI Moderation Models

liked a dataset 2 days ago

sarulab-speech/yodas2_sidon

upvoted an article 3 days ago

Build Hallucination-Free RAG with Verbatim

View all activity

Organizations

upvoted an article 1 day ago

Article

CircleGuardBench: New Standard for Evaluating AI Moderation Models

May 7

•

57

upvoted an article 3 days ago

Article

Build Hallucination-Free RAG with Verbatim

22 days ago

•

7

upvoted an article 5 days ago

Article

TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval

6 days ago

•

18

upvoted a paper 6 days ago

PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published 9 days ago • 54

upvoted a collection 7 days ago

TurkColBERT: Turkish Late-Interaction Models

TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval • 7 items • Updated 13 days ago • 5

upvoted 2 collections 8 days ago

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 8 days ago • 73

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 8 days ago • 119

upvoted 2 papers 13 days ago

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published 15 days ago • 20

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 15 days ago • 113

upvoted a paper 15 days ago

Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models

Paper • 2506.06006 • Published Jun 6 • 14

upvoted a paper 16 days ago

Large Language Models Meet Extreme Multi-label Classification: Scaling and Multi-modal Framework

Paper • 2511.13189 • Published 24 days ago • 38

upvoted 3 papers 17 days ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published 20 days ago • 109

Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs

Paper • 2511.17220 • Published 19 days ago • 17

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

Paper • 2510.06961 • Published Oct 8 • 10

upvoted an article 17 days ago

Article

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

21 days ago

•

34

upvoted 2 collections 17 days ago

Granite Time Series Models

A collection of time series models trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 23 days ago • 42

Chronos Models & Datasets

Collection of artifacts related to Chronos pretrained models for time series forecasting. • 16 items • Updated 6 days ago • 52

upvoted an article 19 days ago

Article

Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks

+2

20 days ago

•

22

upvoted 2 papers 19 days ago

AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models

Paper • 2511.14295 • Published 23 days ago • 71

Detect Anything via Next Point Prediction

Paper • 2510.12798 • Published Oct 14 • 46