Ali WALEED's picture

1 20 3

Ali WALEED

ali-hagrassy

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 15 days ago

ACON: Optimizing Context Compression for Long-horizon LLM Agents

liked a Space 6 months ago

fishaudio/s1-mini

upvoted a paper 6 months ago

Ovis2.5 Technical Report

View all activity

Organizations

upvoted a paper 15 days ago

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1, 2025 • 34

upvoted a paper 6 months ago

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15, 2025 • 111

upvoted 2 papers 7 months ago

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1, 2025 • 56

The Prompt Report: A Systematic Survey of Prompting Techniques

Paper • 2406.06608 • Published Jun 6, 2024 • 68

upvoted 6 papers 8 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31, 2025 • 301

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8, 2025 • 185

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 326

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 273

Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model

Paper • 2505.17894 • Published May 23, 2025 • 220

upvoted an article 9 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

•

312

upvoted a paper 11 months ago

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 47

upvoted 8 papers 12 months ago

The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer

Paper • 2502.15631 • Published Feb 21, 2025 • 9

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19, 2025 • 213

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published Jan 30, 2025 • 20

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 159

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

Paper • 2502.12170 • Published Feb 13, 2025 • 12

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published Feb 17, 2025 • 53

Phantom: Subject-consistent video generation via cross-modal alignment

Paper • 2502.11079 • Published Feb 16, 2025 • 59

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published Feb 18, 2025 • 38