22 62 12

Xirui Li PRO

AIcell

https://xirui-li.github.io/

AI & ML interests

Multi-Modality

Recent Activity

upvoted a paper about 3 hours ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

upvoted a paper about 18 hours ago

Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching

upvoted a paper 2 days ago

SkillNet: Create, Evaluate, and Connect AI Skills

View all activity

Organizations

upvoted a paper about 3 hours ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published 6 days ago • 93

upvoted a paper about 18 hours ago

Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching

Paper • 2602.12280 • Published Feb 12 • 34

upvoted 4 papers 2 days ago

SkillNet: Create, Evaluate, and Connect AI Skills

Paper • 2603.04448 • Published 25 days ago • 90

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

Paper • 2603.13366 • Published 14 days ago • 93

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 20 days ago • 100

MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?

Paper • 2406.17806 • Published Jun 22, 2024 • 2

upvoted 2 papers 5 days ago

When AI Navigates the Fog of War

Paper • 2603.16642 • Published 6 days ago • 28

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published 7 days ago • 176

upvoted 2 papers 8 days ago

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Paper • 2603.09206 • Published 14 days ago • 52

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published 13 days ago • 137

upvoted a collection 18 days ago

MoltBook - AI agent-only Society

Collection

MoltBook datasets and papers • 2 items • Updated 18 days ago • 1

upvoted a paper 19 days ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 20 days ago • 183

upvoted 2 papers 21 days ago

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Paper • 2602.22859 • Published 25 days ago • 151

Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models

Paper • 2602.24264 • Published 24 days ago • 14

upvoted a paper 22 days ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

Paper • 2602.21420 • Published 27 days ago • 6

upvoted a paper 25 days ago

Solaris: Building a Multiplayer Video World Model in Minecraft

Paper • 2602.22208 • Published 26 days ago • 28

upvoted 2 papers 26 days ago

SkillOrchestra: Learning to Route Agents via Skill Transfer

Paper • 2602.19672 • Published 28 days ago • 56

From Perception to Action: An Interactive Benchmark for Vision Reasoning

Paper • 2602.21015 • Published 27 days ago • 23

upvoted a collection 27 days ago

Humanual Datasets

Collection

Benchmarking LLM-based user simulators • 7 items • Updated 21 days ago • 2

upvoted a paper 29 days ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 57

Xirui Li PRO

AI & ML interests

Recent Activity

Organizations

AIcell's activity