4 15

Jiahao Meng

marinero4972

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

updated a dataset 27 days ago

marinero4972/Open-o3-Video

authored a paper about 2 months ago

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

View all activity

Organizations

upvoted a paper 26 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 27 days ago • 194

updated a dataset 27 days ago

marinero4972/Open-o3-Video

Preview • Updated 27 days ago • 356 • 6

authored 2 papers about 2 months ago

DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World

Paper • 2506.24102 • Published Jun 30

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55

commented a paper about 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55 •

upvoted a paper about 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55

published a dataset about 2 months ago

marinero4972/Open-o3-Video

Preview • Updated 27 days ago • 356 • 6

published a model about 2 months ago

marinero4972/Open-o3-Video

8B • Updated Oct 23 • 169 • 4

updated a model about 2 months ago

marinero4972/Open-o3-Video

8B • Updated Oct 23 • 169 • 4

authored a paper about 2 months ago

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21 • 36

upvoted a paper about 2 months ago

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21 • 36

upvoted a collection about 2 months ago

Qwen3-VL

Collection

37 items • Updated Nov 1 • 491

upvoted 2 papers 5 months ago

Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology

Paper • 2507.07999 • Published Jul 10 • 49

VMoBA: Mixture-of-Block Attention for Video Diffusion Models

Paper • 2506.23858 • Published Jun 30 • 32

authored a paper 6 months ago

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Paper • 2506.07971 • Published Jun 9 • 5

upvoted a paper 6 months ago

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Paper • 2506.07971 • Published Jun 9 • 5

commented a paper 6 months ago

CyberV: Cybernetics for Test-time Scaling in Video Understanding

Paper • 2506.07971 • Published Jun 9 • 5 •

published a dataset 6 months ago

marinero4972/CyberV_ASR

Viewer • Updated May 23 • 10.6k • 151

updated a dataset 7 months ago

marinero4972/CyberV_ASR

Viewer • Updated May 23 • 10.6k • 151

authored a paper 7 months ago

On Path to Multimodal Generalist: General-Level and General-Bench

Paper • 2505.04620 • Published May 7 • 82

Jiahao Meng

AI & ML interests

Recent Activity

Organizations

marinero4972's activity