ZhimingMa

JimmyMa99

AI & ML interests

None yet

Recent Activity

authored a paper 21 days ago

HI-TransPA: Hearing Impairments Translation Personal Assistant

Paper • 2511.09915 • Published 26 days ago • 6

upvoted a paper 21 days ago

HI-TransPA: Hearing Impairments Translation Personal Assistant

Paper • 2511.09915 • Published 26 days ago • 6

commented on a paper 21 days ago

HI-TransPA: Hearing Impairments Translation Personal Assistant

Paper • 2511.09915 • Published 26 days ago • 6

upvoted a paper 2 months ago

Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs

Paper • 2510.01954 • Published Oct 2 • 12

commented on a paper 2 months ago

Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs

Paper • 2510.01954 • Published Oct 2 • 12

upvoted a paper 2 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

upvoted a paper 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

upvoted a paper 4 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 121

upvoted an article 4 months ago

Article

Mahjong: Where Grandmas Beat The Best LLMs

Feb 18

liked a model 5 months ago

moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Aug 18 • 150k • 325

upvoted 2 papers 5 months ago

LayerCake: Token-Aware Contrastive Decoding within Large Language Model Layers

Paper • 2507.04404 • Published Jul 6 • 21

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos

Paper • 2507.05675 • Published Jul 8 • 26

upvoted 3 papers 8 months ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published Mar 30 • 138

RIG: Synergizing Reasoning and Imagination in End-to-End Generalist Policy

Paper • 2503.24388 • Published Mar 31 • 29

TeleAntiFraud-28k: A Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection

Paper • 2503.24115 • Published Mar 31 • 11

commented on a paper 8 months ago

TeleAntiFraud-28k: A Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection

Paper • 2503.24115 • Published Mar 31 • 11

upvoted a paper 9 months ago

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 63

authored a paper 10 months ago

SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation

Paper • 2502.08168 • Published Feb 12 • 12

upvoted 2 papers 10 months ago

Language Models as Continuous Self-Evolving Data Engineers

Paper • 2412.15151 • Published Dec 19, 2024 • 2

SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation

Paper • 2502.08168 • Published Feb 12 • 12
