
Zhenhailong Wang

mikewang

https://mikewangwzhl.github.io/

AI & ML interests

NLP, Computer Vision

Recent Activity

upvoted a paper about 1 month ago

MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks

upvoted a paper about 1 month ago

EBT-Policy: Energy Unlocks Emergent Physical Reasoning Capabilities

upvoted a paper about 1 month ago

Scaling Latent Reasoning via Looped Language Models


Organizations

upvoted 3 papers about 1 month ago

upvoted a paper about 2 months ago

Multimodal Policy Internalization for Conversational Agents

Paper • 2510.09474 • Published Oct 10 • 4

commented on a paper about 2 months ago

Multimodal Policy Internalization for Conversational Agents

Paper • 2510.09474 • Published Oct 10 • 4

upvoted a paper 2 months ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published Sep 29 • 11

upvoted a paper 3 months ago

FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games

Paper • 2509.01052 • Published Sep 1 • 21

upvoted a paper 4 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 264

upvoted a paper 5 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8 • 47

commented on a paper 5 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8 • 47

upvoted a paper 5 months ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 69

New activity in mikewang/PVD-160K 6 months ago

Add image-to-text task category

#2 opened 6 months ago by nielsr

New activity in mikewang/PVD-160k-Mistral-7b 6 months ago

Add library name and pipeline tag

#1 opened 6 months ago by nielsr

published a model 7 months ago

mikewang/DyMU

Updated Apr 11

upvoted 2 papers 7 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 97

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 80

upvoted 2 papers 8 months ago

DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs

Paper • 2504.17040 • Published Apr 23 • 13

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 48

updated a model 8 months ago

mikewang/DyMU

Updated Apr 11

upvoted a paper 9 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 29
