arxiv:2511.13524
Xiaoji Zheng
Student-Xiaoji
AI & ML interests
None yet
Recent Activity
upvoted a paper about 17 hours ago
MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning
upvoted a paper 4 days ago
Efficient Exploration at Scale
upvoted a paper 4 days ago
GigaWorld-Policy: An Efficient Action-Centered World-Action Model
Organizations
None yet