Xiaoyu Tan's picture

5 13

Xiaoyu Tan

WIlliam1900

·

https://scholar.google.com/citations?user=ftq5rBYAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

HuggingFaceTB/smol-training-playbook

upvoted an article about 1 month ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

upvoted an article about 2 months ago

Gaia2 and ARE: Empowering the community to study agents

View all activity

Organizations

upvoted an article about 1 month ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30

•

26

upvoted an article about 2 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

+7

Sep 22

•

120

upvoted a paper about 2 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9 • 44

upvoted a paper 2 months ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26 • 29

upvoted a paper almost 2 years ago

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 40