Xiaoyu Tan

WIlliam1900

https://scholar.google.com/citations?user=ftq5rBYAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

HuggingFaceTB/smol-training-playbook

upvoted an article about 1 month ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

upvoted an article about 2 months ago

Gaia2 and ARE: Empowering the community to study agents

View all activity

Organizations

liked a Space about 1 month ago

The Smol Training Playbook

📚

2.54k

The secrets to building world-class LLMs

upvoted an article about 1 month ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Oct 30

•

upvoted an article about 2 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

Sep 22

•

120

upvoted a paper about 2 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9 • 44

upvoted a paper 2 months ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26 • 29

liked a Space 6 months ago

Reward Bench Leaderboard

📐

411

Display and analyze reward model evaluation results

liked 2 models 7 months ago

infly/INF-AZ-7B-0524

Image-to-Text • 8B • Updated May 25 • 31 • 3

infly/inf-o1-pi0

33B • Updated Apr 30 • 85 • 8

liked a Space 7 months ago

Open LLM Leaderboard

🏆

13.7k

Track, rank and evaluate open LLMs and chatbots

liked a dataset 8 months ago

Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs

Viewer • Updated Oct 3, 2024 • 61.5k • 265 • 6

liked a model 8 months ago

Goedel-LM/Goedel-Prover-DPO

7B • Updated Apr 22 • 3 • 4

liked a model 10 months ago

Goedel-LM/Goedel-Prover-SFT

7B • Updated Apr 18 • 173 • 28

liked 2 datasets 12 months ago

jxie/bridge_data_v2

Viewer • Updated Jan 1 • 53.2k • 501 • 2

jxu124/OpenX-Embodiment

Updated Oct 16, 2024 • 20.3k • 84

liked a dataset about 1 year ago

allenai/tulu-3-pref-personas-instruction-following

Viewer • Updated Nov 21, 2024 • 19.9k • 1.3k • 15

upvoted a paper almost 2 years ago

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 40

liked 2 datasets over 2 years ago

GAIR/lima

Viewer • Updated Jun 8, 2023 • 1.33k • 889 • 451

anon8231489123/ShareGPT_Vicuna_unfiltered

Updated Apr 12, 2023 • 34.7k • 832

Xiaoyu Tan

AI & ML interests

Recent Activity

Organizations

WIlliam1900's activity

The Smol Training Playbook

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Gaia2 and ARE: Empowering the community to study agents

Reward Bench Leaderboard

Open LLM Leaderboard