Yantao Liu

RicardoL1u

https://scholar.google.com/citations?user=CKieAy4AAAAJ&hl=en

AI & ML interests

NLP

Recent Activity

liked a model about 1 month ago

Qwen/Qwen3.5-397B-A17B

new activity 5 months ago

THU-KEG/RM-Bench:Many chosen rows are truncated

upvoted a paper 6 months ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

liked a model about 1 month ago

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 16 days ago • 1.35M • 1.39k

New activity in THU-KEG/RM-Bench 5 months ago

Many chosen rows are truncated

#3 opened 6 months ago by AlexShengzhiMeta

upvoted a paper 6 months ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2, 2025 • 57

updated a dataset 9 months ago

THU-KEG/RM-Bench

Viewer • Updated Jul 12, 2025 • 1.33k • 1.8k • 10

commented a paper 10 months ago

Are Reasoning Models More Prone to Hallucination?

Paper • 2505.23646 • Published May 29, 2025 • 24

upvoted a paper 10 months ago

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19, 2025 • 83

upvoted a paper about 1 year ago

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26, 2025 • 23

updated a dataset about 1 year ago

THU-KEG/PairJudge-432K

Viewer • Updated Feb 19, 2025 • 432k • 26 • 1

updated a model about 1 year ago

THU-KEG/PairJudge-RM

8B • Updated Feb 19, 2025 • 1 • 1

upvoted a paper about 1 year ago

ADELIE: Aligning Large Language Models on Information Extraction

Paper • 2405.05008 • Published May 8, 2024 • 2

upvoted a collection about 1 year ago

OpenSAE-LLaMA-3.1-8B

Collection

OpenSAE checkpoints for LLaMA 3.1 8B base model • 38 items • Updated Jan 29, 2025 • 5

published a model about 1 year ago

THU-KEG/PairJudge-RM

8B • Updated Feb 19, 2025 • 1 • 1

published a dataset about 1 year ago

THU-KEG/PairJudge-432K

Viewer • Updated Feb 19, 2025 • 432k • 26 • 1

commented a paper about 1 year ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22, 2025 • 19

upvoted a paper about 1 year ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22, 2025 • 19

commented a paper about 1 year ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22, 2025 • 19

New activity in THU-KEG/RM-Bench over 1 year ago

Add link to paper

#2 opened over 1 year ago by nielsr

upvoted 2 papers over 1 year ago

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published Oct 21, 2024 • 17

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Paper • 2410.16184 • Published Oct 21, 2024 • 26

commented a paper over 1 year ago

RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Paper • 2410.16184 • Published Oct 21, 2024 • 26
