7 3

weegw

hbkilwe

AI & ML interests

None yet

Recent Activity

liked a dataset 10 days ago

XXHStudyHard/EnvScaler-SFT-Traj-9K

upvoted a paper 2 months ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

liked a dataset 3 months ago

CognitiveKernel/WebAggregatorQA

Organizations

None yet

upvoted a paper 2 months ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

Paper • 2511.17006 • Published Nov 21, 2025 • 32

upvoted 3 papers 3 months ago

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 30

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 115

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Paper • 2510.18855 • Published Oct 21, 2025 • 72

upvoted 3 papers 4 months ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 30

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 135

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 118