7 3

weegw

hbkilwe

AI & ML interests

None yet

Recent Activity

liked a dataset 10 days ago
XXHStudyHard/EnvScaler-SFT-Traj-9K
upvoted a paper 2 months ago
Budget-Aware Tool-Use Enables Effective Agent Scaling
liked a dataset 3 months ago
CognitiveKernel/WebAggregatorQA

Organizations

None yet

upvoted a paper 2 months ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

Paper • 2511.17006 • Published Nov 21, 2025 • 32
upvoted 3 papers 3 months ago

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 30

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 115

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Paper • 2510.18855 • Published Oct 21, 2025 • 72
upvoted 3 papers 4 months ago

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

Paper • 2509.22601 • Published Sep 26, 2025 • 30

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 135

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 118