Xinyu Yang

Hanyuezhuohua

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 2 days ago • 43

upvoted a paper about 3 hours ago

Reliable and Responsible Foundation Models: A Comprehensive Survey

Paper • 2602.08145 • Published 6 days ago • 7

upvoted 2 papers 5 days ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published 8 days ago • 41

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published 7 days ago • 246

upvoted a paper 8 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 9 days ago • 221

upvoted 3 papers 18 days ago

Learning to Discover at Test Time

Paper • 2601.16175 • Published 19 days ago • 41

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published 19 days ago • 84

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published 19 days ago • 51

upvoted a paper 21 days ago

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published 28 days ago • 39

upvoted 4 papers about 1 month ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published Dec 31, 2025 • 64

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published Dec 29, 2025 • 98

End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published Dec 29, 2025 • 24

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 304

upvoted 3 papers about 2 months ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 86

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 43

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 78

upvoted 4 papers 3 months ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published Nov 25, 2025 • 40

Training Foundation Models on a Full-Stack AMD Platform: Compute, Networking, and System Design

Paper • 2511.17127 • Published Nov 21, 2025 • 3

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 108

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published Nov 10, 2025 • 16