DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published 15 days ago • 54
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published Oct 29 • 219 • 6
Running on CPU Upgrade Featured 2.56k The Smol Training Playbook 📚 2.56k The secrets to building world-class LLMs
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper • 2510.19808 • Published Oct 22 • 28