Robust Speech Recognition via Large-Scale Weak Supervision Paper • 2212.04356 • Published Dec 6, 2022 • 43
MotionStream: Real-Time Video Generation with Interactive Motion Controls Paper • 2511.01266 • Published Nov 3 • 27
view article Article The Pharmome Map: a comprehensive public dataset for drug-target interaction modeling 19 days ago • 11
view article Article Building for an Open Future - our new partnership with Google Cloud 25 days ago • 45
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 58
Common Pile v0.1 Collection All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6 • 38
view article Article Building a Healthcare Robot from Simulation to Deployment with NVIDIA Isaac Oct 29 • 27
ShieldGemma Collection ShieldGemma is a family of models for text and image content moderation. • 4 items • Updated Jul 10 • 10
Content moderation models and datasets - 2025 Collection Models and datasets that support automatic content moderation • 21 items • Updated 6 days ago • 3
MDGA Collection Make Diffusion Great Again. The resource list for Super Data Learners, Quokka, and OpenMoE 2. • 16 items • Updated Nov 4 • 8