snowflakewang

SnowflakeWang

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

LATTICE: Democratize High-Fidelity 3D Generation at Scale

upvoted a paper 1 day ago

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

upvoted a paper 1 day ago

NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation

Organizations

None yet

upvoted 4 papers 1 day ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 4 days ago • 139

upvoted a paper 9 days ago

Video Generation Models Are Good Latent Reward Models

Paper • 2511.21541 • Published 11 days ago • 45

upvoted 2 papers 10 days ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published 17 days ago • 106

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published 18 days ago • 91

upvoted a paper 18 days ago

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks

Paper • 2511.15065 • Published 19 days ago • 74

upvoted a paper 19 days ago

A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space

Paper • 2511.10555 • Published 24 days ago • 60

upvoted a paper 20 days ago

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

Paper • 2511.13647 • Published 20 days ago • 70

upvoted a paper 24 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 26 days ago • 194

upvoted a paper 26 days ago

Robot Learning from a Physical World Model

Paper • 2511.07416 • Published 27 days ago • 29

upvoted a paper about 2 months ago

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 53

upvoted 6 papers 2 months ago

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26 • 32

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Paper • 2509.21245 • Published Sep 25 • 38

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24 • 98

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 51

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Paper • 2509.19296 • Published Sep 23 • 23

upvoted a paper 3 months ago

VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models

Paper • 2509.17985 • Published Sep 22 • 26
