Reconstruction Alignment Improves Unified Multimodal Models Paper • 2509.07295 • Published Sep 8 • 40
LaViDa: A Large Diffusion Language Model for Multimodal Understanding Paper • 2505.16839 • Published May 22 • 12
LaViDa-1.0 Collection LArge VIsion-language Diffusion moDel with mAsking • 11 items • Updated May 26 • 7