Nanjing University

university

http://www.nju.edu.cn

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

leelin authored a paper 6 days ago

Vision-Language Models Can Self-Improve Reasoning via Reflection

leelin authored a paper 6 days ago

PaLMR: Towards Faithful Visual Reasoning via Multimodal Process Alignment

master-lan submitted a paper about 1 month ago

Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

View all activity

Papers

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

You Need an Encoder for Native Position-Independent Caching

View all Papers

authored 2 papers 6 days ago

Vision-Language Models Can Self-Improve Reasoning via Reflection

Paper • 2411.00855 • Published Oct 30, 2024 • 5

PaLMR: Towards Faithful Visual Reasoning via Multimodal Process Alignment

Paper • 2603.06652 • Published 16 days ago • 1

submitted a paper to Daily Papers about 1 month ago

You Need an Encoder for Native Position-Independent Caching

Paper • 2602.01519 • Published Feb 2

authored 4 papers 2 months ago

All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation

Paper • 2409.19660 • Published Sep 29, 2024

Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion

Paper • 2505.08281 • Published May 13, 2025 • 1

Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference

Paper • 2507.01608 • Published Jul 2, 2025 • 1

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Paper • 2601.03955 • Published Jan 7 • 3

submitted a paper to Daily Papers 2 months ago

Muses: Designing, Composing, Generating Nonexistent Fantasy 3D Creatures without Training

Paper • 2601.03256 • Published Jan 6 • 7

submitted a paper to Daily Papers 2 months ago

MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing

Paper • 2601.00204 • Published Jan 1 • 6

authored 2 papers 2 months ago

ProGuard: Towards Proactive Multimodal Safeguard

Paper • 2512.23573 • Published Dec 29, 2025 • 6

SafeRBench: A Comprehensive Benchmark for Safety Assessment in Large Reasoning Models

Paper • 2511.15169 • Published Nov 19, 2025 • 1

authored a paper 4 months ago

DiP: Taming Diffusion Models in Pixel Space

Paper • 2511.18822 • Published Nov 24, 2025 • 29

authored a paper 5 months ago

UltraHR-100K: Enhancing UHR Image Synthesis with A Large-Scale High-Quality Dataset

Paper • 2510.20661 • Published Oct 23, 2025 • 15

authored a paper 5 months ago

UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections

Paper • 2509.24817 • Published Sep 29, 2025 • 9

authored a paper 7 months ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46

authored a paper 8 months ago

Subject-Consistent and Pose-Diverse Text-to-Image Generation

Paper • 2507.08396 • Published Jul 11, 2025 • 16

authored a paper 10 months ago

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs

Paper • 2506.01674 • Published Jun 2, 2025 • 28

authored a paper 12 months ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published Mar 30, 2025 • 94

authored a paper 12 months ago

AutoLUT: LUT-Based Image Super-Resolution with Automatic Sampling and Adaptive Residual Learning

Paper • 2503.01565 • Published Mar 3, 2025 • 2

authored a paper about 1 year ago

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6, 2025 • 56