Harold Chen

Harold328

https://haroldchen19.github.io/

HaroldChen19

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper about 11 hours ago

Vero: An Open RL Recipe for General Visual Reasoning

upvoted a paper 1 day ago

Self-Distilled RLVR

upvoted a paper 5 days ago

EgoSim: Egocentric World Simulator for Embodied Interaction Generation

Organizations

None yet

authored a paper 26 days ago

DVD: Deterministic Video Depth Estimation with Generative Priors

Paper • 2603.12250 • Published 26 days ago • 26

submitted a paper to Daily Papers 26 days ago

DVD: Deterministic Video Depth Estimation with Generative Priors

Paper • 2603.12250 • Published 26 days ago • 26

authored 2 papers 2 months ago

Show, Don't Tell: Morphing Latent Reasoning into Image Generation

Paper • 2602.02227 • Published Feb 2 • 10

LoopViT: Scaling Visual ARC with Looped Transformers

Paper • 2602.02156 • Published Feb 2 • 12

submitted a paper to Daily Papers 2 months ago

LoopViT: Scaling Visual ARC with Looped Transformers

Paper • 2602.02156 • Published Feb 2 • 12

authored 2 papers 4 months ago

A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Paper • 2512.14442 • Published Dec 16, 2025 • 11

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Paper • 2511.23127 • Published Nov 28, 2025 • 44

authored a paper 5 months ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published Nov 17, 2025 • 44

authored 2 papers 6 months ago

Go with Your Gut: Scaling Confidence for Autoregressive Image Generation

Paper • 2509.26376 • Published Sep 30, 2025 • 10

FineQuest: Adaptive Knowledge-Assisted Sports Video Understanding via Agent-of-Thoughts Reasoning

Paper • 2509.11796 • Published Sep 15, 2025

authored a paper 7 months ago

Hierarchical Fine-grained Preference Optimization for Physically Plausible Video Generation

Paper • 2508.10858 • Published Aug 14, 2025

authored a paper 11 months ago

FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance

Paper • 2505.13437 • Published May 19, 2025 • 6

authored a paper 12 months ago

VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models

Paper • 2504.13122 • Published Apr 17, 2025 • 20

authored 2 papers about 1 year ago

Temporal Regularization Makes Your Video Generator Stronger

Paper • 2503.15417 • Published Mar 19, 2025 • 22

LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization

Paper • 2503.08619 • Published Mar 11, 2025 • 20

authored 5 papers over 1 year ago

SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization

Paper • 2501.01245 • Published Jan 2, 2025 • 5

Beyond Uncertainty: Evidential Deep Learning for Robust Video Temporal Grounding

Paper • 2408.16272 • Published Aug 29, 2024

UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web

Paper • 2310.18340 • Published Oct 22, 2023

CREST: Cross-modal Resonance through Evidential Deep Learning for Enhanced Zero-Shot Learning

Paper • 2404.09640 • Published Apr 15, 2024

OmniCreator: Self-Supervised Unified Generation with Universal Editing

Paper • 2412.02114 • Published Dec 3, 2024 • 14
