Weixi Feng

weixifeng
https://weixi-feng.github.io

AI & ML interests

Vision and Language, Multimodality, Diffusion Models

Organizations

None yet

upvoted a paper 4 months ago

Complex Logical Instruction Generation

Paper • 2508.09125 • Published Aug 12 • 40

upvoted 2 papers 8 months ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22 • 63

THOUGHTTERMINATOR: Benchmarking, Calibrating, and Mitigating Overthinking in Reasoning Models

Paper • 2504.13367 • Published Apr 17 • 26

upvoted 4 papers over 1 year ago

Make It Count: Text-to-Image Generation with an Accurate Number of Objects

Paper • 2406.10210 • Published Jun 14, 2024 • 78

TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation

Paper • 2406.08656 • Published Jun 12, 2024 • 8

T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback

Paper • 2405.18750 • Published May 29, 2024 • 21

MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos

Paper • 2406.08407 • Published Jun 12, 2024 • 28