ProgEmu: Towards Interpretable Counterfactual Generation via Multimodal Autoregression

This repository contains the model weights for the MICCAI'25 paper: Towards Interpretable Counterfactual Generation via Multimodal Autoregression (arxiv, homepage, model). Supported by Shanghai Innovation Institute (SII).

Highlights 💡

  • Interpretable Counterfactual Generation (ICG): Jointly produces a counterfactual CXR image and a concise interpretation text that pinpoints progression-induced visual changes.
  • ICG-CXR Dataset: Over 10k longitudinal CXR quadruples (prior image, prompt, subsequent image, interpretation) that support the ICG task.
  • ProgEmu Framework: A single multimodal autoregressive transformer that generates visual and textual counterfactuals in one forward pass.
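
To make the quadruple structure concrete, here is a minimal Python sketch of how one ICG-CXR sample could be represented. The class name, field names, and the input/target split are assumptions made for illustration; they are not taken from the released dataset or code.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class ICGSample:
    """One ICG-CXR quadruple (hypothetical layout; names are illustrative)."""

    prior_image: Path       # earlier chest X-ray
    prompt: str             # text prompt paired with the prior image
    subsequent_image: Path  # later chest X-ray from the same longitudinal pair
    interpretation: str     # text describing the progression-induced changes

    def model_inputs(self) -> tuple[Path, str]:
        # Assumption: the model is conditioned on the prior image and the prompt.
        return self.prior_image, self.prompt

    def model_targets(self) -> tuple[Path, str]:
        # Assumption: the model produces the image and the interpretation jointly.
        return self.subsequent_image, self.interpretation


# Placeholder values, not real dataset entries.
sample = ICGSample(
    prior_image=Path("prior.png"),
    prompt="example prompt",
    subsequent_image=Path("subsequent.png"),
    interpretation="example interpretation",
)
```

Keeping the image and the interpretation together as one target pair mirrors the ICG task definition above, where both are generated jointly.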

Model size: 8B params (Safetensors, tensor type BF16)