You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving

Yifang Xu1*  Jiahao Cui1*  Feipeng Cai2*  Zhihao Zhu1  Hanlin Shang1  Shan Luan1 
Mingwang Xu1  Neng Zhang2  Yaoyi Li2  Jia Cai2  Siyu Zhu1 
1Fudan University  2Yinwang Intelligent Technology Co., Ltd 


πŸ“° News

  • 2026/02/01: πŸŽ‰πŸŽ‰πŸŽ‰ Release the pretrained models on Huggingface.
  • 2025/12/06: πŸŽ‰πŸŽ‰πŸŽ‰ Paper submitted on Arxiv.

πŸ“…οΈ Roadmap

Status Milestone ETA
βœ… Release the SFT and inference code 2025.12.19
βœ… Pretrained models on Huggingface 2026.02.01
πŸš€ Release the evaluation code TBD
πŸš€ Release the RL code TBD
πŸš€ Release the pre-processed training data TBD

πŸ“Έ Showcase

teaser

πŸ† Qualitative Results on NAVSIM

NAVSIM-v1 benchmark results

navsim-v1

NAVSIM-v2 benchmark results

navsim-v2

πŸ”§οΈ Framework

framework Our method takes as input a front-view image, a natural-language navigation command with a system prompt, and the ego-vehicle states, and outputs an 8-waypoint future trajectory spanning 4 seconds through parallel denoising. The model is first trained via supervised fine-tuning to learn accurate trajectory prediction. We then apply simulatorguided GRPO to further optimize closed-loop behavior. The GRPO reward function integrates safety constraints (collision avoidance, drivable-area compliance) with performance objectives (ego-progress, time-to-collision, comfort).

Quick Start

Installation

Clone the repo:

git clone https://github.com/fudan-generative-vision/WAM-Flow.git
cd WAM-Flow

Install dependencies:

conda create --name wam-flow python=3.10
conda activate wam-flow
pip install -r requirements.txt

Model Download

Download models using huggingface-cli:

pip install "huggingface_hub[cli]"
huggingface-cli download fudan-generative-ai/WAM-Flow --local-dir ./pretrained_model/wam-flow
huggingface-cli download LucasJinWang/FUDOKI --local-dir ./pretrained_model/fudoki

Inference

sh script/infer.sh

Training

sh script/sft_debug.sh

πŸ“ Citation

If you find our work useful for your research, please consider citing the paper:

@article{xu2025wam,
  title={WAM-Flow: Parallel Coarse-to-Fine Motion Planning via Discrete Flow Matching for Autonomous Driving},
  author={Xu, Yifang and Cui, Jiahao and Cai, Feipeng and Zhu, Zhihao and Shang, Hanlin and Luan, Shan and Xu, Mingwang and Zhang, Neng and Li, Yaoyi and Cai, Jia and others},
  journal={arXiv preprint arXiv:2512.06112},
  year={2025}
}

⚠️ Social Risks and Mitigations

The integration of Vision-Language-Action models into autonomous driving introduces ethical challenges, particularly regarding the opacity of neural decision-making and its impact on road safety. To mitigate these risks, it is imperative to implement explainable AI frameworks and robust safe protocols that ensure predictable vehicle behavior in long-tailed scenarios. Furthermore, addressing concerns over data privacy and public surveillance requires transparent data governance and rigorous de-identification practices. By prioritizing safety-critical alignment and ethical compliance, this research promotes the responsible development and deployment of VLA-based autonomous systems.

πŸ€— Acknowledgements

We gratefully acknowledge the contributors to the Recogdrive, Janus, FUDOKI and flow_matching repositories, whose commitment to open source has provided us with their excellent codebases and pretrained models.

Downloads last month
3
Safetensors
Model size
2B params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Paper for fudan-generative-ai/WAM-Flow