SassyDiffusion committed on
Commit
a1a6427
·
verified ·
1 Parent(s): cd7cc8f

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ *.png filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,81 @@
+ ---
+ license: apache-2.0
+ library_name: seedvr
+ pipeline_tag: video-to-video
+ ---
+ <div align="center">
+ <img src="assets/seedvr_logo.png" alt="SeedVR" width="400"/>
+ </div>
+
+
+ # SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training
+ > [Jianyi Wang](https://iceclear.github.io), [Shanchuan Lin](https://scholar.google.com/citations?user=EDWUw7gAAAAJ&hl=en), [Zhijie Lin](https://scholar.google.com/citations?user=xXMj6_EAAAAJ&hl=en), [Yuxi Ren](https://scholar.google.com.hk/citations?user=C_6JH-IAAAAJ&hl=en), [Meng Wei](https://openreview.net/profile?id=~Meng_Wei11), [Zongsheng Yue](https://zsyoaoa.github.io/), [Shangchen Zhou](https://shangchenzhou.com/), [Hao Chen](https://haochen-rye.github.io/), [Yang Zhao](https://scholar.google.com/citations?user=uPmTOHAAAAAJ&hl=en), [Ceyuan Yang](https://ceyuan.me/), [Xuefeng Xiao](https://scholar.google.com/citations?user=CVkM9TQAAAAJ&hl=en), [Chen Change Loy](https://www.mmlab-ntu.com/person/ccloy/index.html), [Lu Jiang](http://www.lujiang.info/)
+
+ <p align="center">
+ <a href="https://iceclear.github.io/projects/seedvr2/">
+ <img
+ src="https://img.shields.io/badge/SeedVR2-Website-0A66C2?logo=safari&logoColor=white"
+ alt="SeedVR2 Website"
+ />
+ </a>
+ <a href="http://arxiv.org/abs/2506.05301">
+ <img
+ src="https://img.shields.io/badge/SeedVR2-Paper-red?logo=arxiv&logoColor=red"
+ alt="SeedVR2 Paper on ArXiv"
+ />
+ </a>
+ <a href="https://github.com/ByteDance-Seed/SeedVR">
+ <img
+ src="https://img.shields.io/badge/SeedVR2-Codebase-536af5?color=536af5&logo=github"
+ alt="SeedVR2 Codebase"
+ />
+ </a>
+ <a href="https://huggingface.co/collections/ByteDance-Seed/seedvr-6849deeb461c4e425f3e6f9e">
+ <img
+ src="https://img.shields.io/badge/SeedVR-Models-yellow?logo=huggingface&logoColor=yellow"
+ alt="SeedVR Models"
+ />
+ </a>
+ <a href="https://huggingface.co/spaces/ByteDance-Seed/SeedVR2-3B">
+ <img
+ src="https://img.shields.io/badge/SeedVR2-Space-orange?logo=huggingface&logoColor=yellow"
+ alt="SeedVR2 Space"
+ />
+ </a>
+ <a href="https://www.youtube.com/watch?v=tM8J-WhuAH0" target='_blank'>
+ <img
+ src="https://img.shields.io/badge/Demo%20Video-%23FF0000.svg?logo=YouTube&logoColor=white"
+ alt="SeedVR2 Video Demo on YouTube"
+ />
+ </a>
+ </p>
+ >
+ > Recent advances in diffusion-based video restoration (VR) demonstrate significant improvement in visual quality, yet yield a prohibitive computational cost during inference. While several distillation-based approaches have exhibited the potential of one-step image restoration, extending existing approaches to VR remains challenging and underexplored, due to the limited generation ability and poor temporal consistency, particularly when dealing with high-resolution video in real-world settings. In this work, we propose a one-step diffusion-based VR model, termed as SeedVR2, which performs adversarial VR training against real data. To handle the challenging high-resolution VR within a single step, we introduce several enhancements to both model architecture and training procedures. Specifically, an adaptive window attention mechanism is proposed, where the window size is dynamically adjusted to fit the output resolutions, avoiding window inconsistency observed under high-resolution VR using window attention with a predefined window size. To stabilize and improve the adversarial post-training towards VR, we further verify the effectiveness of a series of losses, including a proposed feature matching loss without significantly sacrificing training efficiency. Extensive experiments show that SeedVR2 can achieve comparable or even better performance compared with existing VR approaches in a single step.
+
+ <p align="center"><img src="assets/teaser.png" width="100%"></p>
+
+
+ ## 📮 Notice
+ **Limitations:** These are prototype models, and their performance may not perfectly align with the paper. Our methods are sometimes not robust to heavy degradations and very large motions, and they share some failure cases with existing methods, e.g., failing to fully remove the degradation or simply generating unpleasing details. Moreover, due to their strong generation ability, our methods tend to over-generate details on inputs with very light degradations, e.g., 720p AIGC videos, occasionally leading to oversharpened results.
+
+
+ ## ✍️ Citation
+
+ ```bibtex
+ @article{wang2025seedvr2,
+ title={SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training},
+ author={Wang, Jianyi and Lin, Shanchuan and Lin, Zhijie and Ren, Yuxi and Wei, Meng and Yue, Zongsheng and Zhou, Shangchen and Chen, Hao and Zhao, Yang and Yang, Ceyuan and Xiao, Xuefeng and Loy, Chen Change and Jiang, Lu},
+ journal={arXiv preprint arXiv:2506.05301},
+ year={2025}
+ }
+ @inproceedings{wang2025seedvr,
+ title={SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration},
+ author={Wang, Jianyi and Lin, Zhijie and Wei, Meng and Zhao, Yang and Yang, Ceyuan and Loy, Chen Change and Jiang, Lu},
+ booktitle={CVPR},
+ year={2025}
+ }
+ ```
+
+
+ ## 📜 License
+ SeedVR and SeedVR2 are licensed under the Apache License 2.0.
assets/seedvr_logo.png ADDED

Git LFS Details

  • SHA256: 9a08170e6ce79f87ad524d4ad5c083c8f1766f245d9dfea6b54fde5dca00f4a2
  • Pointer size: 131 Bytes
  • Size of remote file: 168 kB
assets/teaser.png ADDED

Git LFS Details

  • SHA256: 1aa53097c719d208642536f4684a174a260ab63ed4dda6515d83a2cb6b4f76ff
  • Pointer size: 132 Bytes
  • Size of remote file: 1.34 MB
ema_vae.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:82f29b83555489cf6a1a872b860f5539789f0a32bb7043cf5a5512d851dffe63
+ size 501325110
seedvr2_ema_7b.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3ce122f5cd403098e4ee9112cc043483e57c3bf41eac49101a0c46ca2e4729c9
+ size 16479335624
seedvr2_ema_7b_sharp.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8f9de333aadf537e4f2fd39a58f84fe3f293be8daa9be613ac53f3618efad4ff
+ size 16479335624
seedvr2_ema_7b_sharp_vae.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:834b699852c3f8c5078a17e38d848ff6e0d5714509bc38504dc22bd3303b5513
+ size 16980662822
seedvr2_ema_7b_vae.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2b799675bb65982eff05207cf4c0191804773e0f940b4ca8bf271707ecc7a993
+ size 16980662822
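
Each `.safetensors` entry in this commit is a Git LFS pointer: three lines giving the spec version, a SHA-256 `oid`, and the byte size of the real file. The following is a minimal sketch, not part of the commit, of how a downloaded weight file could be checked against its pointer. The names `LfsPointer`, `parse_lfs_pointer`, and `matches_pointer` are hypothetical helpers introduced for illustration; only Python's standard library is used.

```python
import hashlib
from pathlib import Path
from typing import NamedTuple


class LfsPointer(NamedTuple):
    """Fields of a Git LFS pointer file (hypothetical helper type)."""
    version: str
    sha256: str
    size: int


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the `version`, `oid`, and `size` lines of a pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value.strip()
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")
    return LfsPointer(fields["version"], digest, int(fields["size"]))


def matches_pointer(path: Path, pointer: LfsPointer, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file has the size and SHA-256 recorded in the pointer."""
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer.size:
        return False
    digest = hashlib.sha256()
    with path.open("rb") as f:
        # Hash in chunks so large weight files are never fully loaded into memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer.sha256


# Pointer contents copied from the ema_vae.safetensors entry in this commit.
EMA_VAE_POINTER = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:82f29b83555489cf6a1a872b860f5539789f0a32bb7043cf5a5512d851dffe63\n"
    "size 501325110\n"
)
```

Assuming the file has been downloaded to a local path of your choosing, `matches_pointer(Path("ema_vae.safetensors"), EMA_VAE_POINTER)` would report whether the local copy matches the pointer recorded here.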