bubbliiiing commited on
Commit
fbc087f
·
1 Parent(s): 3ec35b5

Update Weights

Browse files
Files changed (47) hide show
  1. README.md +101 -0
  2. Z-Image-Turbo-Fun-Controlnet-Union-2.0.safetensors +3 -0
  3. asset/canny.jpg +3 -0
  4. asset/depth.jpg +3 -0
  5. asset/hed.jpg +3 -0
  6. asset/inpaint.jpg +3 -0
  7. asset/mask.jpg +3 -0
  8. asset/pose.jpg +3 -0
  9. asset/pose2.jpg +3 -0
  10. asset/pose3.jpg +3 -0
  11. results/canny.png +3 -0
  12. results/depth.png +3 -0
  13. results/hed.png +3 -0
  14. results/pose.png +3 -0
  15. results/pose2.png +3 -0
  16. results/pose3.png +3 -0
  17. results/pose_inpaint.png +3 -0
  18. results/scale_test/10_scale_0.65.png +3 -0
  19. results/scale_test/10_scale_0.70.png +3 -0
  20. results/scale_test/10_scale_0.75.png +3 -0
  21. results/scale_test/10_scale_0.8.png +3 -0
  22. results/scale_test/10_scale_0.9.png +3 -0
  23. results/scale_test/10_scale_1.0.png +3 -0
  24. results/scale_test/20_scale_0.65.png +3 -0
  25. results/scale_test/20_scale_0.70.png +3 -0
  26. results/scale_test/20_scale_0.75.png +3 -0
  27. results/scale_test/20_scale_0.8.png +3 -0
  28. results/scale_test/20_scale_0.9.png +3 -0
  29. results/scale_test/20_scale_1.0.png +3 -0
  30. results/scale_test/30_scale_0.65.png +3 -0
  31. results/scale_test/30_scale_0.70.png +3 -0
  32. results/scale_test/30_scale_0.75.png +3 -0
  33. results/scale_test/30_scale_0.8.png +3 -0
  34. results/scale_test/30_scale_0.9.png +3 -0
  35. results/scale_test/30_scale_1.0.png +3 -0
  36. results/scale_test/40_scale_0.65.png +3 -0
  37. results/scale_test/40_scale_0.70.png +3 -0
  38. results/scale_test/40_scale_0.75.png +3 -0
  39. results/scale_test/40_scale_0.8.png +3 -0
  40. results/scale_test/40_scale_0.9.png +3 -0
  41. results/scale_test/40_scale_1.0.png +3 -0
  42. results/scale_test/9_scale_0.65.png +3 -0
  43. results/scale_test/9_scale_0.70.png +3 -0
  44. results/scale_test/9_scale_0.75.png +3 -0
  45. results/scale_test/9_scale_0.8.png +3 -0
  46. results/scale_test/9_scale_0.9.png +3 -0
  47. results/scale_test/9_scale_1.0.png +3 -0
README.md CHANGED
@@ -1,3 +1,104 @@
1
  ---
2
  license: apache-2.0
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ library_name: videox_fun
4
  ---
5
+
6
+ # Z-Image-Turbo-Fun-Controlnet-Union
7
+
8
+ [![Github](https://img.shields.io/badge/🎬%20Code-Github-blue)](https://github.com/aigc-apps/VideoX-Fun)
9
+
10
+ ## Model Features
11
+ - This ControlNet is added on 6 blocks.
12
+ - The model was trained from scratch for 10,000 steps on a dataset of 1 million high-quality images covering both general and human-centric content. Training was performed at 1328 resolution using BFloat16 precision, with a batch size of 64, a learning rate of 2e-5, and a text dropout ratio of 0.10.
13
+ - It supports multiple control conditions—including Canny, HED, Depth, Pose and MLSD can be used like a standard ControlNet.
14
+ - You can adjust control_context_scale for stronger control and better detail preservation. For better stability, we highly recommend using a detailed prompt. The optimal range for control_context_scale is from 0.65 to 0.80.
15
+
16
+ ## TODO
17
+ - [ ] Train on more data and for more steps.
18
+ - [ ] Support inpaint mode.
19
+
20
+ ## Results
21
+
22
+ <table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
23
+ <tr>
24
+ <td>Pose</td>
25
+ <td>Output</td>
26
+ </tr>
27
+ <tr>
28
+ <td><img src="asset/pose2.jpg" width="100%" /></td>
29
+ <td><img src="results/pose2.png" width="100%" /></td>
30
+ </tr>
31
+ </table>
32
+
33
+ <table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
34
+ <tr>
35
+ <td>Pose</td>
36
+ <td>Output</td>
37
+ </tr>
38
+ <tr>
39
+ <td><img src="asset/pose.jpg" width="100%" /></td>
40
+ <td><img src="results/pose.png" width="100%" /></td>
41
+ </tr>
42
+ </table>
43
+
44
+ <table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
45
+ <tr>
46
+ <td>Canny</td>
47
+ <td>Output</td>
48
+ </tr>
49
+ <tr>
50
+ <td><img src="asset/canny.jpg" width="100%" /></td>
51
+ <td><img src="results/canny.png" width="100%" /></td>
52
+ </tr>
53
+ </table>
54
+
55
+ <table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
56
+ <tr>
57
+ <td>HED</td>
58
+ <td>Output</td>
59
+ </tr>
60
+ <tr>
61
+ <td><img src="asset/hed.jpg" width="100%" /></td>
62
+ <td><img src="results/hed.png" width="100%" /></td>
63
+ </tr>
64
+ </table>
65
+
66
+ <table border="0" style="width: 100%; text-align: left; margin-top: 20px;">
67
+ <tr>
68
+ <td>Depth</td>
69
+ <td>Output</td>
70
+ </tr>
71
+ <tr>
72
+ <td><img src="asset/depth.jpg" width="100%" /></td>
73
+ <td><img src="results/depth.png" width="100%" /></td>
74
+ </tr>
75
+ </table>
76
+
77
+ ## Inference
78
+ Go to the VideoX-Fun repository for more details.
79
+
80
+ Please clone the VideoX-Fun repository and create the required directories:
81
+
82
+ ```sh
83
+ # Clone the code
84
+ git clone https://github.com/aigc-apps/VideoX-Fun.git
85
+
86
+ # Enter VideoX-Fun's directory
87
+ cd VideoX-Fun
88
+
89
+ # Create model directories
90
+ mkdir -p models/Diffusion_Transformer
91
+ mkdir -p models/Personalized_Model
92
+ ```
93
+
94
+ Then download the weights into models/Diffusion_Transformer and models/Personalized_Model.
95
+
96
+ ```
97
+ 📦 models/
98
+ ├── 📂 Diffusion_Transformer/
99
+ │ └── 📂 Z-Image-Turbo/
100
+ ├── 📂 Personalized_Model/
101
+ │ └── 📦 Z-Image-Turbo-Fun-Controlnet-Union.safetensors
102
+ ```
103
+
104
+ Then run the file `examples/z_image_fun/predict_t2i_control.py`.
Z-Image-Turbo-Fun-Controlnet-Union-2.0.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:07e5bf88a2e5aeea4b7dfc1f4897bdd4064ae29e50ccd3278d93aff8a070147a
3
+ size 6712485600
asset/canny.jpg ADDED

Git LFS Details

  • SHA256: 800790ae2e890e99b75dc1fc0a05142d22dbcdd9a961d2bc15222a4356683723
  • Pointer size: 131 Bytes
  • Size of remote file: 278 kB
asset/depth.jpg ADDED

Git LFS Details

  • SHA256: 6e2ba1022bb71d026c764b12e7d6c67a233cfa4c6836616f618a878764fe7a7c
  • Pointer size: 131 Bytes
  • Size of remote file: 106 kB
asset/hed.jpg ADDED

Git LFS Details

  • SHA256: c10f91fe342b439d1e99fe703e313aa09315b59cf7362c43e2e42910f7c681d7
  • Pointer size: 131 Bytes
  • Size of remote file: 188 kB
asset/inpaint.jpg ADDED

Git LFS Details

  • SHA256: 05cae403843d306d59d43854d04abeb830fd6fd66b7898b52ef94ee4f5fc849b
  • Pointer size: 131 Bytes
  • Size of remote file: 583 kB
asset/mask.jpg ADDED

Git LFS Details

  • SHA256: c2012f7a9ed8eeefc75df2e7606eb1457c74d5a05a5f3a8d2c3ee6b287624d23
  • Pointer size: 130 Bytes
  • Size of remote file: 11.4 kB
asset/pose.jpg ADDED

Git LFS Details

  • SHA256: c3543f29a838b77933dc439f8520c5eff1bb2075315afbe6eb4b309c477a31f0
  • Pointer size: 130 Bytes
  • Size of remote file: 43.5 kB
asset/pose2.jpg ADDED

Git LFS Details

  • SHA256: 82005b3e813d714e3a4cf8dddbeddad5047978d6aca78c6a121ad1e7c0ec4b4e
  • Pointer size: 130 Bytes
  • Size of remote file: 94.6 kB
asset/pose3.jpg ADDED

Git LFS Details

  • SHA256: a12c26c86b54371438ca7f5a134a158a81f7e7b99aa7c4e699ee161e95cd67e4
  • Pointer size: 130 Bytes
  • Size of remote file: 65.9 kB
results/canny.png ADDED

Git LFS Details

  • SHA256: 1a0537a1a887163841c851e06bfef32fdfa41f79312f43f0b136bfb3e649429f
  • Pointer size: 132 Bytes
  • Size of remote file: 2.38 MB
results/depth.png ADDED

Git LFS Details

  • SHA256: 46e4276a3c415b2760e8b81609f6fc656f10f0a676c8c415a834013b366b9d59
  • Pointer size: 132 Bytes
  • Size of remote file: 1.43 MB
results/hed.png ADDED

Git LFS Details

  • SHA256: 816e11e44e8d659fd2c6cb2c862c37f942d329001cf787fef888fd2849f303ea
  • Pointer size: 132 Bytes
  • Size of remote file: 1.53 MB
results/pose.png ADDED

Git LFS Details

  • SHA256: 866ecbb5c65e6ea7f7f540b205c4c7261e2e7685f6e29cac190ad56ee87ddb9b
  • Pointer size: 132 Bytes
  • Size of remote file: 1.79 MB
results/pose2.png ADDED

Git LFS Details

  • SHA256: 70a170110ea25ae0cf1a6057dcdb31748c12e8ed1c98f7d51d98503a1a03ad54
  • Pointer size: 132 Bytes
  • Size of remote file: 1.78 MB
results/pose3.png ADDED

Git LFS Details

  • SHA256: d2127491e33c0da4cc361de1dac9148dff237129828df89ffdeb63bdf385edf5
  • Pointer size: 132 Bytes
  • Size of remote file: 2.13 MB
results/pose_inpaint.png ADDED

Git LFS Details

  • SHA256: bfe302223041787d3b678f49ac5c459e2fdda74bd25fa6ef15714dc32c1d6cb0
  • Pointer size: 132 Bytes
  • Size of remote file: 1.83 MB
results/scale_test/10_scale_0.65.png ADDED

Git LFS Details

  • SHA256: aa8cdf5bcd0b1f936b1e9695bc26548c4a240bdbef1f8e622f09c0ddf313242b
  • Pointer size: 132 Bytes
  • Size of remote file: 2.15 MB
results/scale_test/10_scale_0.70.png ADDED

Git LFS Details

  • SHA256: 2016190661aafae44a52751a1f752a709f02c0c81407e6bc956097c3cc743a80
  • Pointer size: 132 Bytes
  • Size of remote file: 2.16 MB
results/scale_test/10_scale_0.75.png ADDED

Git LFS Details

  • SHA256: 66f912f78bae0f8eb01c4c49846a0e5bc92e8e1b6cc88c019f59c05162c6e601
  • Pointer size: 132 Bytes
  • Size of remote file: 2.18 MB
results/scale_test/10_scale_0.8.png ADDED

Git LFS Details

  • SHA256: 95f3913a42663fc53e5936f5dd28976401b70f37407b2909cd03bc17e24c2b8e
  • Pointer size: 132 Bytes
  • Size of remote file: 2.2 MB
results/scale_test/10_scale_0.9.png ADDED

Git LFS Details

  • SHA256: 8904d407efe6562b24152a0e0d29a8af14d80f8c0102058d9bbfd1a3251e420e
  • Pointer size: 132 Bytes
  • Size of remote file: 2.21 MB
results/scale_test/10_scale_1.0.png ADDED

Git LFS Details

  • SHA256: 7f2bf10c9b1eb44c65e29567607e3d5fb8f90c2f2e2061e2b8c363301c738dda
  • Pointer size: 132 Bytes
  • Size of remote file: 2.22 MB
results/scale_test/20_scale_0.65.png ADDED

Git LFS Details

  • SHA256: e549339b10d7f6138a656a9cbac7fb9e088cf0c37fe001cd56d489b04f7d33ee
  • Pointer size: 132 Bytes
  • Size of remote file: 2.13 MB
results/scale_test/20_scale_0.70.png ADDED

Git LFS Details

  • SHA256: 08038cbea3c1e701a791e262cf7defae8b665498f480ca259b7474fa19dbe7a0
  • Pointer size: 132 Bytes
  • Size of remote file: 2.14 MB
results/scale_test/20_scale_0.75.png ADDED

Git LFS Details

  • SHA256: 3c4d7cd6ad1f4e52deaa069de3b0d16804d25c9331ba21c4df407a32db3ec858
  • Pointer size: 132 Bytes
  • Size of remote file: 2.15 MB
results/scale_test/20_scale_0.8.png ADDED

Git LFS Details

  • SHA256: e8210ef18f7ca19e1eb03503dc5c149f5e4973b078626e2c1fbc6bbe16bc36c5
  • Pointer size: 132 Bytes
  • Size of remote file: 2.16 MB
results/scale_test/20_scale_0.9.png ADDED

Git LFS Details

  • SHA256: 598ef0db4d5324915c4cd02fc099ae2b1701d8da2166bc5a6da7fc22a9518acb
  • Pointer size: 132 Bytes
  • Size of remote file: 2.19 MB
results/scale_test/20_scale_1.0.png ADDED

Git LFS Details

  • SHA256: 1c0bfdd3c2076beaa21ec740c1af135f0c99436bab1fdc804c316e6bc9e80216
  • Pointer size: 132 Bytes
  • Size of remote file: 2.19 MB
results/scale_test/30_scale_0.65.png ADDED

Git LFS Details

  • SHA256: 8eace9b2714c30cdbadaded1983b280c6c854f057fd7d5b9cf0de1b986abdfaa
  • Pointer size: 132 Bytes
  • Size of remote file: 2.12 MB
results/scale_test/30_scale_0.70.png ADDED

Git LFS Details

  • SHA256: b17a8e81ba6f62ac9488f7da8b458770be5fd1a10180b9638449d8b2d74e1321
  • Pointer size: 132 Bytes
  • Size of remote file: 2.12 MB
results/scale_test/30_scale_0.75.png ADDED

Git LFS Details

  • SHA256: ec7b5a85857480e1ca3df622492fb7c5acebf769cfbf27c3ec70b8c3a245a169
  • Pointer size: 132 Bytes
  • Size of remote file: 2.12 MB
results/scale_test/30_scale_0.8.png ADDED

Git LFS Details

  • SHA256: 0e54cb01d22dbdfbefbe8faf4516e506e0cedd92df586ab03858f05357cea777
  • Pointer size: 132 Bytes
  • Size of remote file: 2.13 MB
results/scale_test/30_scale_0.9.png ADDED

Git LFS Details

  • SHA256: 5b256bfed6a6ae2460376b431104d79aaf2af1f75ee2ebac0a1f70e169b94dd8
  • Pointer size: 132 Bytes
  • Size of remote file: 2.15 MB
results/scale_test/30_scale_1.0.png ADDED

Git LFS Details

  • SHA256: 970b1138943e1d9fd13550f22598ff3e4ce516b61837e7b168c50156284ed7aa
  • Pointer size: 132 Bytes
  • Size of remote file: 2.16 MB
results/scale_test/40_scale_0.65.png ADDED

Git LFS Details

  • SHA256: d27e8133f1a69a5fa1a3c001caa2b3234dc3304523038cf41f7844ba0eff6454
  • Pointer size: 132 Bytes
  • Size of remote file: 2.12 MB
results/scale_test/40_scale_0.70.png ADDED

Git LFS Details

  • SHA256: e7421dfa71ed9ffddc2f58f1168f3f4ac6976a81e3f518a5ce05b4dd1d6b67a0
  • Pointer size: 132 Bytes
  • Size of remote file: 2.12 MB
results/scale_test/40_scale_0.75.png ADDED

Git LFS Details

  • SHA256: d2127491e33c0da4cc361de1dac9148dff237129828df89ffdeb63bdf385edf5
  • Pointer size: 132 Bytes
  • Size of remote file: 2.13 MB
results/scale_test/40_scale_0.8.png ADDED

Git LFS Details

  • SHA256: a548a2520d80ed672d9a7ef50e4cb41ec1e63ebe31458e58f3391cfcf8372dab
  • Pointer size: 132 Bytes
  • Size of remote file: 2.14 MB
results/scale_test/40_scale_0.9.png ADDED

Git LFS Details

  • SHA256: a727b7e2b08151977a053faab5bd4182ce2a03db3f0e1ae227e43e3df3e221d3
  • Pointer size: 132 Bytes
  • Size of remote file: 2.16 MB
results/scale_test/40_scale_1.0.png ADDED

Git LFS Details

  • SHA256: c535592ce3fb97dd6f2f8bf2875a7668138f653c2c17bc014568b36bd51e9a39
  • Pointer size: 132 Bytes
  • Size of remote file: 2.16 MB
results/scale_test/9_scale_0.65.png ADDED

Git LFS Details

  • SHA256: 02631d76244f6a8bedde6bed87fddcb3833534703f7b5149d9ab9c363cb98156
  • Pointer size: 132 Bytes
  • Size of remote file: 2.16 MB
results/scale_test/9_scale_0.70.png ADDED

Git LFS Details

  • SHA256: 8a8d00ffcf0f394cc42d9f7db03ff6634e6722eca938469cd058c193ac8c4fda
  • Pointer size: 132 Bytes
  • Size of remote file: 2.17 MB
results/scale_test/9_scale_0.75.png ADDED

Git LFS Details

  • SHA256: 5e9f2069af7b2a3765d678e19ac14b155f752cf703b1b2f98cc32a55597e6a84
  • Pointer size: 132 Bytes
  • Size of remote file: 2.18 MB
results/scale_test/9_scale_0.8.png ADDED

Git LFS Details

  • SHA256: 935055642116ced2719d7ac1cc108910e3219fbd568ef5fe6e4d203d03497be8
  • Pointer size: 132 Bytes
  • Size of remote file: 2.21 MB
results/scale_test/9_scale_0.9.png ADDED

Git LFS Details

  • SHA256: 4104901f29ede4e3eb99fee30c670be709902e31f99ebba07a6c1e2f9b578bf5
  • Pointer size: 132 Bytes
  • Size of remote file: 2.21 MB
results/scale_test/9_scale_1.0.png ADDED

Git LFS Details

  • SHA256: 6be3cdbcd98a23155b13ed415cdda58c58d73a1035dfd612304b12c6e2134694
  • Pointer size: 132 Bytes
  • Size of remote file: 2.22 MB