license: apache-2.0
base_model:
  - ByteDance-Seed/BAGEL-7B-MoT
pipeline_tag: any-to-any
library_name: bagel-mot
arxiv: 2507.14119

🥯 BAGEL-NHR-Edit

🌐 NHR Website | 📜 NHR Paper on arXiv | 🤗 NHR-Edit Dataset (part 1) | 🤗 NHR-Edit Dataset (part 2)

This repository hosts the weights of BAGEL (ByteDance-Seed/BAGEL-7B-MoT) fine-tuned on part 1 of the NHR-Edit dataset only. For installation, usage instructions, and further documentation, please visit the official BAGEL GitHub repository.

🛠️ Training Setup

We adapted the model parameter-efficiently with LoRA, applied to the attention and FFN projection layers of the generation expert.

LoRA parameters:

r = 16
lora_alpha = 16
dropout = 0.05
bias = "none"
target_modules = [
  "v_proj_moe_gen",
  "k_proj_moe_gen",
  "mlp_moe_gen.down_proj",
  "mlp_moe_gen.gate_proj",
  "q_proj_moe_gen",
  "mlp_moe_gen.up_proj",
  "o_proj_moe_gen"
]
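
To make the settings above easier to reason about, here is a minimal, dependency-free sketch. It is not the training code. `LoraSettings`, `is_targeted`, and `added_params` are hypothetical helpers introduced only for illustration, and the module paths in the usage note are invented examples, not names read from the BAGEL checkpoint. The sketch assumes the standard LoRA formulation (update scaled by `lora_alpha / r`) and suffix-based matching of `target_modules`, which is how the Hugging Face PEFT library resolves a list of target names. If PEFT was used (the source does not say), `dropout` would correspond to its `lora_dropout` field.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """Plain container mirroring the LoRA values listed above (hypothetical helper)."""

    r: int = 16
    lora_alpha: int = 16
    dropout: float = 0.05
    bias: str = "none"
    target_modules: tuple = (
        "v_proj_moe_gen",
        "k_proj_moe_gen",
        "mlp_moe_gen.down_proj",
        "mlp_moe_gen.gate_proj",
        "q_proj_moe_gen",
        "mlp_moe_gen.up_proj",
        "o_proj_moe_gen",
    )

    @property
    def scaling(self) -> float:
        # Standard LoRA multiplies the low-rank update B @ A by lora_alpha / r.
        return self.lora_alpha / self.r


def is_targeted(module_name: str, settings: LoraSettings) -> bool:
    """Return True if the name equals a target or ends with '.<target>'."""
    return any(
        module_name == target or module_name.endswith("." + target)
        for target in settings.target_modules
    )


def added_params(in_features: int, out_features: int, r: int) -> int:
    """Trainable parameters LoRA adds to one linear layer: A is r x in, B is out x r."""
    return r * (in_features + out_features)
```

With these values the scaling factor is `16 / 16 = 1.0`, so the low-rank update is added without extra rescaling. Under the suffix rule, a hypothetical path such as `layers.0.self_attn.q_proj_moe_gen` would be adapted, while `layers.0.self_attn.q_proj` or `layers.0.mlp.down_proj` would be left untouched, which is consistent with adapting only the generation expert.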

📊 Image Editing Metrics

Metrics for GEdit-Bench-EN:

| Model | GEdit-Bench-EN (SC) ↑ | GEdit-Bench-EN (PQ) ↑ | GEdit-Bench-EN (O) ↑ |
| --- | --- | --- | --- |
| BAGEL-7B-MoT | 7.983 | 6.570 | 6.921 |
| BAGEL-NHR-Edit | 8.067 | 6.881 | 7.115 |

Scoring model: gpt-4.1-2025-04-14 (with default temperature)
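
The size of the improvement on GEdit-Bench-EN can be read off the table with a few lines of arithmetic. The sketch below only re-uses the six scores reported above. `gains` is a hypothetical helper name, and rounding to three and two decimals is a presentation choice, not part of the benchmark.

```python
# Scores copied from the GEdit-Bench-EN table above.
BASE = {"SC": 7.983, "PQ": 6.570, "O": 6.921}    # BAGEL-7B-MoT
TUNED = {"SC": 8.067, "PQ": 6.881, "O": 7.115}   # BAGEL-NHR-Edit


def gains(base: dict, tuned: dict) -> dict:
    """Return {metric: (absolute_gain, relative_gain_in_percent)}."""
    result = {}
    for metric, base_score in base.items():
        delta = tuned[metric] - base_score
        result[metric] = (round(delta, 3), round(100.0 * delta / base_score, 2))
    return result


if __name__ == "__main__":
    for metric, (abs_gain, rel_gain) in gains(BASE, TUNED).items():
        print(f"{metric}: {abs_gain:+.3f} ({rel_gain:+.2f}%)")
```

On these numbers the largest gain is in PQ (+0.311), followed by O (+0.194) and SC (+0.084).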

Metrics for ImgEdit-Bench:

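| Model | Style | Extract | Remove | Background | Action | Adjust | Add | Replace | Compose | Overall ↑ |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| BAGEL-7B-MoT | 4.22 | 1.53 | 3.04 | 3.3 | 4.07 | 3.67 | 3.98 | 3.5 | 3.0 | 3.3 |
| BAGEL-NHR-Edit | 4.3 | 1.62 | 3.18 | 3.42 | 3.95 | 3.55 | 4.19 | 3.77 | 2.94 | 3.39 |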

Scoring model: gpt-4o-2024-11-20 (with temperature = 0.0)
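
Because the ImgEdit-Bench results are split by edit category, it helps to see which categories moved in which direction. The following sketch is illustrative only. It assumes the column order shown in the table above maps one-to-one onto the listed scores, and `split_by_direction` is a hypothetical helper name.

```python
# Column names and scores copied from the ImgEdit-Bench table above,
# assuming the values appear in the same order as the column headers.
CATEGORIES = ["Style", "Extract", "Remove", "Background", "Action",
              "Adjust", "Add", "Replace", "Compose", "Overall"]
BASE = [4.22, 1.53, 3.04, 3.3, 4.07, 3.67, 3.98, 3.5, 3.0, 3.3]      # BAGEL-7B-MoT
TUNED = [4.3, 1.62, 3.18, 3.42, 3.95, 3.55, 4.19, 3.77, 2.94, 3.39]  # BAGEL-NHR-Edit


def split_by_direction(categories, base, tuned):
    """Return (improved, regressed) dicts mapping category -> score delta."""
    improved, regressed = {}, {}
    for name, base_score, tuned_score in zip(categories, base, tuned):
        delta = round(tuned_score - base_score, 2)
        if delta > 0:
            improved[name] = delta
        elif delta < 0:
            regressed[name] = delta
    return improved, regressed


if __name__ == "__main__":
    improved, regressed = split_by_direction(CATEGORIES, BASE, TUNED)
    print("improved:", improved)
    print("regressed:", regressed)
```

Read this way, the fine-tuned model scores higher in Style, Extract, Remove, Background, Add, Replace, and Overall, and lower in Action, Adjust, and Compose.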

🖼️ Image Editing Results

Generated images for ImgEdit-Bench and GEdit-Bench are included in this repository.

Comparison of the original BAGEL-7B-MoT and BAGEL-NHR-Edit on samples from ImgEdit-Bench and GEdit-Bench: (comparison image)

License

BAGEL-NHR-Edit is licensed under the Apache 2.0 license. It is fine-tuned from ByteDance-Seed/BAGEL-7B-MoT, which is also licensed under Apache 2.0.

✍️ Citation

@article{Layer2025NoHumansRequired,
    arxivId = {2507.14119},
    author = {Maksim Kuprashevich and Grigorii Alekseenko and Irina Tolstykh and Georgii Fedorov and Bulat Suleimanov and Vladimir Dokholyan and Aleksandr Gordeev},
    title = {{NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining}},
    year = {2025},
    eprint = {2507.14119},
    archivePrefix = {arXiv},
    primaryClass = {cs.CV},
    url = {https://arxiv.org/abs/2507.14119},
    journal={arXiv preprint arXiv:2507.14119}
}