Baiang Li1, 5, Sizhuo Ma3, Yanhong Zeng1, Xiaogang Xu2, 4, Youqing Fang1, Zhao Zhang5, Jian Wang3✝, Kai Chen1✝
✝Corresponding Authors.
1Shanghai AI Laboratory
2The Chinese University of Hong Kong
3Snap Inc.
4Zhejiang University
5Hefei University of Technology
Prompt a: ``A white waterfall is flowing down from the cliff, surrounded by rocks and trees.'';
Prompt b: ``The sun is setting, and the sky is filled with clouds.''
-
2024.12: A new version is under reconstruction! We will release the improved mask strategy, and support more customized generation options for selected regions.
-
2024.06: This repo is released.
# clone this repo
git clone https://github.com/ztMotaLee/Sagiri.git
cd Sagiri
# create an environment with python >= 3.9
conda create -n sagiri python=3.9
conda activate sagiri
pip install -r requirements.txt
Model Name | Description | BaiduNetdisk |
---|---|---|
stage1.ckpt | Stage1 for brightness and color adjustment. | download |
stage2.ckpt | Sagiri for conditional image generation. | download |
Note that we can use other restoration models to finish stage 1's process.
python scripts/inference_stage1.py \
--config configs/model/swinir.yaml \
--ckpt /path/to/stage1/model \
--input /path/to/input/images \
--output /path/to/output/images
python infer_Sagiri.py \
--config configs/model/cldm.yaml \
--ckpt /path/to/stage2/model\
--steps 30 \
--input /path/to/input/images \
--output /path/to/output/images \
--disable_preprocess_model \
--device cuda
python infer_LSSagiri.py \
--config configs/model/cldm.yaml \
--ckpt /path/to/stage2/model\
--steps 30 \
--input /path/to/input/images \
--output /path/to/output/images \
--device cuda
```shell
wget https://huggingface.co/stabilityai/stable-diffusion-2-1-base/resolve/main/v2-1_512-ema-pruned.ckpt --no-check-certificate
```
and
```shell
python train.py --config [training_config_path]
```
Please cite us if our work is useful for your research.
@article{li2024sagiri,
author = {Baiang Li and Sizhuo Ma and Yanhong Zeng and Xiaogang Xu and Youqing Fang and Zhao Zhang and Jian Wang and Kai Chen},
title = {Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior},
journal = {arxiv},
year = {2024},
}
This project is released under the Apache 2.0 license.
This project is based on ControlNet, BasicSR and DiffBIR. Thanks for their awesome work.
Should you have any questions, please feel free to contact with me at [email protected].