DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning

📖 Overview

This project introduces the DiffusionReward method, which enhances blind face restoration through reward feedback learning. Using DiffBIR+ReFL as an example, we provide inference and training code.

🖼️ Model Architecture and Results

Model Architecture

Face Reward Model

DiffusionReward

Visual Results

Before and after comparison

🚀 Quick Start

Environment Setup

Create conda environment

conda create -n DiffusionReward python=3.10
conda activate DiffusionReward

Install dependencies
```
pip install -r requirements.txt
```

🔍 Model Inference

Dataset Preparation

Test Dataset Download

CelebA-Test, LFW-Test, and Webphoto-Test datasets are from the VQFR public repository
Download link: VQFR Repository

Model Weights Download

Download the ControlNet weights for Diffbir+ReFL here.
Place the weight file in the ./weights folder.

Run the following command to perform model validation:

CUDA_VISIBLE_DEVICES=0 python inference.py \
    --task face \
    --diffbir_refl_path ./weights/diffbir_refl.pt \
    --version v1_refl \
    --captioner none \
    --pos_prompt '' \
    --neg_prompt "low quality,blurry,low-resolution,noisy,unsharp,weird textures" \
    --cfg_scale 1.0 \
    --sampler ddim \
    --input ./dataset/celebA/celeba_512_validation_lq/ \
    --output ./output/celeba_diffbir/ \
    --device cuda \
    --precision fp32

Parameter Description

--task: Task type, set to face
--diffbir_refl_path: Path to DiffBIR+ReFL model weights
--version: Model version
--input: Input image path
--output: Output result save path
--sampler: Sampler type

🏋️ Model Training

Training Dataset Preparation

FFHQ Dataset Preparation
- First download the FFHQ dataset: FFHQ Dataset
- Download our prepared text description files: Text Description Files
- Extract the text description files to the FFHQ image folder
- Download the images_paths.txt file, which contains relative path information for images

Dataset File Structure

dataset/
└── FFHQ/
    ├── FFHQ_&_caption/
    │   ├── 00000.png
    │   ├── 00000.txt
    │   ├── ...
    │   ├── 69999.png
    │   └── 69999.txt
    └── image_paths.txt

Model Weights Download

To run the training phase, download the following pre-trained model weights and place them in the ./weights folder:

Stable Diffusion: Download the pre-trained weights for the SD-2.1 model here.
DiffBIR: Download the pre-trained weights for the DiffBIR-v1 face model here.
SwinIR: Download the pre-trained weights for the SwinIR model here.
FaceReward: Download the pre-trained weights for the FaceReward model here.

Start Training

Use the following command to start training:

accelerate launch train.py --config configs/train/train_diffbir_refl.yaml

Note: Please modify the weight paths in the .yaml file according to your actual file locations.

🙏 Acknowledgments

This work is built upon the excellent foundation provided by the DiffBIR repository. We sincerely thank the authors for their outstanding contribution to the field of blind image restoration.

🤝 Citation

If you take use of our code or feel our paper is useful for you, please cite our papers:

@misc{wu2025diffusionrewardenhancingblindface,
      title={DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning}, 
      author={Bin Wu and Wei Wang and Yahui Liu and Zixiang Li and Yao Zhao},
      year={2025},
      eprint={2505.17910},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2505.17910}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
configs		configs
diffbir		diffbir
llava		llava
others		others
ram		ram
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
inference.py		inference.py
requirements.txt		requirements.txt
train.py		train.py
utils_refl.py		utils_refl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning

📖 Overview

🖼️ Model Architecture and Results

Model Architecture

Visual Results

🚀 Quick Start

Environment Setup

🔍 Model Inference

Dataset Preparation

Model Weights Download

Parameter Description

🏋️ Model Training

Training Dataset Preparation

Model Weights Download

Start Training

🙏 Acknowledgments

🤝 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Languages

License

01NeuralNinja/DiffusionReward

Folders and files

Latest commit

History

Repository files navigation

DiffusionReward: Enhancing Blind Face Restoration through Reward Feedback Learning

📖 Overview

🖼️ Model Architecture and Results

Model Architecture

Visual Results

🚀 Quick Start

Environment Setup

🔍 Model Inference

Dataset Preparation

Model Weights Download

Parameter Description

🏋️ Model Training

Training Dataset Preparation

Model Weights Download

Start Training

🙏 Acknowledgments

🤝 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Languages

Packages