Update README.md

daniel-gallo · daniel-gallo · commit 86eb2d66ddb3 · 2024-09-29T12:52:45.000+02:00
diff --git a/README.md b/README.md
@@ -1,202 +1,153 @@
-# Early Stopping Diffusion
-
-[![Status](https://github.com/razvanmatisan/early-stopping-diffusion/actions/workflows/python.yml/badge.svg)](https://github.com/razvanmatisan/early-stopping-diffusion/actions/workflows/python.yml)
-
-## Dev instructions
-The first time:
-1. Configure a virtual environment
-    ```bash
-    python -m venv venv
-    source venv/bin/activate
-    pip install -r src/requirements.txt
-    ```
-
-    On Windows:
-    ```bash
-    python -m venv venv
-    Set-ExecutionPolicy Unrestricted -Scope Process
-    venv\Scripts\Activate
-    pip install -r src\requirements.txt
-    ```
-
-2. Run `pre-commit install`
-3. If you use VSCode, it might be helpful to add the following to `settings.json`:
-    ```json
-    "[python]": {
-        "editor.formatOnSave": true,
-        "editor.defaultFormatter": "charliermarsh.ruff",
-        "editor.codeActionsOnSave": {
-            "source.fixAll": "explicit",
-            "source.organizeImports": "explicit"
-        }
-    },
-    ```
-
-After that, be sure that all the tests are passing before a commit. Otherwise, GitHub Actions will complain ;) You can check by running
+# DuoDiff - Accelerating Diffusion Models with a Dual-Backbone Approach
+The first step is to set up a virtual environment and install the dependencies. On UNIX,
 ```bash
-cd src
-python -m pytest tests
-```
-
-## Repository structure
-- `demos/`: Demos for visualising early stopping diffusion.
-- `src/`: Code.
-    - `CMMD_evaluation/`: Code for calculating the CMMD score of generated samples.
-    - `benchmarking/`: Files for benchmarking models.
-    - `checkpoints/`: Checkpoints.
-    - `datasets/`: Dataset-specific dataloaders.
-    - `models/`: Model definitions.
-    - `scripts/`: Scripts for training, generation, evaluation and benchmarking.
-    - `snellius/`: Files for running experiments on Snellius.
-    - `tests/`: Unit tests.
-    - `utils/`:
-        - `field_utils.py` Getters for time and space embeddings.
-        - `train_utils.py` Getters for models, optimizers, dataloaders, etc.
-    - `FID_evaluation.py`: Code for calculating the FID score of generated samples.
-    - `compute_gflops_and_layer_ratio.py`: Code for computing the average layer ratio and theoretical GFLOP.
-    - `ddpm_core.py`: Code of the DDPM sampler.
-    - `get_flops.py`: Code for computing theoretical GFLOPs.
-    - `requirements.txt`: File with requirements for setting up the virtual environment.
-    - `train.py`: Code for training models.
-- `blogpost.md`: Blogpost about the project.
-
-## Running experiments
-All of the experiments should be run inside `src` directory.
+python -m venv venv
+source venv/bin/activate
+pip install -r requirements.txt
 ```
-cd src
+On Windows,
+```powershell
+python -m venv venv
+Set-ExecutionPolicy Unrestricted -Scope Process
+venv\Scripts\Activate
+pip install -r requirements.txt
 ```
 
-### Training
-Training the models is done using `train.py` script.
-Full specification of the script can be found with `python train.py --help` command. Below are sample commands for running training with only the essential arguments.
+## Training
+This section shows how to train early-exit models and DuoDiff on the CelebA dataset. To train on other datasets, check the options available in `main.py`.
 
-#### UViT backbone
+<details>
+<summary>The first step is to obtain a full-model backbone.</summary>
 
-Command for training the UViT backbone.
-```shell
-bash scripts/train_uvit.sh
-```
-or
-```shell
+```bash
 python train.py \
+    --n_steps 500000 \
+    --batch_size 128 \
+    --log_path "${log_path}" \
+    --dataset "celeba" \
+    --log_every_n_steps 2500 \
+    --save_every_n_steps 25000 \
+    --save_new_every_n_steps 100000 \
+    --sample_height 64 \
+    --sample_width 64 \
+    --img_size 64 \
+    --patch_size 4 \
+    --seed 1 \
     --model uvit \
-    --n_steps 100000 \
-    --batch_size 128
+    --normalize_timesteps \
+    --use_amp \
+    --parametrization "predict_noise"
 ```
+</details>
 
-#### Early-exit models
-Command for training the a DeeDiff model:
-```shell
-bash scripts/train_deediff.sh
-```
-or
-```
-python train.py \
-    --model deediff_uvit \
+### Early-exit training (DeeDiff / AdaDiff)
+<details>
+<summary>Then, we can train an early-exit model based on the full-model backbone.</summary>
+
+We assume that the `load_backbone` variable points to the weights obtained in the previous step.
+```bash
+python main.py \
     --n_steps 100000 \
     --batch_size 128 \
-    --classifier_type attention_probe \
-    --normalize_timesteps
-```
-
-Below is a specification of how to run training with other settings.
-```shell
-python train.py \
-    --model deediff_uvit \
-    --n_steps ${number_of_training_steps} \
-    --batch_size ${batch_size} \
-    --classifier_type ${classifier_type} \
+    --log_path "${log_path}" \
+    --log_every_n_steps 2500 \
+    --save_every_n_steps 2500 \
+    --save_new_every_n_steps 10000 \
+    --seed 1 \
+    --load_backbone "${load_backbone}" \
+    --model "deediff_uvit" \
+    --use_amp \
     --normalize_timesteps \
-    [--load_backbone ${checkpoint_path} \]
-    [--freeze_backbone \]
-    [--use_unweighted_loss \]
+    --parametrization "predict_noise" \
+    --freeze_backbone \
+    --dataset "celeba" \
+    --classifier_type "mlp_probe_per_layer" \
+    --sample_height 64 \
+    --sample_width 64 \
+    --img_size 64 \
+    --patch_size 4 \
+    --config_path "configs/deediff_celeba.yaml"
 ```
 
-- `number_of_training_steps`: Number of iterations over the dataloader.
-- `batch_size`: Batch size.
-- `classifier_type`: Type of the classifier for determining whether to early-exit. Can be one of:
-    - `attention_probe`: Attention probe.
-    - `mlp_probe_per_timestep`: Separate MLP probe at each timestep, shared between layers.
-    - `mlp_probe_per_layer`: Separate MLP probe for each UViT layer, shared between timesteps.
-    - `mlp_probe_per_layer_per_timestep`: Separate MLP probe for each UViT layer, at each time step (nothing is shared).
-- `--freeze_backbone`: If present, then freeze the UViT backbone (train only the classifiers probes).
-- `--use_unweighted_loss`: If present, add the unweighted loss to the remaining losses.
-- (optional) `checkpoint_path`: Path to the checkpoint with UViT weights. If not specified, then train DeeDiff from scratch.
+</details>
 
-### Evaluation
+### DuoDiff training
+<details>
+<summary>Our proposed model, DuoDiff, requires training a shallow model that is used alongside the full model during inference.</summary>
 
-For evaluation, you should have a checkpoint. For convenience, we include one that can be downloaded using [git lfs](https://docs.github.com/en/repositories/working-with-files/managing-large-files/installing-git-large-file-storage):
 ```bash
-git lfs pull --include "src/checkpoints/frozenBackbone_attention_3losses.pth"
-```
-
-#### CMMD
-Command for generating samples and calculating the CMMD score: 
-```shell
-bash scripts/cmmd_evaluation.sh
-```
-or
-```shell
-python CMMD_evaluation/main.py \
-    --checkpoint_entry_name frozenBackbone_attention_3losses \
-    --exit_threshold 0.05 \
-    --cmmd_batch_size 32 \
-    --cmmd_max_count 10
+python main.py \
+    --model "uvit" \
+    --n_steps 500000 \
+    --batch_size 128 \
+    --log_path "${log_path}" \
+    --log_every_n_steps 2500 \
+    --use_amp \
+    --save_every_n_steps 25000 \
+    --save_new_every_n_steps 100000 \
+    --sample_height 64 \
+    --sample_width 64 \
+    --seed 1 \
+    --normalize_timesteps \
+    --config_path "configs/uvit_celeba_3.yaml" \
+    --dataset "celeba" \
+    --parametrization "predict_noise"
 ```
 
-- `cmmd_batch_size`: Batch size for embedding generation.
-- `cmmd_max_count`: Maximum number of images to read from each directory.
+</details>
 
-#### FID
-Command for generating samples and calculating the FID score: 
-```shell
-bash scripts/fid_evaluation.sh
+## Running inference
+This section shows how to generate images using the models trained in the previous section.
+### Early-exit sampling
+Here, `checkpoint_path` points to the trained early-exit model (not the full model).
+```bash
+python eesampler.py \
+    --seed ${seed} \
+    --checkpoint_path "${checkpoint_path}" \
+    --batch_size 128 \
+    --output_folder "${output_folder}" \
+    --threshold 0.08 \
+    --config_path "configs/deediff_celeba.yaml"
 ```
-or
-```shell
-python FID_evaluation.py \
-    --checkpoint_entry_name frozenBackbone_attention_3losses \
-    --exit_threshold 0.05
+### DuoDiff inference
+Note that DuoDiff inference uses two models: the shallow one (`checkpoint_path`) and the full one (`checkpoint_path_late`).
+```bash
+python sampler.py \
+    --seed ${seed} \
+    --checkpoint_path "${shallow_model_path}" \
+    --checkpoint_path_late "${full_model_path}" \
+    --batch_size 128 \
+    --parametrization "predict_noise" \
+    --output_folder "${output_folder}" \
+    --config_path "configs/uvit_celeba_3.yaml" \
+    --config_path_late "configs/uvit_celeba.yaml" \
+    --t_switch 300
+```
+## Computing FID scores
+FID scores for the generated samples can be computed by running the following script.
+```bash
+python fid.py \
+    --dataset "celeba" \
+    --samples_path "${samples_path}"
 ```
 
-### Benchmarking
-For computing the theoretical GFLOPs for the MLP probe, attention probe and output head, you can run the following script 
 
-```shell
-python get_gflops.py
-```
-
-Example script for computing the average layer ratio and theoretical GFLOPs:
-```shell
-python compute_gflops_and_layer_ratio.py \
-    --indices_by_timestep_directory benchmarking/output/attention_frozen/indices_by_timestep
-```
+## Dev instructions
+The first time:
+1. Run `pre-commit install`
+2. If you use VSCode, it might be helpful to add the following to `settings.json`:
+    ```json
+    "[python]": {
+        "editor.formatOnSave": true,
+        "editor.defaultFormatter": "charliermarsh.ruff",
+        "editor.codeActionsOnSave": {
+            "source.fixAll": "explicit",
+            "source.organizeImports": "explicit"
+        }
+    },
+    ```
 
-For computing the average layer ratio and theoretical GFLOPs for each method, one can run the following script:
-```shell
-python compute_gflops_and_layer_ratio.py
-    --indices_by_timestep_directory ${indices_by_timestep_directory} \
+After that, make sure that all the tests pass before each commit. Otherwise, GitHub Actions will complain ;) You can check by running
+```bash
+python -m pytest tests
 ```
-The parameter ``indices_by_timestep_directory`` is the relative path to the folder which contains files in ``.pt`` format regarding the layers which early exit took place per timestep. These directories can be found in ``benchmarking/output``. Currently, we uploaded only the ``.pt`` files for the model that uses an attention probe and a frozen backbone during training. The reason why we did not include them for all methods is because the files are pretty large. If one would need the files for the other methods, please contact us.
-
-
-## Resources
-### DeeDiff
-
-Tin's code: https://github.com/stases/EarlyDiffusion
-
-Paper: https://arxiv.org/pdf/2309.17074
-
-### Math Diffusion Models
-
-Lecture Notes in Probabilistic Diffusion Models: https://arxiv.org/html/2312.10393v1
-
-Lil'Log blogpost: https://lilianweng.github.io/posts/2021-07-11-diffusion-models
-
-### Backbones Diffusion Models
-
-U-Net: https://arxiv.org/pdf/1505.04597
-
-U-ViT: https://arxiv.org/pdf/2209.12152
-
-Diffusion Transformer (DiT): https://arxiv.org/pdf/2212.09748