Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images
Xiaofei Yu, Yitong Li, Jie Ma* [paper]
Here we provide the structure of our model:
Download Source: -Thanks for the Dataset by Liu et. al:[GitHub]. Put the content of downloaded dataset under the folder 'data'
path to ./data:
├─LevirCCcaptions.json
├─images
├─train
│ ├─A
│ ├─B
├─val
│ ├─A
│ ├─B
├─test
│ ├─A
│ ├─B
git clone https://github.com/Fay-Y/Diffusion-RSCC
cd Diffusion-RSCC
conda create -n DiffusionRSCC_env python=3.8
conda activate DiffusionRSCC_env
pip install -r requirements.txt
Preprocess the raw captions and image pairs:
python word_encode.py
python img_preprocess.py
To train the proposed Diffusion-RSCC, run the following command:
sh demo.sh
To test, evaluate and visualize on the test dataset, run the following command
sh testlm.sh
cd result
In the paper, the predicted captions are saved in folder "result".
Prediction results in test set with 5 Ground Truth captions are partly shown below, proving the effectiveness of our model. For each image pair, the left part is the before image, the righ part is the after image.