Review-of-Change-Captioning

General-Scenes: Video Surveillance, Natural Image (Birds), Synthetic Data, and Image Editing

Awesome Works

Learning to Describe Differences Between Pairs of Similar Images [paper] [code and dataset]
- Harsh Jhamtani, Taylor Berg-Kirkpatrick
- EMNLP 2018
Robust Change Captioning [paper] [code and dataset]
- Dong Huk Park, Trevor Darrell, Anna Rohrbach
- ICCV 2019
Expressing Visual Relationships via Language [paper] [code and dataset]
- Hao Tan, Franck Dernoncourt, Zhe Lin, Trung Bui, Mohit Bansal
- ACL 2019
Neural Naturalist: Generating Fine-Grained Image Comparisons [paper] [dataset]
- Maxwell Forbes, Christine Kaeser-Chen, Piyush Sharma, Serge Belongie
- EMNLP 2019
Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning [paper]
- Xiangxi Shi, Xu Yang, Jiuxiang Gu, Sha q Joty, and Jianfei Cai
- ECCV 2020
Image Change Captioning by Learning from an Auxiliary Task [paper]
- Mehrdad Hosseinzadeh and Yang Wang
- CVPR 2021
Viewpoint-Agnostic Change Captioning with Cycle Consistency [paper] [dataset]
- Hoeseong Kim, Jongseok Kim, Hyungseok Lee, Hyunsung Park, Gunhee Kim
- ICCV 2021
Describing and Localizing Multiple Changes with Transformers [paper] [code and dataset]
- Yue Qiu, Shintaro Yamamoto, Kodai Nakashima, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka, Yutaka Satoh
- ICCV 2021
Scene Graph with 3D Information for Change Captioning [paper]
- Zeming Liao, Qingbao Huang, Yu Liang, Mingyi Fu, Yi Cai, Qing Li
- ACM MM 2021
Semantic Relation-aware Difference Representation Learning for Change Captioning [paper] [code]
- Yunbin Tu, Tingting Yao, Liang Li, Jiedong Lou, Shengxiang Gao, Zhengtao Yu, Chenggang Yan
- ACL Fidings 2021
R³Net: Relation-embedded Representation Reconstruction Network for Change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Chenggang Yan, Shengxiang Gao, Zhengtao Yu
- EMNLP 2021
L2C: Describing Visual Differences Needs Semantic Understanding of Individuals [paper]
- An Yan, Xin Wang, Tsu-Jui Fu, William Yang Wang
- EACL 2021
Image Difference Captioning with Instance-Level Fine-Grained Feature Representation [paper] [code]
- Qingbao Huang, Yu Liang, Jielong Wei, Yi Cai, Hanyu Liang, Ho-fung Leung, Qing Li
- TMM 2022
Learning by Imagination: A Joint Framework for Text-Based Image Manipulation and Change Captioning [paper]
- Kenan E. Ak, YingSun, Joo Hwee Lim
- TMM 2022
Image Difference Captioning with Pre-training and Contrastive Learning [paper] [code]
- Linli Yao, Weiying Wang, Qin Jin
- AAAI 2022
CLIP4IDC: CLIP for Image Difference Captioning [paper] [code]
- Zixin Guo, Tzu-Jui Julius Wang, Jorma Laaksonen
- AACL 2022
I3N: Intra- and Inter-Representation Interaction Network for Change Captioning [paper]
- Shengbin Yue, Yunbin Tu, LiangLi, Ying Yang, Shengxiang Gao, Zhengtao Yu
- TMM 2023
Neighborhood Contrastive Transformer for Change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Li Su, Ke Lu, Qingming Huang
- TMM 2023
Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Li Su, Junping Du, Ke Lu, Qingming Huang
- TIP 2023
Self-supervised Cross-view Representation Reconstruction for Change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang
- ICCV 2023
Semantic Object Alignment and Region-Aware Learning for Change Captioning [paper]
- Weidong Tian, Quan Ren, Zhongqiu Zhao, and Ruihua Tian
- IJCNN 2023
Graph Representation for Order-aware Visual Transformation [paper]
- Yue Qiu, Yanjun Sun, Fumiya Matsuzawa, Kenji Iwata, Hirokatsu Kataoka
- CVPR 2023
Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding [paper] [code]
- Xiaonan Lu, Jianlong Yuan, Ruigang Niu, Yuan Hu, Fan Wang
- Arxiv 2023
Multi-Grained Representation Aggregating Transformer with Gating Cycle for Change Captioning [paper]
- Shengbin Yue, Yunbin Tu, LiangLi, Shengxiang Gao, Zhengtao Yu
- TOMM 2024
SMART: Syntax-calibrated Multi-Aspect Relation Transformer for Change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Qingming Huang
- TPAMI 2024
Context-aware Difference Distilling for Multi-change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang
- ACL 2024
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Li Su, Chenggang Yan, Qingming Huang
- ECCV 2024
The STVchrono Dataset: Towards Continuous Change Recognition in Time [paper] [dataset]
- Yanjun Sun, Yue Qiu, Mariia Khan, Fumiya Matsuzawa, Kenji Iwata
- CVPR 2024
Relation-Aware Multi-Pass Comparison Deconfounded Network for Change Captioning [paper]
- Zhicong Lu, Li Jin, Ziwei Chen, Changyuan Tian, Xian Sun, Xiaoyu Li, Yi Zhang, Qi Li, Guangluan Xu
- TCSVT 2024
OneDiff: A Generalist Model for Image Difference Captioning [paper]
- Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu
- ACCV 2024
Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning [paper] [code]
- Xian Zhang, Haokun Wen, Jianlong Wu, Pengda Qin, Hui Xue, Liqiang Nie
- ACM MM 2024
VIXEN: Visual Text Comparison Network for Image Difference Captioning [paper]
- Alexander Black, Jing Shi, Yifei Fan, Tu Bui, John Collomosse
- AAAI 2024
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation [paper] [code]
- Gautier Evennou, Antoine Chaffin, Vivien Chappelier, Ewa Kijak
- WACV 2025
Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning [paper]
- Rong Li, Liang Li, Jiehua Zhang, Qiang Zhao, Hongkui Wang, Chenggang Yan
- AAAI 2025
DECIDER: Difference-aware Contrastive Diffusion Model with Adversarial Perturbations for Image Change Captioning [paper]
- Guojin Zhong, Jinhong Hu, Jiajun Chen, Jin Yuan, Wenbo Pan
- AAAI 2025
MCT-CCDi : Context-Aware Contrastive Di usion Model With Mediator-Bridging Cross-Modal Transformer for Image Change Captioning [paper]
- Jinhong Hu, Guojin Zhong, Jin Yuan, Wenbo Pan, Xiaoping Wang
- TIP 2025
Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models [paper] [code]
- Qirui Jiao, Daoyuan Chen, Yilun Huang, Bolin Ding, Yaliang Li, Ying Shen
- CVPR 2025

3D-Scenes

Paper

Remote-sensing-Scenes

Awesome Works

Captioning changes in bi-temporal remote sensing images [paper]
- Seloua Chouaf, Genc Hoxha, Youcef Smara, Farid Melgani
- IGARSS 2021
Change captioning: A new paradigm for multitemporal remote sensing image analysis [paper] [dataset]
- Genc Hoxha, Seloua Chouaf, Farid Melgani, Youcef Smara
- TGRS 2022
Remote sensing image change captioning with dual-branch transformers: A new method and a large scale dataset [paper] [code and dataset]
- Chenyang Liu, RuiZhao, Zhengxia Zou, Hao Chen, Zhenwei Shi
- TGRS 2022
Progressive Scale-aware Network for Remote sensing Image Change Captioning [paper] [code]
- Chenyang Liu, Jiajun Yang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
- IGARSS 2023
A Decoupling Paradigm with Prompt Learning for Remote Sensing Image Change Captioning [paper] [code]
- Chenyang Liu, Rui Zhao, Jianqi Chen, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
- TGRS 2023
Changes to Captions: An Attentive Network for Remote Sensing Change Captioning [paper] [code]
- Shizhen Chang, Pedram Ghamisi
- TIP 2023
Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning [paper] [code]
- Chen Cai, Yi Wang, Kim-HuiYap
- Remote Sensing 2023
Pixel-Level Change Detection Pseudo-Label Learning for Remote Sensing Change Captioning [paper]
- Chenyang Liu, Keyan Chen, Zipeng Qi, Zili Liu, Haotian Zhang, Zhengxia Zou, Zhenwei Shi
- IGARSS 2024
Change Caption for Satellite Images Time Series [paper][code]
- Wei Peng, Ping Jian, Zhuqing Mao, Yingying Zhao
- GRSL 2024
RSCaMa: Remote Sensing Image Change Captioning with State Space Model [paper] [code]
- Chenyang Liu, Keyan Chen, Bowen Chen, Haotian Zhang, Zhengxia Zou, Zhenwei Shi
- GRSL 2024
A Lightweight Sparse Focus Transformer for Remote Sensing Image Change Captioning [paper] [code]
- Dongwei Sun, Yajie Bao, Junmin Liu, Xiangyong Cao
- JSTARS 2024
Single-stream Extractor Network with Contrastive Pre-training for Remote Sensing Change Captioning [paper] [code]
- Qing Zhou, Junyu Gao, Yuan Yuan, Qi Wang
- TGRS 2024
Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis [paper] [code]
- Chenyang Liu, Keyan Chen, Haotian Zhang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
- TGRS 2024
Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning [paper]
- Cai Chen, Yi Wang, Kim-Hui Yap
- ISCAS 2024
Semantic-CC: Boosting Remote Sensing Image Change Captioning via Foundational Knowledge and Semantic Guidance [paper]
- Yongshuo Zhu, Lu Li, Keyan Chen, Chenyang Liu, Fugen Zhou, Zhenwei Shi
- TGRS 2024
MfrNet: A New Multi-Scale Feature Refining Method for Remote Sensing Image Change Captioning [paper]
- Kaiqi Xu, Yingping Han, Rui Yang, Xiutiao Ye, Yanhe Guo, Hantong Xing Shuang Wang
- IGARSS 2024
Detection Assisted Change Captioning for Remote Sensing Image [paper]
- Xiliang Li, Bin Sun, Shutao Li
- IGARSS 2024
Context-aware Difference Distilling for Multi-change Captioning [paper] [code]
- Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang
- ACL 2024
Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning [paper] [code]
- Cong Yang, Zuchao Li, Hongzan Jiao, Zhi Gao, Lefei Zhang
- Arxiv 2024
Inter-Temporal Interaction and Symmetric Difference Learning for Remote Sensing Image Change Captioning [paper] [code]
- Yunpeng Li, Xiangrong Zhang, Xina Cheng, Puhua Chen, Licheng Jiao
- TGRS 2024
ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing [paper] [code]
- Yuduo Wang, Weikang Yu, Michael Kopp, Pedram Ghamisi
- Arxiv 2024
Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning [paper]
- Rong Li, Liang Li, Jiehua Zhang, Qiang Zhao, Hongkui Wang, Chenggang Yan
- AAAI 2025
Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective [paper] [code]
- Duowang Zhu, Xiaohu Huang, Haiyan Huang, Hao Zhou, Zhenfeng Shao
- CVPR 2025

Name		Name	Last commit message	Last commit date
Latest commit History 84 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Review-of-Change-Captioning

Table of Contents

General-Scenes: Video Surveillance, Natural Image (Birds), Synthetic Data, and Image Editing

Awesome Works

3D-Scenes

Paper

Remote-sensing-Scenes

Awesome Works

About

Uh oh!

Releases

Packages

tuyunbin/Review-of-Change-Captioning

Folders and files

Latest commit

History

Repository files navigation

Review-of-Change-Captioning

Table of Contents

General-Scenes: Video Surveillance, Natural Image (Birds), Synthetic Data, and Image Editing

Awesome Works

3D-Scenes

Paper

Remote-sensing-Scenes

Awesome Works

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages