
This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.


tuyunbin/Review-of-Change-Captioning


Review-of-Change-Captioning

Table of Contents

General-Scenes: Video Surveillance, Natural Image (Birds), Synthetic Data, and Image Editing

Awesome Works

  • Learning to Describe Differences Between Pairs of Similar Images [paper] [code and dataset]

    • Harsh Jhamtani, Taylor Berg-Kirkpatrick
    • EMNLP 2018
  • Robust Change Captioning [paper] [code and dataset]

    • Dong Huk Park, Trevor Darrell, Anna Rohrbach
    • ICCV 2019
  • Expressing Visual Relationships via Language [paper] [code and dataset]

    • Hao Tan, Franck Dernoncourt, Zhe Lin, Trung Bui, Mohit Bansal
    • ACL 2019
  • Neural Naturalist: Generating Fine-Grained Image Comparisons [paper] [dataset]

    • Maxwell Forbes, Christine Kaeser-Chen, Piyush Sharma, Serge Belongie
    • EMNLP 2019
  • Finding It at Another Side: A Viewpoint-Adapted Matching Encoder for Change Captioning [paper]

    • Xiangxi Shi, Xu Yang, Jiuxiang Gu, Shafiq Joty, and Jianfei Cai
    • ECCV 2020
  • Image Change Captioning by Learning from an Auxiliary Task [paper]

    • Mehrdad Hosseinzadeh and Yang Wang
    • CVPR 2021
  • Viewpoint-Agnostic Change Captioning with Cycle Consistency [paper] [dataset]

    • Hoeseong Kim, Jongseok Kim, Hyungseok Lee, Hyunsung Park, Gunhee Kim
    • ICCV 2021
  • Describing and Localizing Multiple Changes with Transformers [paper] [code and dataset]

    • Yue Qiu, Shintaro Yamamoto, Kodai Nakashima, Ryota Suzuki, Kenji Iwata, Hirokatsu Kataoka, Yutaka Satoh
    • ICCV 2021
  • Scene Graph with 3D Information for Change Captioning [paper]

    • Zeming Liao, Qingbao Huang, Yu Liang, Mingyi Fu, Yi Cai, Qing Li
    • ACM MM 2021
  • Semantic Relation-aware Difference Representation Learning for Change Captioning [paper] [code]

    • Yunbin Tu, Tingting Yao, Liang Li, Jiedong Lou, Shengxiang Gao, Zhengtao Yu, Chenggang Yan
    • ACL Findings 2021
  • R3Net: Relation-embedded Representation Reconstruction Network for Change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Chenggang Yan, Shengxiang Gao, Zhengtao Yu
    • EMNLP 2021
  • L2C: Describing Visual Differences Needs Semantic Understanding of Individuals [paper]

    • An Yan, Xin Wang, Tsu-Jui Fu, William Yang Wang
    • EACL 2021
  • Image Difference Captioning with Instance-Level Fine-Grained Feature Representation [paper] [code]

    • Qingbao Huang, Yu Liang, Jielong Wei, Yi Cai, Hanyu Liang, Ho-fung Leung, Qing Li
    • TMM 2022
  • Learning by Imagination: A Joint Framework for Text-Based Image Manipulation and Change Captioning [paper]

    • Kenan E. Ak, Ying Sun, Joo Hwee Lim
    • TMM 2022
  • Image Difference Captioning with Pre-training and Contrastive Learning [paper] [code]

    • Linli Yao, Weiying Wang, Qin Jin
    • AAAI 2022
  • CLIP4IDC: CLIP for Image Difference Captioning [paper] [code]

    • Zixin Guo, Tzu-Jui Julius Wang, Jorma Laaksonen
    • AACL 2022
  • I3N: Intra- and Inter-Representation Interaction Network for Change Captioning [paper]

    • Shengbin Yue, Yunbin Tu, Liang Li, Ying Yang, Shengxiang Gao, Zhengtao Yu
    • TMM 2023
  • Neighborhood Contrastive Transformer for Change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Li Su, Ke Lu, Qingming Huang
    • TMM 2023
  • Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Li Su, Junping Du, Ke Lu, Qingming Huang
    • TIP 2023
  • Self-supervised Cross-view Representation Reconstruction for Change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang
    • ICCV 2023
  • Semantic Object Alignment and Region-Aware Learning for Change Captioning [paper]

    • Weidong Tian, Quan Ren, Zhongqiu Zhao, and Ruihua Tian
    • IJCNN 2023
  • Graph Representation for Order-aware Visual Transformation [paper]

    • Yue Qiu, Yanjun Sun, Fumiya Matsuzawa, Kenji Iwata, Hirokatsu Kataoka
    • CVPR 2023
  • Viewpoint Integration and Registration with Vision Language Foundation Model for Image Change Understanding [paper] [code]

    • Xiaonan Lu, Jianlong Yuan, Ruigang Niu, Yuan Hu, Fan Wang
    • arXiv 2023
  • Multi-Grained Representation Aggregating Transformer with Gating Cycle for Change Captioning [paper]

    • Shengbin Yue, Yunbin Tu, Liang Li, Shengxiang Gao, Zhengtao Yu
    • TOMM 2024
  • SMART: Syntax-calibrated Multi-Aspect Relation Transformer for Change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Qingming Huang
    • TPAMI 2024
  • Context-aware Difference Distilling for Multi-change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang
    • ACL 2024
  • Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Li Su, Chenggang Yan, Qingming Huang
    • ECCV 2024
  • The STVchrono Dataset: Towards Continuous Change Recognition in Time [paper] [dataset]

    • Yanjun Sun, Yue Qiu, Mariia Khan, Fumiya Matsuzawa, Kenji Iwata
    • CVPR 2024
  • Relation-Aware Multi-Pass Comparison Deconfounded Network for Change Captioning [paper]

    • Zhicong Lu, Li Jin, Ziwei Chen, Changyuan Tian, Xian Sun, Xiaoyu Li, Yi Zhang, Qi Li, Guangluan Xu
    • TCSVT 2024
  • OneDiff: A Generalist Model for Image Difference Captioning [paper]

    • Erdong Hu, Longteng Guo, Tongtian Yue, Zijia Zhao, Shuning Xue, Jing Liu
    • ACCV 2024
  • Differential-Perceptive and Retrieval-Augmented MLLM for Change Captioning [paper] [code]

    • Xian Zhang, Haokun Wen, Jianlong Wu, Pengda Qin, Hui Xue, Liqiang Nie
    • ACM MM 2024
  • VIXEN: Visual Text Comparison Network for Image Difference Captioning [paper]

    • Alexander Black, Jing Shi, Yifei Fan, Tu Bui, John Collomosse
    • AAAI 2024
  • Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation [paper] [code]

    • Gautier Evennou, Antoine Chaffin, Vivien Chappelier, Ewa Kijak
    • WACV 2025
  • Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning [paper]

    • Rong Li, Liang Li, Jiehua Zhang, Qiang Zhao, Hongkui Wang, Chenggang Yan
    • AAAI 2025
  • DECIDER: Difference-aware Contrastive Diffusion Model with Adversarial Perturbations for Image Change Captioning [paper]

    • Guojin Zhong, Jinhong Hu, Jiajun Chen, Jin Yuan, Wenbo Pan
    • AAAI 2025
  • MCT-CCDiff: Context-Aware Contrastive Diffusion Model With Mediator-Bridging Cross-Modal Transformer for Image Change Captioning [paper]

    • Jinhong Hu, Guojin Zhong, Jin Yuan, Wenbo Pan, Xiaoping Wang
    • TIP 2025
  • Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models [paper] [code]

    • Qirui Jiao, Daoyuan Chen, Yilun Huang, Bolin Ding, Yaliang Li, Ying Shen
    • CVPR 2025

3D-Scenes

Paper

Remote-sensing-Scenes

Awesome Works

  • Captioning changes in bi-temporal remote sensing images [paper]

    • Seloua Chouaf, Genc Hoxha, Youcef Smara, Farid Melgani
    • IGARSS 2021
  • Change captioning: A new paradigm for multitemporal remote sensing image analysis [paper] [dataset]

    • Genc Hoxha, Seloua Chouaf, Farid Melgani, Youcef Smara
    • TGRS 2022
  • Remote sensing image change captioning with dual-branch transformers: A new method and a large scale dataset [paper] [code and dataset]

    • Chenyang Liu, Rui Zhao, Zhengxia Zou, Hao Chen, Zhenwei Shi
    • TGRS 2022
  • Progressive Scale-aware Network for Remote sensing Image Change Captioning [paper] [code]

    • Chenyang Liu, Jiajun Yang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
    • IGARSS 2023
  • A Decoupling Paradigm with Prompt Learning for Remote Sensing Image Change Captioning [paper] [code]

    • Chenyang Liu, Rui Zhao, Jianqi Chen, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
    • TGRS 2023
  • Changes to Captions: An Attentive Network for Remote Sensing Change Captioning [paper] [code]

    • Shizhen Chang, Pedram Ghamisi
    • TIP 2023
  • Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning [paper] [code]

    • Chen Cai, Yi Wang, Kim-Hui Yap
    • Remote Sensing 2023
  • Pixel-Level Change Detection Pseudo-Label Learning for Remote Sensing Change Captioning [paper]

    • Chenyang Liu, Keyan Chen, Zipeng Qi, Zili Liu, Haotian Zhang, Zhengxia Zou, Zhenwei Shi
    • IGARSS 2024
  • Change Caption for Satellite Images Time Series [paper] [code]

    • Wei Peng, Ping Jian, Zhuqing Mao, Yingying Zhao
    • GRSL 2024
  • RSCaMa: Remote Sensing Image Change Captioning with State Space Model [paper] [code]

    • Chenyang Liu, Keyan Chen, Bowen Chen, Haotian Zhang, Zhengxia Zou, Zhenwei Shi
    • GRSL 2024
  • A Lightweight Sparse Focus Transformer for Remote Sensing Image Change Captioning [paper] [code]

    • Dongwei Sun, Yajie Bao, Junmin Liu, Xiangyong Cao
    • JSTARS 2024
  • Single-stream Extractor Network with Contrastive Pre-training for Remote Sensing Change Captioning [paper] [code]

    • Qing Zhou, Junyu Gao, Yuan Yuan, Qi Wang
    • TGRS 2024
  • Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis [paper] [code]

    • Chenyang Liu, Keyan Chen, Haotian Zhang, Zipeng Qi, Zhengxia Zou, Zhenwei Shi
    • TGRS 2024
  • Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning [paper]

    • Cai Chen, Yi Wang, Kim-Hui Yap
    • ISCAS 2024
  • Semantic-CC: Boosting Remote Sensing Image Change Captioning via Foundational Knowledge and Semantic Guidance [paper]

    • Yongshuo Zhu, Lu Li, Keyan Chen, Chenyang Liu, Fugen Zhou, Zhenwei Shi
    • TGRS 2024
  • MfrNet: A New Multi-Scale Feature Refining Method for Remote Sensing Image Change Captioning [paper]

    • Kaiqi Xu, Yingping Han, Rui Yang, Xiutiao Ye, Yanhe Guo, Hantong Xing, Shuang Wang
    • IGARSS 2024
  • Detection Assisted Change Captioning for Remote Sensing Image [paper]

    • Xiliang Li, Bin Sun, Shutao Li
    • IGARSS 2024
  • Context-aware Difference Distilling for Multi-change Captioning [paper] [code]

    • Yunbin Tu, Liang Li, Li Su, Zheng-Jun Zha, Chenggang Yan, Qingming Huang
    • ACL 2024
  • Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning [paper] [code]

    • Cong Yang, Zuchao Li, Hongzan Jiao, Zhi Gao, Lefei Zhang
    • arXiv 2024
  • Inter-Temporal Interaction and Symmetric Difference Learning for Remote Sensing Image Change Captioning [paper] [code]

    • Yunpeng Li, Xiangrong Zhang, Xina Cheng, Puhua Chen, Licheng Jiao
    • TGRS 2024
  • ChangeMinds: Multi-task Framework for Detecting and Describing Changes in Remote Sensing [paper] [code]

    • Yuduo Wang, Weikang Yu, Michael Kopp, Pedram Ghamisi
    • arXiv 2024
  • Region-aware Difference Distilling with Attribute-guided Contrastive Regularization for Change Captioning [paper]

    • Rong Li, Liang Li, Jiehua Zhang, Qiang Zhao, Hongkui Wang, Chenggang Yan
    • AAAI 2025
  • Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective [paper] [code]

    • Duowang Zhu, Xiaohu Huang, Haiyan Huang, Hao Zhou, Zhenfeng Shao
    • CVPR 2025
