265
Views
0
CrossRef citations to date
0
Altmetric
Research Article

FRIC: a framework for few-shot remote sensing image captioning

, , &
Article: 2337240 | Received 12 Jan 2024, Accepted 25 Mar 2024, Published online: 04 Apr 2024

References

  • Anderson, Peter, Basura Fernando, Mark Johnson, and Stephen Gould. 2016. “SPICE: Semantic Propositional Image Caption Evaluation.” Paper presented at the European conference on computer vision, Amsterdam, The Netherlands.
  • Banerjee, Satanjeev, and Alon Lavie. 2005. “METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments.” Paper presented at the IEEvaluation@ACL, AnnArbor, MI, USA, 11 June 2005.
  • Chen, Xianyu, Ming Jiang, and Qi Zhao. 2021. “Self-Distillation for Few-Shot Image Captioning.” 2021 IEEE winter conference on applications of computer vision (WACV):545–555.
  • Chen, Zihang, Junjue Wang, Ailong Ma, and Yanfei Zhong. 2022. “TypeFormer: Multiscale Transformer With Type Controller for Remote Sensing Image Caption.” IEEE Geoscience and Remote Sensing Letters 19:1–5.
  • Cheng, Qimin, Deqiao Gan, Peng Fu, Haiyan Huang, and Yuzhuo Zhou. 2021. “A Novel Ensemble Architecture of Residual Attention-Based Deep Metric Learning for Remote Sensing Image Retrieval.” Remote Sensing 13:3445. https://doi.org/10.3390/rs13173445.
  • Du, Runyan, Wei Cao, Wenkai Zhang, Guo Zhi, Xianchen Sun, Shuoke Li, and Jihao Li. 2023. “From Plane to Hierarchy: Deformable Transformer for Remote Sensing Image Captioning.” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 16:7704–7717. https://doi.org/10.1109/JSTARS.2023.3305889.
  • Finn, Chelsea, P. Abbeel, and Sergey Levine. 2017. “Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks.” Paper presented at the international conference on machine learning, Sydney, Australia.
  • Gallego, Antonio Javier, A. Pertusa, and Pablo Gil. 2018. “Automatic Ship Classification from Optical Aerial Images with Convolutional Neural Networks.” Remote Sensing 10:511. https://doi.org/10.3390/rs10040511.
  • Hoxha, Genc, and Farid Melgani. 2022. “A Novel SVM-Based Decoder for Remote Sensing Image Captioning.” IEEE Transactions on Geoscience and Remote Sensing 60:1–14.
  • Jeong, Taewon, and Heeyoung Kim. 2020. “OOD-MAML: Meta-Learning for Few-Shot Out-of-Distribution Detection and Classification.” Paper presented at the neural information processing systems, Vancouver, Canada.
  • Kandala, Hitesh, Sudipan Saha, Biplab Banerjee, and Xiao Xiang Zhu. 2022. “Exploring Transformer and Multilabel Classification for Remote Sensing Image Captioning.” IEEE Geoscience and Remote Sensing Letters 19:1–5. https://doi.org/10.1109/LGRS.2022.3198234.
  • Kemker, Ronald, Carl Salvaggio, and Christopher Kanan. 2018. “Algorithms for Semantic Segmentation of Multispectral Remote Sensing Imagery Using Deep Learning.” ISPRS Journal of Photogrammetry and Remote Sensing 145: 60–77.
  • Li, Xiaomin, D. Shi, Xiaolei Diao, and Hao Xu. 2022. “SCL-MLNet: Boosting Few-Shot Remote Sensing Scene Classification via Self-Supervised Contrastive Learning.” IEEE Transactions on Geoscience and Remote Sensing 60:1–12.
  • Li, Xuelong, Xueting Zhang, Wei Huang, and Qi Wang. 2020. “Truncation Cross Entropy Loss for Remote Sensing Image Captioning.” IEEE Transactions on Geoscience and Remote Sensing 59:5246–5257. https://doi.org/10.1109/TGRS.2020.3010106.
  • Lin, Chin-Yew. 2004. “ROUGE: A Package for Automatic Evaluation of Summaries.” Paper presented at the annual meeting of the association for computational linguistics, Barcelona, Spain.
  • Lin, Tsung-Yi, Michael Maire, Serge J. Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C. Lawrence Zitnick. 2014. “Microsoft COCO: Common Objects in Context.” Paper presented at the European conference on computer vision, Zurich, Switzerland.
  • Liu, Qingrong, Chengqing Ruan, Shan Zhong, Jian Li, Zhonghui Yin, and Xihu Lian. 2018. “Risk Assessment of Storm Surge Disaster Based on Numerical Models and Remote Sensing.” International Journal of Applied Earth Observation and Geoinformation 68: 20–30.
  • Liu, Chenyang, Rui Zhao, Jianqi Chen, Zipeng Qi, Zhengxia Zou, and Zhen Xia Shi. 2023. “A Decoupling Paradigm with Prompt Learning for Remote Sensing Image Change Captioning.” IEEE Transactions on Geoscience and Remote Sensing 61:1–18.
  • Liu, Chenyang, Rui Zhao, Hao Chen, Zhengxia Zou, and Zhen Xia Shi. 2022. “Remote Sensing Image Change Captioning With Dual-Branch Transformers: A New Method and a Large Scale Dataset.” IEEE Transactions on Geoscience and Remote Sensing 60:1–20.
  • Liu, Chenyang, Rui Zhao, and Zhen Xia Shi. 2022. “Remote-Sensing Image Captioning Based on Multilayer Aggregated Transformer.” IEEE Geoscience and Remote Sensing Letters 19:1–5.
  • Lu, Xiaoqiang, Binqiang Wang, Xiangtao Zheng, and Xuelong Li. 2017. “Exploring Models and Data for Remote Sensing Image Caption Generation.” IEEE Transactions on Geoscience and Remote Sensing 56:2183–2195.
  • Lyu, Qiang, and Weiqiang Wang. 2023. “Compositional Prototypical Networks for Few-Shot Classification.” ArXiv abs/2306.06584.
  • Munkhdalai, Tsendsuren, and Hong Yu. 2017. “Meta Networks.” Proceedings of Machine Learning Research 70:2554–2563.
  • Papineni, Kishore, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. “Bleu: A Method for Automatic Evaluation of Machine Translation.” Paper presented at the annual meeting of the association for computational linguistics, Philadelphia, PA, USA.
  • Qu, Bo, Xuelong Li, Dacheng Tao, and Xiaoqiang Lu. 2016. “Deep Semantic Understanding of High Resolution Remote Sensing Image.” 2016 international conference on computer, information and telecommunication systems (CITS), Kunming, China: 1–5.
  • Shang, Ronghua, Jiaming Wang, Licheng Jiao, R. Stolkin, Biao Hou, and Yangyang Li. 2018. “SAR Targets Classification Based on Deep Memory Convolution Neural Networks and Transfer Parameters.” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 11 (8): 2834–2846. https://doi.org/10.1109/JSTARS.2018.2836909.
  • Shi, Zhenwei, and Zhengxia Zou. 2017. “Can a Machine Generate Humanlike Language Descriptions for a Remote Sensing Image?” IEEE Transactions on Geoscience and Remote Sensing 55:3623–3634. https://doi.org/10.1109/TGRS.2017.2677464.
  • Snell, Jake, Kevin Swersky, and Richard S. Zemel. 2017. “Prototypical Networks for Few-Shot Learning.” ArXiv abs/1703.05175.
  • Vedantam, Ramakrishna, C. Lawrence Zitnick, and Devi Parikh. 2014. “CIDER: Consensus-Based Image Description Evaluation.” 2015 IEEE conference on computer vision and pattern recognition (CVPR), Boston, MA, USA: 4566–4575.
  • Wang, Binqiang, Xiaoqiang Lu, Xiangtao Zheng, and Xuelong Li. 2019. “Semantic Descriptions of High-Resolution Remote Sensing Images.” IEEE Geoscience and Remote Sensing Letters 16:1274–1278. https://doi.org/10.1109/LGRS.2019.2893772.
  • Yang, Qiaoqiao, Zihao Ni, and Pengxin Ren. 2022. “Meta Captioning: A Meta Learning Based Remote Sensing Image Captioning Framework.” ISPRS Journal of Photogrammetry and Remote Sensing 186: 190–200.
  • Zhang, Haopeng, Xingyu Zhang, Gang Meng, Chen Guo, and Zhi-guo Jiang. 2022. “Few-Shot Multi-Class Ship Detection in Remote Sensing Images Using Attention Feature Map and Multi-Relation Detector.” Remote Sensing 14:2790. https://doi.org/10.3390/rs14122790.
  • Zhang, Zhengyuan, Wenkai Zhang, Menglong Yan, Xin Gao, Kun Fu, and Xian Sun. 2022. “Global Visual Feature and Linguistic State Guided Attention for Remote Sensing Image Captioning.” IEEE Transactions on Geoscience and Remote Sensing 60:1–16.
  • Zhuang, Shuo, Pingping Wang, Gang Wang, Di Wang, Jinyong Chen, and Feng Gao. 2022. “Improving Remote Sensing Image Captioning by Combining Grid Features and Transformer.” IEEE Geoscience and Remote Sensing Letters 19:1–5.