Paper
7 June 2023 Structural representative network for remote sensing image captioning
Jaya Sharma, Peketi Divya, Yenduri Sravani, B. H. Shekar, Krishna Mohan Chalavadi
Author Affiliations +
Proceedings Volume 12701, Fifteenth International Conference on Machine Vision (ICMV 2022); 127011Q (2023) https://doi.org/10.1117/12.2679283
Event: Fifteenth International Conference on Machine Vision (ICMV 2022), 2022, Rome, Italy
Abstract
Current encoder-decoder methods for remote sensing image captioning (RSIC) avoids fine-grained structural representation of objects due to the lack of prominent encoding frameworks. This paper proposes a novel structural representative network (SRN) for acquiring fine-grained structures of remote sensing images (RSI) for generating semantically meaningful captions. Initially, we employ SRN on top of the final layers of the convolutional neural network (CNN) for attaining the spatially transformed RSI features. A multi-stage decoder is incorporated into the extracted features of SRN to produce fine-grained meaningful captions. The efficacy of our proposed methodology is exhibited on two RSIC datasets, i.e Sydney-Captions dataset, and the UCM-Captions dataset.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jaya Sharma, Peketi Divya, Yenduri Sravani, B. H. Shekar, and Krishna Mohan Chalavadi "Structural representative network for remote sensing image captioning", Proc. SPIE 12701, Fifteenth International Conference on Machine Vision (ICMV 2022), 127011Q (7 June 2023); https://doi.org/10.1117/12.2679283
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Semantics

Convolution

Image retrieval

Remote sensing

Data modeling

Image processing

Back to Top