ABSTRACT
When natural image detection methods are applied to remote sensing images, their detection performance is often unsatisfactory due to the random distribution of objects, complex backgrounds, and significant scale changes. In order to better detect objects with complex backgrounds and significant scale changes in remote sensing images, this study presents SGMFNet, a remote sensing image object detection network based on spatial global attention (SGA) and multi-scale feature fusion (MFF). The SGA inserted into the backbone network can better model context information, suppress irrelevant background, and build powerful feature information, making it easier for subsequent MFF to extract scale-invariant information from adjacent feature layers. This study evaluates the performance of SGMFNet on remote sensing datasets DIOR, NWPU VHR-10, and RSD-GOD. Quantitative and qualitative results on three datasets demonstrate the superiority of SGMFNet in remote sensing object detection and its outperformance compared with other state-of-the-art methods. Therefore, SGMFNet can assist in high-precision urban planning, military monitoring, and other tasks.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Data availability statement
The dataset in our study is based on the DIOR dataset (https://opendatalab.com/DIOR), NWPU VHR-10 dataset (https://opendatalab.com/NWPU_VHR-10), and RSD-GOD dataset (https://github.com/ZhuangShuoH/geospatial-object-detection).