skip to main content
10.1145/3625343.3625362acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaimlrConference Proceedingsconference-collections
research-article

Pixel Attention Feature Pyramid Network for Few-Shot Object Detection in Remote Sensing Images

Published: 11 November 2023 Publication History

Abstract

Recently, few-shot object detection (FSOD) has attracted extensive attention in the field of computer vision. However, these FSOD methods for natural images have limited performance when applied to remote sensing images with arbitrary orientations, dense objects, and complex backgrounds. In this paper, we propose a novel few-shot object detection method to address the above challenges in remote sensing images. First, we design a non-freezing two-stage fine-tuning detector that is fully trained on a base class data to extract information in the first stage, and fine-tunes the entire network for better novel class detection in the second stage using balanced training data containing both base and novel class samples. Then, a pixel attention feature pyramid network (PA-FPN) is proposed to replace the traditional FPN, which adds a pixel attention-based feature selection module (PA-FSM) and a pixel attention-based sub-pixel convolution module(PA-SPCM). In PA-FSM, pixel attention is used to extract more discriminative channels, thereby highlighting key features while suppressing background noise. In PA-SPCM, the sub-pixel operation performs convolution and upsamples the low-resolution feature maps with the aim of obtaining a high-resolution feature maps with richer semantic information, while pixel attention is applied to the high-resolution feature maps to enhance the semantic information. Finally, we introduce a proposal rectification head to eliminate false positives with high scores and address missing samples with low scores, which can improve classification accuracy. Experiments conducted on two public remote sensing datasets demonstrate the effectiveness of our method.

References

[1]
Gong Cheng and Junwei Han. 2016. A survey on object detection in optical remote sensing images. ISPRS Journal of Photogrammetry and Remote sensing 117 (2016), 11–28.
[2]
Gong Cheng, Junwei Han, Peicheng Zhou, and Lei Guo. 2014. Multi-class geospatial object detection and geographic image classification based on collection of part detectors. ISPRS Journal of Photogrammetry and Remote Sensing 98 (2014), 119–132.
[3]
Gong Cheng, Bowei Yan, Peizhen Shi, Ke Li, Xiwen Yao, Lei Guo, and Junwei Han. 2021. Prototype-CNN for few-shot object detection in remote sensing images. IEEE Transactions on Geoscience and Remote Sensing 60 (2021), 1–10.
[4]
Jingyu Deng, Xiang Li, and Yi Fang. 2020. Few-shot object detection on remote sensing images. arXiv preprint arXiv:2006.07826 (2020).
[5]
Bingyi Kang, Zhuang Liu, Xin Wang, Fisher Yu, Jiashi Feng, and Trevor Darrell. 2019. Few-shot object detection via feature reweighting. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 8420–8429.
[6]
Mona Köhler, Markus Eisenbach, and Horst-Michael Gross. 2021. Few-Shot Object Detection: A Survey. arXiv preprint arXiv:2112.11699 (2021).
[7]
Ke Li, Gang Wan, Gong Cheng, Liqiu Meng, and Junwei Han. 2020. Object detection in optical remote sensing images: A survey and a new benchmark. ISPRS Journal of Photogrammetry and Remote Sensing 159 (2020), 296–307.
[8]
Li Liu, Wanli Ouyang, Xiaogang Wang, Paul Fieguth, Jie Chen, Xinwang Liu, and Matti Pietikäinen. 2020. Deep learning for generic object detection: A survey. International journal of computer vision 128, 2 (2020), 261–318.
[9]
Limeng Qiao, Yuxuan Zhao, Zhiyuan Li, Xi Qiu, Jianan Wu, and Chi Zhang. 2021. Defrcn: Decoupled faster r-cnn for few-shot object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 8681–8690.
[10]
Wenzhe Shi, Jose Caballero, Ferenc Huszár, Johannes Totz, Andrew P Aitken, Rob Bishop, Daniel Rueckert, and Zehan Wang. 2016. Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1874–1883.
[11]
Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, and Chi Zhang. 2021. Fsce: Few-shot object detection via contrastive proposal encoding. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7352–7362.
[12]
Xin Wang, Thomas E Huang, Trevor Darrell, Joseph E Gonzalez, and Fisher Yu. 2020. Frustratingly simple few-shot object detection. arXiv preprint arXiv:2003.06957 (2020).
[13]
Yan Wang, Chaofei Xu, Cuiwei Liu, and Zhaokui Li. 2022. Context Information Refinement for Few-Shot Object Detection in Remote Sensing Images. Remote Sensing 14, 14 (2022), 3255.
[14]
Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, and Chao Dong. 2020. Efficient image super-resolution using pixel attention. In European Conference on Computer Vision. Springer, 56–72.
[15]
Zhitao Zhao, Ping Tang, Lijun Zhao, and Zheng Zhang. 2021. Few-Shot Object Detection of Remote Sensing Images via Two-Stage Fine-Tuning. IEEE Geoscience and Remote Sensing Letters 19 (2021), 1–5.

Index Terms

  1. Pixel Attention Feature Pyramid Network for Few-Shot Object Detection in Remote Sensing Images

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AIMLR '23: Proceedings of the 2023 Asia Conference on Artificial Intelligence, Machine Learning and Robotics
    September 2023
    133 pages
    ISBN:9798400708312
    DOI:10.1145/3625343
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 November 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. feature pyramid network
    2. few-shot learning
    3. object detection
    4. pixel attention
    5. remote sensing images

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • the National Natural Science Foundation of China
    • the Applied Basic Research Project of Liaoning Province

    Conference

    AIMLR 2023

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 26
      Total Downloads
    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media