We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
DCIFPN: Deformable cross‐scale interaction feature pyramid network for object detection.
- Authors
Xiao, Junrui; Jiang, He; Li, Zhikai; Gu, Qingyi
- Abstract
Exploiting multi‐scale features is one of the most effective methods to recognize objects of different scales in object detection. Since image pyramid is time‐consuming, Feature Pyramid Network (FPN) becomes the most popular component used for obtaining pyramidal features. Despite its effectiveness, there still exist some intrinsic defects. In this work, it is attributed to insufficient information flow and a Deformable Cross‐scale Interaction Feature Pyramid Network (DCIFPN) is proposed, which aims to promote the information transfer process with content‐aware sampling and dynamic aggregation weights. More specifically, Deformable Semantic Enhancement Module (DSEM) is designed that can construct accurate information flow with dynamic aggregation weights. In addition, Deformable Spatial Refinement Module (DSRM) is proposed to enhance high‐level features with low‐level location details. When DCIFPN is deployed on RetinaNet and FCOS with ResNet‐50, the performance is improved by 1.6 AP and 1.1 AP, respectively, on the challenging MS COCO benchmark. Apart from one‐stage detectors, DCIFPN is also applicable to two‐stage methods such as Faster R‐CNN and Mask R‐CNN. Further experiments on Pascal VOC and CrowdHuman datasets can verify the effectiveness and generalization of the method.
- Subjects
PYRAMIDS; KNOWLEDGE transfer; FEATURE extraction
- Publication
IET Image Processing (Wiley-Blackwell), 2023, Vol 17, Issue 9, p2596
- ISSN
1751-9659
- Publication type
Article
- DOI
10.1049/ipr2.12800