We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Arbitrary shape text detection fusing InceptionNeXt and multi-scale attention mechanism.
- Authors
Li, Xianguo; Zhang, Yu; Liu, Yi; Yao, Xingchen; Zhou, Xinyi
- Abstract
Existing segmentation-based text detection methods generally face the problems of insufficient receptive fields, insufficient text information filtering, and difficulty in balancing detection accuracy and speed, limiting their ability to detect arbitrary-shaped text in complex backgrounds. To address these problems, we propose a new text detection method fusing the pure ConvNet model InceptionNeXt and the multi-scale attention mechanism. Firstly, we propose a text information reinforcement module to fully extract effective text information from features of different scales while preserving spatial position information. Secondly, we construct the InceptionNeXt Block module to compensate for insufficient receptive fields without significantly reducing speed. Finally, the INA-DBNet network structure is designed to fuse local and global features and achieve the balance of accuracy and speed. Experimental results demonstrate the efficacy of our method. Particularly, on the MSRA-TD500 and Total-text datasets, INA-DBNet achieves 91.3% and 86.7% F-measure while maintaining real-time inference speed. Code is available at: https://github.com/yuyu678/INANET.
- Subjects
RECOMMENDER systems; INFORMATION filtering; MULTISCALE modeling; SPEED
- Publication
Journal of Supercomputing, 2024, Vol 80, Issue 17, p25484
- ISSN
0920-8542
- Publication type
Article
- DOI
10.1007/s11227-024-06418-w