We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Enhancing Insect Sound Classification Using Dual-Tower Network: A Fusion of Temporal and Spectral Feature Perception.
- Authors
He, Hangfei; Chen, Junyang; Chen, Hongkun; Zeng, Borui; Huang, Yutong; Zhaopeng, Yudan; Chen, Xiaoyan
- Abstract
Featured Application: Insects often inhabit environments that are difficult to observe and explore due to their small size, strong camouflage abilities, and secretive lifestyle. This inherent difficulty in visual inspection requires alternative approaches. In this study, we started with insect sounds and drew on the way insect brains process sound signals to propose a classification module called "dual-frequency and spectral fusion module (DFSM)". Overall, our research shows that the proposal of this module provides an important reference for the field of insect sound classification, promoting research and application in the field of biological control. In the modern field of biological pest control, especially in the realm of insect population monitoring, deep learning methods have made further advancements. However, due to the small size and elusive nature of insects, visual detection is often impractical. In this context, the recognition of insect sound features becomes crucial. In our study, we introduce a classification module called the "dual-frequency and spectral fusion module (DFSM)", which enhances the performance of transfer learning models in audio classification tasks. Our approach combines the efficiency of EfficientNet with the hierarchical design of the Dual Towers, drawing inspiration from the way the insect neural system processes sound signals. This enables our model to effectively capture spectral features in insect sounds and form multiscale perceptions through inter-tower skip connections. Through detailed qualitative and quantitative evaluations, as well as comparisons with leading traditional insect sound recognition methods, we demonstrate the advantages of our approach in the field of insect sound classification. Our method achieves an accuracy of 80.26% on InsectSet32, surpassing existing state-of-the-art models by 3 percentage points. Additionally, we conducted generalization experiments using three classic audio datasets. The results indicate that DFSM exhibits strong robustness and wide applicability, with minimal performance variations even when handling different input features.
- Subjects
INSECT sounds; CLASSIFICATION of insects; BIOLOGICAL pest control; TIME-varying networks; DEEP learning; FORM perception; STIMULUS generalization; CELL fusion
- Publication
Applied Sciences (2076-3417), 2024, Vol 14, Issue 7, p3116
- ISSN
2076-3417
- Publication type
Article
- DOI
10.3390/app14073116