We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Cross-media web video event mining based on multiple semantic-paths embedding.
- Authors
Xiao, Xia; Du, Mingyue; Xu, Shuyu; Liu, Guoying; Zhang, Chengde
- Abstract
Web video event mining based on cross-media fusion has become a research hotspot. However, each video is only described by a dozen noisy words, resulting in extremely unstable textual features. Moreover, different people might describe the same video with completely different words. Thus, the semantic association between textual and visual information would be much sparse, which brings great challenges to web video event mining based on cross-media associations. To address this issue, this paper proposes a novel framework to enrich the associations between near duplicate keyframes (NDK) and terms based on multiple semantic-paths embedding. After data preprocessing, we build a heterogeneous information network to establish associations among NDKs, terms and videos. Then, semantic-path walk strategy is designed to generate meaningful semantic-node sequences for embedding. Next, an embedding fusion method is proposed to predict the distribution characteristics of each term in NDKs. Finally, multiple correspondence analysis is used to mine web video events. Experiments on web videos from YouTube show that our proposed method performs better than several state-of-the-art baseline models, with an average F1 score improvement of 19–50%.
- Subjects
STREAMING video &; television; INFORMATION networks; VIDEOS
- Publication
Neural Computing & Applications, 2024, Vol 36, Issue 2, p667
- ISSN
0941-0643
- Publication type
Article
- DOI
10.1007/s00521-023-09050-6