- Title
Micro video recommendation in multimodality using dual-perception and gated recurrent graph neural network.
- Authors
Patil, Swati S.; Patil, Rupali S.; Kotwal, Amina
- Abstract
With the proliferation of mobile Internet devices and the increasing speed of networks, coupled with reduced data costs, individuals now enjoy the convenience of watching films on their mobile devices at their preferred times. The widespread adoption of micro-videos has led to the emergence of numerous micro-video platforms. The growing popularity of these platforms has spurred efforts to enhance user experience through accurate and real-time recommendation algorithms. To remain competitive, platforms now rely on advanced algorithms to effectively recommend micro-videos. While algorithms based on multimodal data have been utilized to enrich item information, they often overlook user preferences for various information modalities and fail to conduct an in-depth analysis of the inherent connections within multimodal data. Consequently, this article proposes a novel framework, Dual-Perception and Multi-Resolution Graph Neural Networks (DP-MRGNN), for micro-video recommendation. The primary step in this endeavor is to jointly identify distinctive fusion patterns for each user, leveraging user-micro-video bipartite and user co-occurrence graphs. Moreover, the sheer volume of created videos renders human processing of multimedia data impractical for addressing numerous multimedia challenges. Hence, this approach proves practical for various applications, particularly with large video datasets. The study suggests employing a dual GRU neural network to encapsulate local elements within each graph and extract features signifying interactions between matched graphs. A disentangled multi-modal representation learning module is also developed to model user attention across various modalities and inductively learn multi-modal user preferences. Furthermore, a negative sampling method is implemented to ascertain modality associations and ensure that each modality contributes effectively.
Simulation experiments conducted in MATLAB show that the learned features outperform hand-crafted ones on real-world movie datasets and MovieLens. The model's feasibility and effectiveness are corroborated across multiple datasets, with higher accuracy, nDCG, and recall than traditional recommendation methods.
- Subjects
GRAPH neural networks; WIRELESS Internet; CELL fusion
- Publication
Multimedia Tools & Applications, 2024, Vol 83, Issue 17, p51559
- ISSN
1380-7501
- Publication type
Article
- DOI
10.1007/s11042-023-17093-z
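
The abstract reports nDCG and recall but does not state how they were computed. As a minimal sketch, assuming binary relevance and the common top-k definitions of both metrics, they could be calculated as follows. The function names `recall_at_k` and `ndcg_at_k` are illustrative and do not come from the article.

```python
import math


def recall_at_k(ranked, relevant, k):
    """Fraction of the relevant items that appear in the top-k of the ranking."""
    if not relevant:
        return 0.0
    hits = sum(1 for item in ranked[:k] if item in relevant)
    return hits / len(relevant)


def ndcg_at_k(ranked, relevant, k):
    """Normalized discounted cumulative gain with binary relevance (assumed)."""
    # DCG: each relevant item at 0-based position i contributes 1 / log2(i + 2).
    dcg = sum(
        1.0 / math.log2(i + 2)
        for i, item in enumerate(ranked[:k])
        if item in relevant
    )
    # Ideal DCG: all relevant items (at most k of them) placed at the top.
    ideal_hits = min(len(relevant), k)
    idcg = sum(1.0 / math.log2(i + 2) for i in range(ideal_hits))
    return dcg / idcg if idcg > 0 else 0.0


if __name__ == "__main__":
    # Hypothetical ranking of micro-video IDs for one user.
    ranked = ["a", "b", "c", "d"]
    relevant = {"a", "c"}
    print(recall_at_k(ranked, relevant, k=3))
    print(ndcg_at_k(ranked, relevant, k=3))
```

Per-user scores are typically averaged over all test users; whether the article does so is not stated in the abstract.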