Gated Multimodal Graph Learning for Personalized Recommendation

Sibei  Liu; Yuanzhe  Zhang; Xiang Li; Yunbo Liu; Chengwei Feng; Hao Yang

doi:10.65563/jeaai.v1i1.24

Authors

Sibei Liu University of Miami
Yuanzhe Zhang University of California
Xiang Li Rutgers University
Yunbo Liu Duke University
Chengwei Feng Auckland University of Technology
Hao Yang Universiti Putra Malaysia

DOI:

https://doi.org/10.65563/jeaai.v1i1.24

Keywords:

Multimodal Recommendation, Graph Neural Networks, Gated Fusion

Abstract

Multimodal recommendation has emerged as a promising solution to alleviate the cold-start and sparsity problems in collaborative filtering by incorporating rich content information, such as product images and textual descriptions. However, effectively integrating heterogeneous modalities into a unified recommendation framework remains a challenge. Existing approaches often rely on fixed fusion strategies or complex architectures , which may fail to adapt to modality quality variance or introduce unnecessary computational overhead.

In this work, we propose RLMultimodalRec, a lightweight and modular recommendation framework that combines graph-based user modeling with adaptive multimodal item encoding. The model employs a gated fusion module to dynamically balance the contribution of visual and textual modalities, enabling fine-grained and content-aware item representations. Meanwhile, a two-layer LightGCN encoder captures high-order collaborative signals by propagating embeddings over the user-item interaction graph without relying on nonlinear transformations.

We evaluate our model on a real-world dataset from the Amazon product domain. Experimental results demonstrate that RLMultimodalRec consistently outperforms several competitive baselines, including collaborative filtering, visual-aware, and multimodal GNN-based methods. The proposed approach achieves significant improvements in top-K recommendation metrics while maintaining scalability and interpretability, making it suitable for practical deployment.

Gated Multimodal Graph Learning for Personalized Recommendation

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License